Minitab MTW files
Categories:
Extract-MTW Command — Minitab File Extraction Made Simple
The extract-mtw command allows Indexly to directly read, extract, and process Minitab MTW files.
With it, you can access embedded worksheet data, decode textual content, and optionally extract extended metadata from internal WorksheetInfo sections — all while keeping the generated files readable.
This feature bridges the gap between data science tools and practical search/indexing pipelines, making .mtw files fully searchable, indexable, and analyzable.
Key Features
- 🧮 Native MTW support for
.mtw(Minitab) project files - 🧩 Automatic stream decoding — detects worksheet streams, text notes, and binary fallbacks
- 🧮 Cleaner worksheet CSVs — extracts contiguous numeric worksheet columns instead of dumping raw binary text
- 📝 Notes files — preserves readable worksheet descriptions separately from numeric data
- 🗂 Optional WorksheetInfo extraction for diagnostic metadata (
--mtw-extended) - 🧹 Intelligent cleaning removes control characters and internal stream markers
- 💾 CSV output for decoded worksheets
- ⚙️ Graceful fallback for compressed, unknown, or binary-only streams
- 📑 Metadata storage integration with Indexly’s core database
- 🚀 Resource-aware mode — extended extraction only triggered when explicitly requested
Getting Started
Before you begin, make sure your Indexly installation is updated to version 1.0.3 or higher.
To view all options and parameters for this feature, run:
indexly extract-mtw --help
This displays all available flags, including the --mtw-extended option and output configuration parameters.
Basic Extraction
To extract all readable worksheets and streams from a Minitab file, simply run:
indexly extract-mtw path/to/datafile.mtw --output ./mtw-output
After processing, Indexly generates clean .csv, .txt, or .bin outputs in the selected output directory.
Example output:
datafile_worksheet.csv
datafile_worksheet_notes.txt
The worksheet CSV contains decoded numeric columns. The notes file contains readable descriptions, citations, or column explanations found in the MTW stream. Binary or non-decodable data is safely written as .bin files for later inspection.
Extended WorksheetInfo Extraction
Some .mtw files include a hidden section called WorksheetInfo — containing detailed metadata about how the worksheet was created, formatted, or analyzed.
This data can be large and is not always necessary, so Indexly keeps it optional.
To enable it, simply pass the --mtw-extended flag:
indexly extract-mtw --mtw-extended path/to/statistics-report.mtw
You may then see additional output files such as:
statistics-report_worksheetinfo.csv
statistics-report_worksheetinfo.bin
Readable WorksheetInfo content is cleaned and saved as text-like CSV output. Unreadable WorksheetInfo or compressed streams are preserved as .bin files instead of being forced into noisy text.
Text Cleaning and Normalization
When the extractor encounters readable worksheet text or WorksheetInfo, Indexly applies a consistent cleaning process:
- Removes control symbols and internal stream markers
- Normalizes spacing and punctuation
- Preserves readable sections like source notes, titles, timestamps, or column descriptions
- Writes notes separately from worksheet numeric data
This keeps worksheet CSVs lightweight, structured, and easy to inspect or analyze.
For example, a raw block like:
G,@,@ j Data from Iceland in figures 1 9 9 9 - 2 0 0 0
is automatically cleaned and saved as:
Data from Iceland in figures 1999 - 2000
Output Structure
After extraction, Indexly organizes generated files in the selected output directory. Depending on your flags and file content, you might see outputs such as:
| Output Type | Example Filename | Description |
|---|---|---|
| Worksheet CSV | analysis_worksheet.csv |
Decoded numeric worksheet columns |
| Notes text | analysis_worksheet_notes.txt |
Readable worksheet descriptions/notes |
| WorksheetInfo CSV | analysis_worksheetinfo.csv |
Cleaned extended metadata, when text |
| Binary file | analysis_worksheetdata.bin |
Raw binary stream fallback |
All generated files are normalized paths to ensure cross-platform compatibility and consistency.
Performance & Resource Control
Extracting diagnostic streams can be resource-intensive for large MTW archives.
That’s why unreadable WorksheetInfo and binary fallback streams are most useful when --mtw-extended is used.
If omitted, Indexly will skip WorksheetInfo processing — improving speed while still extracting worksheet data and text streams.
Example: Full Workflow
Here’s a complete example combining extraction and metadata generation:
indexly extract-mtw --mtw-extended ./datasets/lab-results.mtw
Output:
lab-results_worksheet.csv
lab-results_worksheet_notes.txt
lab-results_worksheetinfo.csv
lab-results_worksheetdata.bin
Tips & Best Practices
-
Start without
--mtw-extendedif processing many files at once. -
Use the flag only for deep data inspection or metadata analysis.
-
Review extracted worksheet CSVs with a spreadsheet editor or pandas for quick inspection.
-
Use notes files to understand source descriptions and column meaning.
-
Treat
.binfiles as diagnostic fallbacks for compressed/newer MTW layouts. -
Combine with Indexly’s
analyze-csvto summarize and visualize extracted data. -
When unsure about parameters, use:
indexly extract-mtw --help
See Also
analyze-csv— generate statistics from CSV or worksheet dataindex— index newly extracted data into the FTS5 databasesearch— perform fast text or metadata queries