Index Files and Folders with Indexly
Getting Started with Indexing
To begin, index a folder once to make its contents searchable. The later steps on this page (filtering, watching, and statistics) all build on this index.
indexly index ./docs
Filtering by File Type
Index one file type at a time when you want a smaller, targeted refresh.
indexly index ./docs --filetype .pdf
indexly index ./docs --filetype .docx
When you search later, you can filter on several file types at once with --filetype .pdf .docx.
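The per-type refresh above can be wrapped in a small shell loop. This is a minimal sketch under stated assumptions: `index_by_type` and the `DRY_RUN` variable are hypothetical helpers introduced here for illustration and are not part of Indexly. Only `indexly index` and `--filetype` come from the commands shown above.

```shell
# Hypothetical helper (not part of Indexly): run one targeted refresh per
# file extension. With DRY_RUN=1 the commands are printed instead of executed.
index_by_type() {
  dir="$1"
  shift
  for ext in "$@"; do
    if [ "${DRY_RUN:-0}" = "1" ]; then
      # Print the command that would run, one per line.
      printf 'indexly index %s --filetype %s\n' "$dir" "$ext"
    else
      # Stop at the first failing refresh so errors are not silently skipped.
      indexly index "$dir" --filetype "$ext" || return 1
    fi
  done
}
```

For example, `DRY_RUN=1 index_by_type ./docs .pdf .docx` prints the two commands from this section without running them, which is a cheap way to check the loop before touching the index.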
Advanced Extraction
For more detailed content extraction, install the document extras and choose the PDF OCR behavior explicitly.
python -m pip install "indexly[documents]"
indexly index ./docs --ocr
indexly index ./docs --no-ocr
--ocr forces OCR for PDFs. --no-ocr disables OCR for PDFs. Without either flag, Indexly uses the default PDF extraction policy.
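The three OCR choices described above can be captured in one place so that scripts pick a mode by name. This is a sketch only: `ocr_flag` and the mode names `force`, `off`, and `default` are hypothetical and introduced here for illustration. The `--ocr` and `--no-ocr` flags are the ones documented above.

```shell
# Hypothetical helper (not part of Indexly): map a mode name to the PDF OCR
# flag described above.
#   force   -> --ocr      (force OCR for PDFs)
#   off     -> --no-ocr   (disable OCR for PDFs)
#   default -> no flag    (Indexly's default PDF extraction policy)
ocr_flag() {
  case "${1:-default}" in
    force)   echo "--ocr" ;;
    off)     echo "--no-ocr" ;;
    default) ;;  # print nothing so no flag is passed
    *)       echo "unknown OCR mode: $1" >&2; return 2 ;;
  esac
}
```

A call such as `indexly index ./docs $(ocr_flag force)` then expands to the forced-OCR command. The substitution is left unquoted on purpose so that the `default` mode contributes no argument at all.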
You can also enable extended MTW extraction when working with Minitab archives or complex MTW inputs.
indexly index ./archives --mtw-extended
Keeping Your Index Updated
Once your initial index is set up, use the watch feature to keep it current. Created, modified, and deleted files are then handled automatically.
indexly watch ./docs
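The two steps, an initial index followed by watching, can be chained so that watching only starts if indexing succeeded. This is a minimal sketch: `index_then_watch` and the `RUN` variable are hypothetical names introduced here, not Indexly features. It assumes only the `indexly index` and `indexly watch` commands shown on this page.

```shell
# Hypothetical wrapper (not part of Indexly): build the index once, then keep
# it current. Set RUN=echo to print the commands instead of executing them.
index_then_watch() {
  dir="${1:-./docs}"
  # "&&" ensures watch only starts if the initial index succeeded.
  ${RUN:-} indexly index "$dir" && ${RUN:-} indexly watch "$dir"
}
```

Running `RUN=echo index_then_watch ./docs` prints both commands in order. Leaving `RUN` unset executes them for real.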
Quick Tip
You can also view database statistics for a quick overview of your index. The output covers indexed files, tagged files, untagged files, tag coverage, database size, unique tags, total tag assignments, and top tags.
indexly stats
Next Steps
For a deeper dive into how this process works, check out Semantic Indexing.