✨ Features Overview
Indexly helps researchers and power users search Word, PDF, and text documents locally. It is fast, works offline, and supports tagging and SQLite FTS5 full-text search.
Indexly is a lightweight, modular document indexing and search engine. Here’s the latest overview:
Search
- Full-text search (FTS5) across content and metadata
- Boolean operators: AND, OR, NOT, NEAR
- Phrase search with quotes: search "term"
- Fuzzy search support via SQLite extensions
- Smart ranking and scoring
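
The operators listed above are standard FTS5 query syntax, so they can be tried directly against SQLite. The following is a minimal sketch, not Indexly's own code: the table name `docs`, its columns, and the sample rows are assumptions made for illustration, and it requires a Python build whose SQLite includes FTS5.

```python
import sqlite3

# In-memory database with a hypothetical two-column FTS5 table.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE VIRTUAL TABLE docs USING fts5(path, content)")
conn.executemany(
    "INSERT INTO docs (path, content) VALUES (?, ?)",
    [
        ("notes/a.txt", "quarterly budget report prepared last spring for the whole research team"),
        ("notes/b.txt", "research notes on budget planning and travel"),
        ("notes/c.txt", "travel itinerary and hotel booking"),
    ],
)


def search(query):
    """Return matching paths; ORDER BY rank sorts by FTS5's built-in bm25 score."""
    rows = conn.execute(
        "SELECT path FROM docs WHERE docs MATCH ? ORDER BY rank", (query,)
    )
    return [path for (path,) in rows]


both = search("budget AND travel")          # both terms must occur
either = search("report OR itinerary")      # either term may occur
excluded = search("budget NOT travel")      # first term without the second
phrase = search('"budget report"')          # exact phrase, in this order
near = search("NEAR(research budget, 5)")   # terms at most 5 tokens apart
```

Passing the query as a bound parameter keeps user input out of the SQL string, although the query text itself must still be valid FTS5 syntax.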
Tag Detection
- Extracts custom tags from document content
- Regex-based virtual tag matcher (fts_core.py)
- Works with .pdf, .docx, .eml, .msg, .txt, .md, .xlsx
- CLI previews via tag list command
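
To illustrate what regex-based tag extraction can look like, here is a small sketch. The two conventions it recognizes (a `Tags:` line and inline `#hashtags`) and the function name `extract_tags` are assumptions for this example; the patterns actually used in fts_core.py may differ.

```python
import re

# Hypothetical tag conventions, not taken from Indexly:
# a "Tags: a, b" line and inline #hashtags.
TAG_LINE = re.compile(r"^tags:\s*(.+)$", re.IGNORECASE | re.MULTILINE)
HASHTAG = re.compile(r"(?<!\w)#([A-Za-z][\w-]*)")


def extract_tags(text):
    """Collect tags from both conventions, lowercased and de-duplicated."""
    tags = set()
    for line in TAG_LINE.findall(text):
        tags.update(t.strip().lower() for t in line.split(",") if t.strip())
    tags.update(t.lower() for t in HASHTAG.findall(text))
    return sorted(tags)


sample = "Project kickoff\nTags: Finance, Q3 Review\nSee #budget and #Finance notes."
tags = extract_tags(sample)
print(tags)  # ['budget', 'finance', 'q3 review']
```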
Caching System
- Smart result caching for repeat searches
- Auto refresh if documents change
- Control via --no-cache
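
One way to get "cache repeat searches, refresh when documents change" is to key the cache on the query and discard it whenever a snapshot of the indexed files changes. The sketch below shows that idea under stated assumptions: the class `SearchCache`, the `use_cache` flag (standing in for the effect of --no-cache), and the snapshot strategy are all hypothetical and not Indexly's implementation.

```python
import os
import tempfile


class SearchCache:
    """Hypothetical query cache that resets when files under root change."""

    def __init__(self, root):
        self.root = root
        self._fingerprint = None
        self._results = {}

    def _snapshot(self):
        # Path, modification time, and size of every file under root.
        snap = []
        for dirpath, _dirs, files in os.walk(self.root):
            for name in files:
                path = os.path.join(dirpath, name)
                st = os.stat(path)
                snap.append((path, st.st_mtime_ns, st.st_size))
        return tuple(sorted(snap))

    def get(self, query, run_search, use_cache=True):
        current = self._snapshot()
        if current != self._fingerprint:
            # Documents changed since the last call: drop stale results.
            self._results.clear()
            self._fingerprint = current
        if use_cache and query in self._results:
            return self._results[query]
        result = run_search(query)
        self._results[query] = result
        return result


with tempfile.TemporaryDirectory() as root:
    doc = os.path.join(root, "a.txt")
    with open(doc, "w") as f:
        f.write("one")

    calls = []

    def fake_search(query):
        calls.append(query)
        return [query.upper()]

    cache = SearchCache(root)
    cache.get("x", fake_search)
    cache.get("x", fake_search)                   # served from the cache
    hits_after_repeat = len(calls)

    with open(doc, "w") as f:
        f.write("one two")                        # size changes, so the snapshot changes
    cache.get("x", fake_search)                   # cache was cleared, search runs again
    hits_after_edit = len(calls)

    cache.get("x", fake_search, use_cache=False)  # explicit bypass
    hits_after_bypass = len(calls)
```

Walking the tree on every lookup is simple but costs one `stat` call per file; a real tool would more likely store the fingerprint alongside its index.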
CSV Analyzer
- Auto-detects delimiters
- Computes mean, median, stddev, IQR
- Value counts for categorical data
- Outputs in Markdown or TXT
- Supports both cleaning and analyzing CSV data
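
The statistics listed above can be reproduced with the Python standard library alone. This is a rough sketch rather than Indexly's analyzer: the function name `analyze_csv`, the candidate delimiter set, and the rule "a column is numeric if every value parses as a float" are assumptions, and the IQR uses the default (exclusive) method of `statistics.quantiles`, which may not match Indexly's numbers exactly.

```python
import csv
import io
import statistics
from collections import Counter


def analyze_csv(text):
    """Detect the delimiter, then summarize each column of a small CSV."""
    dialect = csv.Sniffer().sniff(text, delimiters=",;\t|")
    rows = list(csv.DictReader(io.StringIO(text), dialect=dialect))
    report = {}
    for column in rows[0].keys():
        values = [row[column] for row in rows]
        try:
            numbers = [float(v) for v in values]
        except ValueError:
            # Not every value is numeric: treat the column as categorical.
            report[column] = {"value_counts": dict(Counter(values))}
            continue
        q1, _q2, q3 = statistics.quantiles(numbers, n=4)
        report[column] = {
            "mean": statistics.mean(numbers),
            "median": statistics.median(numbers),
            "stddev": statistics.stdev(numbers),  # sample standard deviation
            "iqr": q3 - q1,
        }
    return dialect.delimiter, report


sample = "city;temp\nOslo;10\nRome;20\nOslo;30\nRome;40\n"
delimiter, report = analyze_csv(sample)
```

For this sample the detected delimiter is `;`, the `temp` column has a mean and median of 25.0, and `city` is reported as value counts.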
Supported Formats
.pdf, .docx, .xlsx, .csv, .msg, .eml, .md, .txt, .json, .xml, .epub, .pptx, .odt, HTML, JS, CSS, Python, Logs, images (.jpg, .png, .tiff, .bmp)
CLI & Extensibility
- Modular CLI in cli_utils.py
- Scriptable, clear logging system
- Fully open-source
Metadata Indexing
- Extracts title, author, subject, dates
- PDF and Office metadata
- Image EXIF metadata
Developer Focus
- Extend tag_fields in fts_core.py
- Modify filetype_utils.py to support new formats
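
Adding a format usually comes down to mapping a file extension to a function that returns text. The sketch below shows one common way to structure that, purely as an illustration: `EXTRACTORS`, `register`, and `extract_text` are hypothetical names, and the real layout of filetype_utils.py may be quite different.

```python
import tempfile
from pathlib import Path

# Hypothetical registry: file extension -> text extractor function.
EXTRACTORS = {}


def register(*extensions):
    """Decorator that registers an extractor for one or more extensions."""
    def decorator(func):
        for ext in extensions:
            EXTRACTORS[ext.lower()] = func
        return func
    return decorator


@register(".txt", ".md")
def extract_plain_text(path):
    return Path(path).read_text(encoding="utf-8", errors="replace")


def extract_text(path):
    """Dispatch on the file suffix; unknown formats are rejected explicitly."""
    extractor = EXTRACTORS.get(Path(path).suffix.lower())
    if extractor is None:
        raise ValueError(f"unsupported file type: {path}")
    return extractor(path)


with tempfile.TemporaryDirectory() as root:
    note = Path(root) / "note.md"
    note.write_text("hello index", encoding="utf-8")
    text = extract_text(note)
    try:
        extract_text(Path(root) / "archive.xyz")
        unsupported_rejected = False
    except ValueError:
        unsupported_rejected = True
```

With this layout, supporting another format means writing one function and decorating it with the new extension, without touching the dispatch code.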



