✨ Features Overview
Indexly is a lightweight, modular document indexing and search engine. It helps researchers and power users search Word, PDF, and text documents locally: fast, fully offline, with tagging and SQLite FTS5 full-text search. Here’s the latest overview:
Search
- Full-text search (FTS5) across content and metadata
- Boolean operators: `AND`, `OR`, `NOT`, `NEAR`
- Phrase search with quotes: `search "term"`
- Fuzzy search support via SQLite extensions
- Smart ranking and scoring
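
To make the query syntax above concrete, here is a minimal sketch that uses Python's built-in `sqlite3` module with an FTS5 table. It is not Indexly's actual schema or code: the `docs` table, its columns, and the `search` helper are assumptions made for illustration, and it requires a Python build whose SQLite includes FTS5.

```python
import sqlite3

# Hypothetical in-memory index; table and column names are illustrative only.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE VIRTUAL TABLE docs USING fts5(path, content)")
conn.executemany(
    "INSERT INTO docs (path, content) VALUES (?, ?)",
    [
        ("notes/a.txt", "quarterly budget report for the research team"),
        ("notes/b.txt", "research notes on climate models"),
        ("notes/c.txt", "team lunch menu"),
    ],
)


def search(query):
    """Return matching paths, best match first (FTS5's built-in rank)."""
    rows = conn.execute(
        "SELECT path FROM docs WHERE docs MATCH ? ORDER BY rank", (query,)
    )
    return [path for (path,) in rows]


print(search("research AND team"))       # both terms must appear
print(search("research NOT budget"))     # first term without the second
print(search('"budget report"'))         # exact phrase
print(search("NEAR(research team, 3)"))  # terms within 3 tokens of each other
```

`ORDER BY rank` relies on FTS5's default BM25 scoring; whether Indexly's "smart ranking" works the same way is not stated here.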
Tag Detection
- Extracts custom tags from document content
- Regex-based virtual tag matcher (`fts_core.py`)
- Works with `.pdf`, `.docx`, `.eml`, `.msg`, `.txt`, `.md`, `.xlsx`
- CLI previews via the `tag list` command
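
As a rough illustration of regex-based tag extraction, the sketch below pulls tags from a `Tags: ...` line in document text. The tag-line convention, the `TAG_LINE` pattern, and the `extract_tags` function are assumptions for this example; the real matcher in `fts_core.py` may use different patterns.

```python
import re

# Assumed convention for this sketch: a line such as "Tags: finance, Q3 report".
TAG_LINE = re.compile(r"^tags?:[ \t]*(.+)$", re.IGNORECASE | re.MULTILINE)


def extract_tags(text):
    """Collect lower-cased, de-duplicated tags in first-seen order."""
    tags = []
    for match in TAG_LINE.finditer(text):
        for raw in match.group(1).split(","):
            tag = raw.strip().lower()
            if tag and tag not in tags:
                tags.append(tag)
    return tags


sample = "Meeting notes\nTags: Finance, Q3 report\nBody text\ntag: finance, audit\n"
print(extract_tags(sample))  # ['finance', 'q3 report', 'audit']
```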
Caching System
- Smart result caching for repeat searches
- Auto refresh if documents change
- Control via `--no-cache`
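
One plausible way to get "cache repeat searches, refresh when documents change" is to key cached results on the query plus a fingerprint of the searched files. The sketch below shows that idea only; `fingerprint`, `cached_search`, and the `use_cache` parameter (standing in for `--no-cache`) are hypothetical names, not Indexly's implementation.

```python
import os
import tempfile

_cache = {}  # (query, fingerprint) -> results


def fingerprint(paths):
    """Summarize file state; a change in size or mtime yields a new value."""
    return tuple(
        (p, os.stat(p).st_mtime_ns, os.stat(p).st_size) for p in sorted(paths)
    )


def cached_search(query, paths, run_search, use_cache=True):
    """Reuse a stored result unless the files changed or caching is disabled."""
    key = (query, fingerprint(paths))
    if use_cache and key in _cache:
        return _cache[key]
    results = run_search(query, paths)
    _cache[key] = results
    return results


calls = []  # records every real (uncached) search


def naive_search(query, paths):
    calls.append(query)
    hits = []
    for p in paths:
        with open(p, encoding="utf-8") as fh:
            if query in fh.read():
                hits.append(p)
    return hits


with tempfile.TemporaryDirectory() as tmp:
    doc = os.path.join(tmp, "a.txt")
    with open(doc, "w", encoding="utf-8") as fh:
        fh.write("alpha beta")
    cached_search("alpha", [doc], naive_search)
    cached_search("alpha", [doc], naive_search)  # served from cache
    with open(doc, "a", encoding="utf-8") as fh:
        fh.write(" gamma")  # size changes, so the cache key changes
    cached_search("alpha", [doc], naive_search)
    cached_search("alpha", [doc], naive_search, use_cache=False)  # bypass

print(len(calls))  # 3 real searches out of 4 calls
```

Entries for outdated fingerprints are never evicted in this sketch; a real cache would also need a size or age limit.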
CSV Analyzer
- Auto-detects delimiters
- Computes mean, median, stddev, IQR
- Value counts for categorical data
- Outputs in Markdown or TXT
- Cleaning CSV Data →
- Analyze CSV →
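
The analysis steps listed above can be sketched with the Python standard library alone. This is an illustration under assumptions, not Indexly's analyzer: the sample data and variable names are made up, and the IQR uses the default "exclusive" method of `statistics.quantiles`, which may differ from the tool's own definition.

```python
import csv
import io
import statistics
from collections import Counter

# Made-up sample using a semicolon delimiter.
raw = "name;dept;score\nAna;ops;10\nBo;ops;20\nCy;dev;30\nDee;dev;40\n"

# Auto-detect the delimiter from a set of common candidates.
dialect = csv.Sniffer().sniff(raw, delimiters=",;\t|")
rows = list(csv.DictReader(io.StringIO(raw), dialect=dialect))

# Numeric column: mean, median, sample standard deviation, IQR.
scores = [float(r["score"]) for r in rows]
q1, _, q3 = statistics.quantiles(scores, n=4)
summary = {
    "mean": statistics.mean(scores),
    "median": statistics.median(scores),
    "stddev": statistics.stdev(scores),
    "iqr": q3 - q1,
}

# Categorical column: value counts.
dept_counts = Counter(r["dept"] for r in rows)

# Emit the numeric summary as a small Markdown table.
print("| stat | value |")
print("| --- | --- |")
for name, value in summary.items():
    print(f"| {name} | {value:.2f} |")
print(dict(dept_counts))
```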
Supported Formats
.pdf, .docx, .xlsx, .csv, .msg, .eml, .md, .txt, .json, .xml, .epub, .pptx, .odt, HTML, JS, CSS, Python, Logs, images (.jpg, .png, .tiff, .bmp)
CLI & Extensibility
- Modular CLI in `cli_utils.py`
- Scriptable, clear logging system
- Fully open-source
Metadata Indexing
- Extracts title, author, subject, dates
- PDF and Office metadata
- Image EXIF metadata
Developer Focus
- Extend `tag_fields` in `fts_core.py`
- Modify `filetype_utils.py` to support new formats



