✨ Features Overview

Indexly helps researchers and power users search Word, PDF, and text documents locally. Fast, offline, with tagging and FTS5.

Indexly is a lightweight, modular document indexing + search engine. Here’s the latest overview:


  • Full-text search (FTS5) across content and metadata
  • Boolean operators: AND, OR, NOT, NEAR
  • Phrase search with quotes search "term"
  • Fuzzy search support via SQLite extensions
  • Smart ranking and scoring

Search demo placeholder


Tag Detection

  • Extracts custom tags from document content
  • Regex-based virtual tag matcher (fts_core.py)
  • Works with .pdf, .docx, .eml, .msg, .txt, .md, .xlsx
  • CLI previews via tag list command

Tags placeholder


Caching System

  • Smart result caching for repeat searches
  • Auto refresh if documents change
  • Control via --no-cache

CSV Analyzer

CSV analysis placeholder


Supported Formats

.pdf, .docx, .xlsx, .csv, .msg, .eml, .md, .txt, .json, .xml, .epub, .pptx, .odt, HTML, JS, CSS, Python, Logs, images (.jpg, .png, .tiff, .bmp)


CLI & Extensibility

  • Modular CLI in cli_utils.py
  • Scriptable, clear logging system
  • Fully open-source

Metadata Indexing

  • Extracts title, author, subject, dates
  • PDF and Office metadata
  • Image EXIF metadata

metadata-indexing


Developer Focus

  • Extend tag_fields in fts_core.py
  • Modify filetype_utils.py to support new formats

Docs