Database Design

Database responsibilities by design

Indexly’s database schema is intentionally simple and stable. Semantic intelligence lives in the indexing layer — not the schema.

No schema changes were required to introduce semantic filtering.

clean_content stores only human text, aggressively filtered.

This enables future features such as:

At the moment, all searches still use content, ensuring backward compatibility.

Metadata is stored in two forms:

Only semantic metadata (Tier 2) is ever converted into FTS tokens.

file_metadata
├─ structured columns
│  ├─ title
│  ├─ author
│  ├─ camera
│  └─ …
└─ metadata (JSON)

Technical metadata remains queryable without polluting search results.

This design provides:

Semantic Indexing & Vocabulary Quality The technical model, measured results, and why a database update is required.