Feb 21, 2026

DataLens โ€” Intelligent Data Dictionary Agent

ai data-dictionary python fastapi llm data-quality sqlite groq react schema-analysis

DataLens is a full-stack AI-powered data dictionary agent that connects to any database (SQLite, Snowflake, PostgreSQL) and automatically generates living, enriched documentation โ€” with zero manual effort.

๐Ÿ” What it does:
- Auto-extracts complete schema: tables, columns, foreign keys, indexes
- Runs quality analysis grading every table A+ to F across completeness, uniqueness, and freshness
- AI enrichment via Groq (Llama 3.3 70B) writes business-ready descriptions, domain tags, and usage recommendations for every table and column
- Interactive D3 force-directed lineage graph visualises all FK relationships
- Natural language AI Chat โ€” ask questions in plain English, get SQL + answers instantly
- One-click Query Runner executes AI-generated SQL against real data
- โŒ˜K semantic search finds any table or column by business meaning
- Exports a complete data dictionary as Markdown or JSON

๐Ÿ›  Built with: FastAPI ยท React ยท D3.js ยท Groq API ยท SQLite/Snowflake/PostgreSQL
๐Ÿ“Š Demo dataset: Olist Brazilian E-Commerce (9 tables, 569,774 rows) โ€” Quality Score 97/100, Grade A+, fully enriched in under 2 minutes on free-tier AI

This build was uploaded as a hackathon project

Hackathon

HackFest 2.0

View All Projects

1

Give a star to encourage!Discussion
Start a new conversation!
Login to join the discussion
Updates
  • undefined
    Saturday, Feb 21st, 2026
  • undefined
    Sunday, Feb 22nd, 2026