Intelligent Data Dictionary Agent
Link to open source: https://github.com/Samridhi1213/Intelligent-data-dictionary
Intelligent Data Dictionary Agent is an AI-powered platform that automatically connects to enterprise databases like PostgreSQL, Snowflake, and SQL Server to generate structured, business-friendly data documentation. It extracts schema metadata (tables, columns, relationships, constraints), performs real-time data quality analysis (null %, duplicates, freshness, statistical metrics), and generates AI-enhanced summaries for better business understanding.
We built this solution to solve the common enterprise challenge of outdated or missing database documentation. Our platform reduces dependency on engineers by enabling natural language interaction — users can ask questions about their schema and receive explanations or SQL suggestions instantly.
This system helps organizations improve data transparency, accelerate analytics, reduce errors, and enhance decision-making.
Technology stack
Backend: Python with FastAPI
Database: PostgreSQL
Metadata extraction via system catalogs
Data quality analysis using Pandas and NumPy
AI integration using Large Language Models
Frontend built with React.js
This build was uploaded as a hackathon project



.jpg)