Intelligent Data Dictionary Agent – AI-Powered Schema & Insight Engine
Link to open source: https://github.com/Ayati-Jain/intelligent-data-dictionary-agent
Overview
The Intelligent Data Dictionary Agent is an AI-assisted system that analyzes structured datasets (Excel or SQL databases) and automatically generates business-level explanations from technical schema metadata.
The goal is to bridge the gap between database structures and non-technical stakeholders by converting tables, columns, keys, and data statistics into understandable insights.
Problem Statement Alignment
This project addresses Problem Statement 1: Data Dictionary Agent from HackFest 2.0.
The system is designed to:
• Extract tables, columns, and key metadata
• Perform automated data profiling (null detection, data types, statistics)
• Interpret schema structure in business context
• Provide a chat-style interface for interactive dataset querying
Architecture Flow
-
User selects data source (Excel upload or SQL database connection
-
Backend extracts schema metadata
-
Data profiling is performed
-
Structured metadata is passed to an AI interpretation module
-
Contextual business insights are generated
-
User interacts via chat interface
Current Implementation
• Backend built in Python
• SQLite database integration
• Dashboard-based UI prototype
• Modular architecture (Schema Extraction + Data Profiling + AI Layer)
The system is currently implemented using SQLite for lightweight deployment and easy evaluation. It is extendable to MySQL/PostgreSQL.
Future Enhancements
• Foreign key detection
• ER diagram auto-generation
• Natural language to SQL conversion
• Advanced AI reasoning layer
This build was uploaded as a hackathon project




