Hack to Crack 1.0

Completed

AI Community Lucknow (TFUG)

5Tracks

1Problem Statement

1AI for Social Good: Empowering and Assisting Underserved Communities

Leverage advanced AI technologies to develop solutions that address the challenges faced by individuals with disabilities and underserved communities. By harnessing the power of AI, we can create tools that enhance communication, accessibility, and opportunities for these individuals, ultimately contributing to a more inclusive and equitable society.

Potential Problem Statements(Not Limited to):

Sign Language Translator:
- Develop a sign language translation system that accurately recognizes and interprets sign language gestures in real-time.
- Convert the recognized gestures into text or speech output, enabling seamless communication between hearing-impaired individuals and others.
- Explore advanced computer vision techniques, such as pose estimation and hand tracking, to capture and analyze sign language gestures accurately.

Image Transcriptors for the Visually Impaired:

Create an image-to-text transcription system that describes images or scenes in real-time for visually impaired individuals.
Utilize deep learning models, such as convolutional neural networks (CNNs) and object detection algorithms, to identify and describe the content of images accurately.
Integrate the transcription system into assistive devices or mobile applications, making it easily accessible to users.

Product Image Search and Retrieval using RAG Model:

Develop a Retrieval Augmented Generation (RAG) model that enables efficient product image search and retrieval from a database.
Train the RAG model on a large dataset of product images and their associated metadata, such as product names, descriptions, and categories.
Implement a user-friendly interface that allows users to upload product images and retrieve similar or related products from the database, enhancing the shopping experience for visually impaired individuals or those with limited access to physical stores.

Image-based other use-cases:

Resume Matching and Scoring:

Build an AI-powered tool that automatically matches resumes with job descriptions and scores them based on relevance and qualification criteria.
Utilize natural language processing (NLP) techniques to extract key information from resumes and job descriptions, such as skills, experience, and education.
Develop a scoring algorithm that assesses the compatibility between resumes and job requirements, providing recruiters with a ranked list of the most suitable candidates.

2Automated AI/ML System for Detecting and Mitigating Online Fraud 1 Problem Statement

Create and implement an AI/ML-based system that can autonomously analyze and categorize online content, distinguishing between authentic and fake/fraudulent websites, advertisements, and customer care numbers. The system aims to achieve the following:

Website Authentication: Develop algorithms to assess the legitimacy of websites based on domain, SSL certificates, and other authentication indicators.
Ad Content Analysis: Implement NLP and image recognition techniques to evaluate the authenticity and accuracy of ad content.
Real-time Detection: Enable real-time analysis of online content to prevent users from accessing fake or malicious websites.
User Feedback Integration: Incorporate mechanisms for user feedback to enhance the system's accuracy and adapt to evolving fraudulent tactics.

Problem Statements

ps41

The rapid growth of digital platforms has led to a surge in fraudulent activities, including deceptive websites, misleading ads, and scam customer care numbers. These activities pose risks such as financial losses, compromise of user data, and a decline in trust in online services. To safeguard consumers and ensure a secure digital environment, there is a pressing need for an AI/ML-based system capable of identifying and flagging fake/fraudulent online entities.

Challenges:

Escalation in the number of fraudulent websites.
Continuous creation of new fraudulent entities with similar names.
Diverse types of fraud, including deceptive shopping, job scams, etc.
Adaptation to changes made by fraudulent websites and numbers.

3AI-Powered Local Information Chatbot

Develop a chatbot that leverages local language data, such as information about local shops, hospitals, tourist attractions, and restaurants, to provide quick and accurate answers to narrow, location-specific questions.

The chatbot should eliminate the need for users to spend significant time searching through multiple websites or Google results. For example, when a user asks, "What are the hospitals near the engineering college?" the chatbot should provide a ranked list of hospitals along with additional information such as visiting hours, typical crowd levels, the best times to visit, and online registration links - all in one place.

Potential Solutions (not limited to):

Building a Retrieval Augmentation System (RAG) with local data: Utilize RAG to efficiently retrieve relevant information from a large corpus of local data.
Fine-tuning a small custom model to compress local knowledge: Develop a compact model that encapsulates a vast amount of local information, enabling quick and accurate responses.
Utilizing knowledge graphs or search APIs: Integrate knowledge graphs or search APIs to enhance the chatbot's ability to provide comprehensive and up-to-date local information.
Document Retrieval and Knowledge-Grounded Response Generation: Create AI systems that can accurately retrieve documents and generate knowledgeable responses based on the content, helping users quickly understand complex policy details or other relevant information.

4Bridging the Language Gap: AI-Powered Local Language Transcription and Translation

Develop a user-friendly audio transcription system that converts spoken local Indian languages, such as Awadhi, Kanpuriya, Brij, or Tamil, into English transcripts. This will enable people from various regions, including parents and village residents, to access and benefit from advanced AI solutions like ChatGPT. The English transcripts can then be used for further operations, such as translating the text back into standard languages or local dialects, making the information accessible to a wider audience.

Potential Problem Statements (not limited to):

Local Language Data Collection Platform:

Create a platform where local people can contribute by recording their audio in local languages along with the corresponding English transcripts.
This collected data will serve as a valuable resource for training speech recognition and translation models specific to these low-resource languages.

Speech-Based Model Exploration for Low-Resource Languages:

Utilize pre-trained models like the Whisper model available on Hugging Face as a starting point for speech recognition in low-resource languages.
Fine-tune and adapt these models using the collected local language data to improve their accuracy and performance for specific Indian languages.

Word2Vec 2.0 Training for Local Languages:

Train Word2Vec models on the collected local language data to capture the semantic relationships between words in these languages.
These trained Word2Vec models can be used to improve the accuracy of translation and facilitate better understanding of the local language content.

Transfer Learning and Model Adaptation:

Explore transfer learning techniques to leverage knowledge from well-resourced languages and adapt it to low-resource Indian languages.
Utilize pre-trained models in languages like Hindi or English and fine-tune them with the collected local language data to improve performance.

5Enhancing Cybersecurity with Large Language Models (LLMs)

Leverage the power of Large Language Models (LLMs) to strengthen the cybersecurity posture of businesses by analyzing vast amounts of cybersecurity data, detecting potential threats, and enabling proactive defense mechanisms. By harnessing the capabilities of LLMs, businesses can improve their ability to anticipate, identify, and respond to cyber threats effectively.

Malware Detection:

Train LLMs on a comprehensive dataset of known malware samples and benign software to develop a deep understanding of malicious patterns and behaviors.
Utilize the trained LLMs to analyze and classify new software samples in real-time, accurately identifying potential malware threats.
Integrate the LLM-based malware detection system into existing security infrastructure, such as antivirus software or intrusion detection systems, to enhance their effectiveness.

Phishing Detection:

Develop an LLM-based phishing detection system that analyzes images, screenshots, and email content to identify potential scams and fraudulent activities.
Train the LLMs on a dataset of known phishing examples, including visual cues, language patterns, and common tactics used by attackers.
Utilize the trained LLMs to scan incoming emails, attachments, and suspicious URLs in real-time, accurately identifying and flagging potential phishing attempts.
Integrate the phishing detection system into email clients, web browsers, or security gateways to provide an additional layer of protection against phishing attacks.

23 May

Online

4 Prizes

Hackathon Dates

Starts23 May 202410:00 AM IST

Ends25 May 202412:00 AM IST

INTERESTED MEMBERS!

... + 23 More