Aadya Madankar

AI Engineer

Great code doesn’t just execute commands.

It learns, adapts, and creates solutions.

"Work that taught me most."

AI-Associate

Designed and deployed a production-ready voice assistant serving 30+ Indian languages with real-time multimodal processing.

Provides accessible voice technology to underserved linguistic communities with intelligent routing, multimodal processing, and culturally aware responses.

Published research in IEEE OTCON 2025, demonstrating scalable architecture for culturally inclusive conversational AI.

Saahayak

Saahayak is an AI-powered teaching assistant designed to support educators managing multi-grade classrooms. Built with Google Gemini and Genkit, it generates hyper-localized content, differentiated worksheets, visual aids, and lesson plans—all from text, voice, or image inputs in 25+ Indian languages including Hindi, Marathi, and more.

Built to address the unique challenges of rural Indian schools where one teacher manages multiple grades with limited resources. This project demonstrates how practical AI can save educators hours of preparation time while maintaining the human touch that makes great teaching possible.

Custom SLM: Personalized AI Assistant

I'm training a Small Language Model on my experiences, knowledge, and problem-solving patterns—creating an AI version of how I think, code, and approach challenges.

This isn't about replacing myself; it's about capturing my expertise in a model that can assist others even when I'm not available.

This project bridges AI engineering with self-documentation, demonstrating both technical capability and philosophical understanding.

It's a living portfolio that evolves with me, learns from my work, and serves as an accessible interface to my knowledge—showing how SLMs can be personalized tools, not just generic assistants.

The world is one big data problem.

Tools and frameworks I work with daily.

Credentials & Recognition

The AI Ladder Framework (IBM): 1
Intro to TensorFlow for AI (DeepLearning.AI): 1
Published Research Paper (IEEE): 1
Open Source Contributions (GitHub): 2
AI/ML Projects: 3+
Total Certifications: 4

I build solutions.

Featured Projects

Multimodal PDF Assistant

This project is a sophisticated RAG-based system designed to answer queries from PDF documents by leveraging both text and images. It uses LangChain and Google Gemini Pro to provide accurate, context-aware responses.
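The retrieval step behind a RAG system like this can be illustrated with a minimal sketch. It is not the project's code: it uses only the Python standard library instead of LangChain and Gemini, scores chunks with a simple bag-of-words cosine similarity rather than learned embeddings, and assumes image content has already been turned into text captions. The names `tokenize`, `retrieve`, and `build_prompt` are hypothetical.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def cosine(a, b):
    """Cosine similarity between two token-count vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def retrieve(query, chunks, k=2):
    """Return up to k chunks that overlap most with the query."""
    q = Counter(tokenize(query))
    scored = [(cosine(q, Counter(tokenize(c))), c) for c in chunks]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [chunk for score, chunk in scored[:k] if score > 0]


def build_prompt(query, chunks, k=2):
    """Assemble the retrieved context and the question into one prompt.

    In a real system this prompt would be sent to an LLM; that call is
    deliberately left out of this sketch.
    """
    context = "\n".join(f"- {c}" for c in retrieve(query, chunks, k))
    return f"Answer using only this context:\n{context}\n\nQuestion: {query}"


if __name__ == "__main__":
    # Hypothetical chunks: two text passages and one image caption.
    pdf_chunks = [
        "Invoices are due within 30 days.",
        "Figure 2 shows quarterly revenue growth.",
        "The appendix lists all suppliers.",
    ]
    print(build_prompt("What does the revenue figure show?", pdf_chunks))
```

A production pipeline would typically replace the word-count scoring with embedding vectors stored in a vector database, but the flow stays the same: retrieve the most relevant chunks, then ground the model's answer in them.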

View Project on GitHub
TECHNOLOGIES USED
LangChain
Google Gemini Pro
Streamlit
RAG
Ollama_UI

This project demonstrates how to run and manage models locally using Ollama by creating an interactive UI with Streamlit.
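As a rough sketch of what talking to a local Ollama server can look like, the snippet below builds a request for Ollama's `/api/generate` REST endpoint using only the standard library. It is not taken from the project: the Streamlit UI is omitted, the URL assumes Ollama's default local port, and the model name `llama3` is a placeholder for whichever model has been pulled locally.

```python
import json
import urllib.request

# Assumption: Ollama is running locally on its default port.
OLLAMA_URL = "http://localhost:11434/api/generate"


def build_request(model, prompt):
    """Build a non-streaming generation request for a local Ollama server."""
    payload = {"model": model, "prompt": prompt, "stream": False}
    return urllib.request.Request(
        OLLAMA_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def generate(model, prompt, timeout=60):
    """Send the request and return the generated text.

    Requires a running Ollama server with the given model available.
    """
    with urllib.request.urlopen(build_request(model, prompt), timeout=timeout) as resp:
        return json.loads(resp.read())["response"]


if __name__ == "__main__":
    # "llama3" is a placeholder model name.
    print(generate("llama3", "Explain what a vector store is in one sentence."))
```

In a Streamlit app, a function like `generate` would usually be called from a text input or chat widget callback, with the result written back to the page.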

View Project on GitHub
TECHNOLOGIES USED
Ollama
Streamlit
Python
OpenAI
Project-Generator

Ever found yourself staring at a blank screen, desperate for a data project idea that actually shows off your skills? Project-Generator is a tool built for data folks like us (analysts, scientists, engineers) who want project ideas that feel fresh, relevant, and fun. Using a bit of AI magic, it takes your job title, favorite tools, and industry, then whips up personalized project suggestions complete with details, timelines, and skills graphs to help you bring them to life.

View Project on GitHub
TECHNOLOGIES USED
Google Gemini
Streamlit
Python
Pandas
Matplotlib
Plotly
Multi-Language Invoice Generator using Gemini Vision Model

This project uses the Gemini Vision Model to build a system for generating invoices in multiple languages. Whether you're managing international transactions or need versatile language support, the application combines multilingual output with state-of-the-art computer vision capabilities.

View Project on GitHub
TECHNOLOGIES USED
Google Gemini
Streamlit
Python
LangChain
PyPDF2
ChromaDB
FAISS
Multi-Modal-Screen-Assistant

A comprehensive AI-powered desktop assistant that combines multiple AI models to create an intelligent programming companion. This tool integrates visual processing, text analysis, and voice interaction to enhance programmer productivity through an intuitive interface.

View Project on GitHub
TECHNOLOGIES USED
OpenAI Whisper
Google Gemini
Groq
PyAudio
Pillow
Pyperclip
RAG Notebooks - Advanced Retrieval-Augmented Generation Systems

This repository is a comprehensive, regularly updated hub showcasing the latest retrieval-augmented generation (RAG) systems, with a focus on reducing errors and improving processing speed and contextual richness. It is a collection of curated Jupyter notebooks for learning and teaching RAG concepts, aimed at anyone who wants to master RAG. Whether you are trying retrieval-based approaches to improve AI models or are simply a fan of advanced RAG methods, this repository is for you.

View Project on GitHub
TECHNOLOGIES USED
Retrieval-Augmented Generation (RAG)
Python
LlamaIndex
VectorStore
OpenAI
Gemini
Cohere
DALL-E, ElevenLabs
Crawl
Hugging Face
Perplexity
JSON
Web-Search
Advance-Rag-with-Langchain

This project is a comprehensive exploration of advanced techniques in building chatbots using LangChain, a powerful framework for developing applications with large language models (LLMs).

View Project on GitHub
TECHNOLOGIES USED
OpenAI
Groq
Streamlit
Python
LangChain
LangServe
BeautifulSoup
ChromaDB
Wikipedia
arXiv
Sentence Transformers
PyPDF2
Crew-AI

A Python-based multi-agent AI system built with the CrewAI framework for orchestrating collaborative autonomous agents that work together to solve complex tasks.

View Project on GitHub
TECHNOLOGIES USED
CrewAI
Hugging Face
Python
NVIDIA Model with Langserve

Deploys NVIDIA's GPU-accelerated AI models as an API using LangServe.

View Project on GitHub
TECHNOLOGIES USED
NVIDIA AI Models
LangChain
LangServe
Python
Streamlit
Object Tracking

This project contains a Python script for real-time object tracking using OpenCV.

View Project on GitHub
TECHNOLOGIES USED
Google Gemini
OpenCV
Channel and Spatial Reliability Tracking
Python
Food Classification

A deep learning project that uses the VGG-16 convolutional neural network for automated food image classification. The model leverages transfer learning with pre-trained ImageNet weights to accurately classify different types of food from images.

View Project on GitHub
TECHNOLOGIES USED
TensorFlow
Keras
NumPy
Matplotlib
Pandas
OpenCV
VGG-16
Voice-to-image

This project enables users to generate images from voice input by combining speech recognition with high-speed text-to-image generation. The system leverages NVIDIA TensorRT optimization for real-time image synthesis, capable of generating images in under a second.

View Project on GitHub
TECHNOLOGIES USED
SDXL Turbo
NVIDIA TensorRT
Stable Diffusion XL
Automatic Speech Recognition (ASR)
Natural Language Processing (NLP)
CLIP
U-Net - Image denoising
VAE - Image decoding
AI Lecture Transcriber: YouTube to Notes Converter

This project generates detailed notes from YouTube video transcripts across subjects including OpenCV, Machine Learning, Large Language Models, Data Science & Statistics, and Generative AI. It uses AI to convert video transcripts into comprehensive study materials.

View Project on GitHub
TECHNOLOGIES USED
Streamlit
LangChain