Aadya Madankar

AI Engineer

Great code

doesn’t just execute

commands.

It learns,

adapts, and

creates solutions.

"Work that

taught me most."

AI-Associate

Designed and deployed a production-ready voice assistant serving 30+ Indian languages with real-time multimodal processing.

Provides accessible voice technology to underserved linguistic communities with intelligent routing, multimodal processing, and culturally aware responses.

Published research in IEEE OTCON 2025, demonstrating scalable architecture for culturally inclusive conversational AI.

Saahayak

GitHub

Saahayak is an AI-powered teaching assistant designed to support educators managing multi-grade classrooms. Built with Google Gemini and Genkit, it generates hyper-localized content, differentiated worksheets, visual aids, and lesson plans—all from text, voice, or image inputs in 25+ Indian languages including Hindi, Marathi, and more.

Built to address the unique challenges of rural Indian schools where one teacher manages multiple grades with limited resources. This project demonstrates how practical AI can save educators hours of preparation time while maintaining the human touch that makes great teaching possible.

Custom SLM: Personalized AI Assistant

GitHub

I'm training a Small Language Model on my experiences, knowledge, and problem-solving patterns—creating an AI version of how I think, code, and approach challenges.

This isn't about replacing myself; it's about capturing my expertise in a model that can assist others even when I'm not available.

This project bridges AI engineering with self-documentation, demonstrating both technical capability and philosophical understanding.

It's a living portfolio that evolves with me, learns from my work, and serves as an accessible interface to my knowledge—showing how SLMs can be personalized tools, not just generic assistants.

The world is one big data problem.

Tools and frameworks I work with daily.

Credentials & Recognition

The AI Ladder Framework

Intro to TensorFlow for AI

Published Research Paper

Open Source Contributions

x3+

AI/ML Projects

Total Certifications

I build

solutions.

Featured Projects

Multimodal PDF Assistant

This project is a sophisticated RAG-based system designed to answer queries from PDF documents by leveraging both text and images. It uses LangChain and Google Gemini Pro to provide accurate, context-aware responses.

View Project on GitHub

TECHNOLOGIES USED

LangChain

Google Gemini Pro

Streamlit

RAG

Ollama_UI

This project demonstrates how to run and manage models locally using Ollama by creating an interactive UI with Streamlit.

View Project on GitHub

TECHNOLOGIES USED

Ollama

Streamlit

Python

OpenAI

Project-Generator

Ever found yourself staring at a blank screen, desperate for a data project idea that actually shows off your skills? It’s a tool built for data folks like us—analysts, scientists, engineers—who want project ideas that feel fresh, relevant, and fun. Using a bit of AI magic, it takes your job title, favorite tools, and industry, then whips up personalized project suggestions complete with details, timelines, and skills graphs to help you bring them to life.

View Project on GitHub

TECHNOLOGIES USED

Google Gemini

Streamlit

Python

Pandas

Matplotlib

Plotly

Multi-Language Invoice Generator using Gemini Vision Model

project leverages the power of the Gemini Vision Model to create an advanced system for generating invoices in multiple languages. Whether you're managing international transactions or need versatile language support, this application provides a solution with state-of-the-art computer vision capabilities.

View Project on GitHub

TECHNOLOGIES USED

Google Gemini

Streamlit

Python

LangChain

PyPDF2

Chroma-DB

Faiss-Index

Multi-Modal-Screen-Assistant

A comprehensive AI-powered desktop assistant that combines multiple AI models to create an intelligent programming companion. This tool integrates visual processing, text analysis, and voice interaction to enhance programmer productivity through an intuitive interface.

View Project on GitHub

TECHNOLOGIES USED

Open AI Whisper

Google Gemini

Groq

PyAudio

Pillow

Paperclip

RAG Notebooks - Advanced Retrieval-Augmented Generation Systems

The purpose of this repository is thus to provide a comprehensive and dynamic hub to showcase the latest retrieval augmented generation (RAG) systems, and to enhance their error reduction as well as processing speed and contextual richness. It’s the go-to resource for those looking to become an RAG master — an assembly of curated Jupyter notebooks for learning and teaching RAG concepts. If you’re trying your hand at retrieval-based approaches to improve AI models or are just a fungi out for some advanced RAG methods, this repository is your thing.

View Project on GitHub

TECHNOLOGIES USED

Retrival Augemented Generation(RAG)

Python

LlamaIndex

VectorStore

Open AI

Gemini

Cohere

DallE, Elevan-Labs

Crawl

Hugging-Face

Perplexity

JSON

Web-Search

Advance-Rag-with-Langchain

This project is a comprehensive exploration of advanced techniques in building chatbots using LangChain, a powerful framework for developing applications with large language models (LLMs).

View Project on GitHub

TECHNOLOGIES USED

OpenAI

Groq

Streamlit

Python

Langchain

LangServe

BeautifulSoup

ChromaDB

Wikipedia

Arvix

Sentence Transformer

PyPDF2

Crew-AI

A Python-based multi-agent AI system built with the CrewAI framework for orchestrating collaborative autonomous agents that work together to solve complex tasks.

View Project on GitHub

TECHNOLOGIES USED

Crew-AI

Hugging-Face

Python

NVIDIA Model with Langserve

Deploy NVIDIA'S GPU Accelerated AI models as API using Langserve

View Project on GitHub

TECHNOLOGIES USED

Nvdia AI Models

Langchain

Langserve

Python

Streamlit

Object Tracking

This project contains a Python script for real-time object tracking using OpenCV.

View Project on GitHub

TECHNOLOGIES USED

Google Gemini

Open-CV

Channel and Spatial Reliability Tracking

Python

Food Classification

A deep learning project that uses the VGG-16 convolutional neural network for automated food image classification. The model leverages transfer learning with pre-trained ImageNet weights to accurately classify different types of food from images.

View Project on GitHub

TECHNOLOGIES USED

TensorFlow

Keras

NumPy

Matplotlib

Pandas

OpenCV

VGG-16

Voice-to-image

This project enables users to generate images from voice input by combining speech recognition with high-speed text-to-image generation. The system leverages NVIDIA TensorRT optimization for real-time image synthesis, capable of generating images in under a second.

View Project on GitHub

TECHNOLOGIES USED

SDXL Turbo

NVIDIA TensorRT

Stable Diffusion XL

Automatic Speech Recognition (ASR)

Natural Language Processing (NLP)

CLIP

U-Net - Image denoising

VAE - Image decoding

AI Lecture Transcriber: YouTube to Notes Converter

A project aims to provide detailed notes based on YouTube video transcripts across various subjects including Open-CV, Machine Learning, Large language Models, and Data Science & Statistics, Generative -AI. With the power of AI, you can now convert video transcripts into comprehensive study materials.

View Project on GitHub

TECHNOLOGIES USED

Streamlit

LangChain

00:03

00:15

MANTIS LAUNCH REEL ■

2025

"Work that taught me most."

AI-Associate

Saahayak

Custom SLM: Personalized AI Assistant

The world is one big data problem.

Tools and frameworks I work with daily.

Credentials & Recognition

I build solutions.

Featured Projects

"Work that

taught me most."

I build

solutions.