Work
Projects
A selection of projects spanning multi-modal AI, computer vision, and autonomous systems.
Mind Palace
Search your memories the way you chat, but across photos, screenshots, and moments. Mind Palace lets you describe what you're looking for in plain English and instantly finds the visual memory that matches. Built with privacy at its core, with active research into running everything on-device so your data, and your memories, never leave your hands.
Netrr
A lightweight wearable that gives visually impaired people a real-time understanding of the world around them — from recognizing familiar faces to describing entire scenes through voice. Works offline for everyday tasks and connects online for richer interactions. Co-designed with the Ahmedabad Blind People Association to solve real problems, not imagined ones.
EcoBot
A self-driving robot that navigates public spaces, identifies waste, and picks it up — no human operator needed. It learns to coordinate its movement and arm control through trial and error, getting smarter with every run. Built in simulation as a proof of concept for scalable urban cleanup.
GitHub
Voice Biometrics Pipeline
Know who's speaking — even in noisy, real-world audio. This production system identifies speakers from their voice alone, and gets smarter over time by intelligently choosing which samples need human review. Less manual work, better accuracy, built for scale.
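One common way to choose which samples need human review is margin-based uncertainty sampling: send a reviewer the clips where the top two speaker scores are closest. The sketch below illustrates that general idea only; it is an assumption, not the production system's method, and the function names, clip ids, and scores are all hypothetical.

```python
from typing import Dict, List


def margin(scores: Dict[str, float]) -> float:
    """Gap between the best and second-best speaker scores (small = uncertain)."""
    ranked = sorted(scores.values(), reverse=True)
    if len(ranked) < 2:
        # With one candidate (or none) there is nothing to compare against.
        return ranked[0] if ranked else 0.0
    return ranked[0] - ranked[1]


def select_for_review(predictions: Dict[str, Dict[str, float]], budget: int) -> List[str]:
    """Return the ids of the `budget` clips with the smallest margin."""
    ranked_ids = sorted(predictions, key=lambda clip_id: margin(predictions[clip_id]))
    return ranked_ids[:budget]


# Hypothetical similarity scores per audio clip, per enrolled speaker.
predictions = {
    "clip_a": {"alice": 0.92, "bob": 0.10},
    "clip_b": {"alice": 0.51, "bob": 0.49},
    "clip_c": {"alice": 0.30, "bob": 0.65},
}
review_queue = select_for_review(predictions, budget=1)
print(review_queue)
```

Here only the near-tie clip is queued, so reviewer time goes to the samples the model is least sure about.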
Thout.ai
Multi-Modal RAG Pipeline
Ask a question, get an answer grounded in your actual documents — not hallucinated. This system reads and understands large, messy document collections (PDFs, slides, reports) and pulls the right context to give accurate, source-backed answers every time.
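The retrieval step behind a source-backed answer can be sketched as ranking document chunks by similarity to the question and keeping each chunk's source id. This is a toy illustration under stated assumptions: word counts stand in for a real embedding model, and the file names, chunk text, and function names are hypothetical rather than taken from the actual system.

```python
import math
from collections import Counter
from typing import List, Tuple


def embed(text: str) -> Counter:
    """Toy stand-in for an embedding model: lowercase word counts."""
    return Counter(text.lower().split())


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two sparse word-count vectors."""
    dot = sum(a[term] * b[term] for term in a)
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0


def retrieve(question: str, chunks: List[Tuple[str, str]], k: int = 2) -> List[Tuple[str, str]]:
    """Return the k (source_id, text) chunks most similar to the question."""
    q = embed(question)
    ranked = sorted(chunks, key=lambda chunk: cosine(q, embed(chunk[1])), reverse=True)
    return ranked[:k]


# Hypothetical chunks, each tagged with the source it came from.
chunks = [
    ("report.pdf#p3", "quarterly revenue grew twelve percent"),
    ("slides.pptx#s7", "the hiring plan adds four engineers"),
    ("notes.pdf#p1", "revenue forecast for next quarter"),
]
top = retrieve("how much did quarterly revenue grow", chunks, k=1)
print(top[0][0])
```

Carrying the source id alongside each chunk is what lets the final answer cite where its context came from.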
Thout.ai
6-Agent LLM Orchestrator
Tell it what you need in plain English — "create a sprint for the auth feature" or "summarize last week's standup notes" — and six AI agents coordinate behind the scenes to get it done across Jira, Notion, and 28 other tools. If something breaks, it fixes itself and tries again.
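The "fixes itself and tries again" behavior can be pictured as a retry loop in which each failure is handed to a repair step before the next attempt. The sketch below is a minimal illustration of that pattern, assuming a simple dictionary of arguments; `run_with_repair`, `create_sprint`, and `fill_missing` are hypothetical names and do not reflect the orchestrator's real interfaces or any Jira or Notion API.

```python
from typing import Callable, Dict


def run_with_repair(
    step: Callable[[Dict], str],
    args: Dict,
    repair: Callable[[Dict, Exception], Dict],
    max_attempts: int = 3,
) -> str:
    """Run `step`; on failure, let `repair` adjust the arguments and retry."""
    last_error = None
    for _ in range(max_attempts):
        try:
            return step(args)
        except Exception as err:  # broad on purpose: any tool failure triggers a repair
            last_error = err
            args = repair(args, err)
    raise RuntimeError(f"gave up after {max_attempts} attempts") from last_error


# Hypothetical tool call: fails until a project key is supplied.
def create_sprint(args: Dict) -> str:
    if "project" not in args:
        raise KeyError("project")
    return f"sprint '{args['name']}' created in {args['project']}"


# Hypothetical repair: fill in a default for the missing field named in the error.
def fill_missing(args: Dict, err: Exception) -> Dict:
    return {**args, err.args[0]: "AUTH"}


result = run_with_repair(create_sprint, {"name": "auth feature"}, fill_missing)
print(result)
```

Capping the number of attempts keeps a step that cannot be repaired from looping forever; the last error is chained onto the final exception for debugging.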
Thout.ai