Projects

This is the long-form version of my work—systems, metrics, and links in one place. Featured builds first; older research and hackathon repos underneath. All public code lives on GitHub.

Featured

Clause.ai

Parent-document RAG on Supabase pgvector—searchable chunks plus full-document context so clause generation stays faithful.
LangSmith end to end—distributed tracing to benchmark performance (e.g. ~P50 0.87s, P99 < 5.2s sustained in our runs).
FastAPI async backend with streaming tokens, Google OAuth 2.0, custom rate limits, and a React front end. LangGraph reflection workflow for draft → critique → human-in-the-loop.

LangGraph
Supabase · pgvector
FastAPI · React
LangSmith · OAuth2

Live Source

Munchy Mumbai

LangGraph semantic router classifies intent and dispatches to sub-agents to keep token spend disciplined.
Groq LPUs for fast inference; Supabase hybrid search (vector + trigram) to cross-check Llama 3 outputs and curb hallucinations.
Google Maps & YouTube APIs in a React SPA; FastAPI behind Docker, deployed to Vercel.

LangGraph
Groq · Supabase
React · FastAPI
Docker · Vercel

Live Source

Amazon ML Challenge

Dual-model ensemble on 75K samples—fused Qwen-0.6B text embeddings with ViT vision features through a 3-layer regression head (~44.8% SMAPE).
Fine-tuned RoBERTa-Large FFN with a custom 3-layer head (GELU, Smooth L1); averaged both architectures for robust price prediction.
Competition placement: Rank 183 / 20,000+ teams (top ~1%).

PyTorch · ViT
Qwen · RoBERTa
Multimodal · tabular

Notebooks

CF Time Tale

Estimator for total time on Codeforces—given a handle and derived profile signals (ratings, tags, contest activity, etc.), it predicts how much time you have sunk into the platform, not a toy single-problem timer.
LightGBM on engineered features, with heavy hyperparameter tuning; validation MAE 9.93 min and RMSE 14.06 min on the setup we reported.
Flask app behind nginx, Redis for caching/session-style speedups, containerised with Docker, deployed on Render with a dashboard for breakdowns and analysis.

LightGBM
Redis · nginx
Flask · Docker
Render · Python

Live Codeforces

More on GitHub

AURA: Cosmic Weather

Predictive analytics for space weather financial impact—FastAPI backend, React frontend, plus ML and Monte Carlo simulations to aid insurers.

React · FastAPI ML Forecasting Gemini AI

View code

ProSizeNet

Hybrid CNN-Transformer architecture achieving SOTA performance in predicting protein residue sizes; includes paths toward on-device inference with Core ML for Apple deployment.

CNN · Transformer Core ML Bioinformatics

View code

Multi-Agent Diagnosis

Multi-agent architecture for triage and routing: CrewAI + Gemini agents orchestrated behind a FastAPI service and a React UI.

Multi-agent React · FastAPI CrewAI · Gemini

View code

AI Job Assistant

Career co-pilot with smart job matching, ATS analysis, cover letters, and course suggestions—wired to Google (Gemini) APIs and complementary tooling for search and generation.

Streamlit · FastAPI Google Gemini API ATS · NLP

View code

SkillMorph

Career recommendation platform combining live job listings and course suggestions with auth—built with React, FastAPI, and Firebase for rapid hackathon iteration.

React · FastAPI Firebase Hackathon

View code

Paper Title Gen

Fine-tuned GPT-2 for research-style title generation using full fine-tuning and LoRA / QLoRA-style parameter-efficient training—highlights end-to-end model training, alongside the embedding, chunking, and RAG patterns I use in larger retrieval systems.

LoRA · QLoRA GPT-2 fine-tuning RAG · NLP

Live demo Source

Student Heatmap

Interactive heatmap visualizing the distribution of Indian students using Pandas and Folium.

Shiny Data Viz

Live demo Source

Therapy Chatbot

Multi-agent mental wellness assistant with empathetic AI chat, journaling, and mood tracking.

Wellness LLM

View code

Drug Target MLP

MLP model to classify drug target statuses using TF-IDF, achieving 95% validation accuracy.

Scikit-learn TF-IDF

View code