I'm a final-year B.Tech CSE student with a strong interest in applied AI and machine learning. I enjoy working across the full spectrum of AI/ML — from classical ML and computer vision to training and deploying large language models. Currently, I work as a Data Science Intern at Bajaj Finserv Health Limited, where I build and maintain production AI systems for healthcare claims processing.
Systems I've designed, built, and shipped in production during my internship. Click a card to expand the detail.
End-to-end automated system for individual eye assessment using MTCNN and 12-layer Vision Transformers (BioMedCLIP) for high-precision diagnostic classification.
Initial human detection (99% accuracy) followed by MTCNN for eye frame extraction. Applied Lab-space preprocessing: 1.2x whiteness enhancement for cataracts, 0.8x darkening for pupils, and advanced haze removal/de-noising.
Leveraged Microsoft's 12-layer BioMedCLIP. Froze first 9 layers and utilized last 3 Vision Transformer (ViT) layers. Standardized eye frames to 224×224 and generated 512-dimensional embeddings for complex pattern recognition.
Radius-based pupil detection (validated at >87% confidence) mapping to 128-dim features. Combined with eye embeddings and MTCNN physical scores (whiteness, haziness, edges) into a 4-layer custom MLP (512→256→64→4) optimized via Optuna.
Automated end-to-end pipeline for IPD insurance claims — document verification, classification, and rule-based adjudication for dialysis and cataract claim types.
Fine-tuned a 38B vision-language model to extract structured data from handwritten OPD prescriptions at production scale.
A self-service ML platform that lets non-technical teams train, benchmark, and deploy custom document classifiers — the backbone of all claims verification workflows at BFHL.
Teams upload a folder of labelled documents — the folder name becomes the class label. The platform automatically trains all three approaches, benchmarks them against the dataset, and recommends the optimal model. Weights are exported in standard formats (ONNX, TorchScript, etc.) ready for deployment. No training or testing scripts needed.
An internal tool to objectively benchmark and compare multiple LLMs on medical document extraction — helping make informed model selection decisions backed by data.
Instrumented the IPD and OPD claim pipelines with full request tracing, cost monitoring, and performance dashboards using LangFuse.
A mix of healthcare, agriculture, industry, and general ML projects.
A broad foundation across the AI/ML stack — built through coursework, side projects, and real production work.
Hackathons, awards, and milestones along the way.
One of the best ways I stay sharp is by staying current — I run a small Instagram page for that.
Feel free to reach out — whether it's about a role, a project idea, or just to talk AI/ML.