Work

Jan 2021 – present
Orbi Health (EkaCare) Chief Data Scientist

Leading AI/ML at EkaCare to build sovereign, purpose-built intelligence for Indian healthcare. The core thesis: general-purpose LLMs are not enough — Indian healthcare needs models trained on its own language, its own clinical context, its own constraints.

Built the Parrotlet family — a suite of small, domain-specific models. Parrotlet-A (medical ASR, 5B parameters) achieves state-of-the-art on Indian-accented medical speech in Hindi, Tamil, Bengali, and 15+ other languages. Parrotlet-V Lite (4B vision LLM) reads prescriptions, lab reports, and clinical documents with handwriting OCR and structured extraction. Parrotlet-E powers multilingual medical embeddings across 22 Indian languages, anchored by IndicMTEB — a benchmark we built and open-sourced for medical NLP evaluation in India.

Built KARMA (OpenMedEvalKit), an open-source evaluation framework for medical AI — because you can't improve what you can't measure, and most medical AI benchmarks don't reflect real Indian clinical conditions. Released four domain-specific evaluation datasets alongside it.

On the product side: EkaScribe (ambient AI medical scribe, generates structured SOAP notes in real-time from doctor-patient conversations), DocAssist (clinical AI assistant with drug interaction alerts and voice documentation), and Document Understanding (automated parsing of lab reports, prescriptions, and insurance claims with SNOMED-CT/LOINC coding). These run at production scale across thousands of doctors on the EkaCare platform.

Jan 2018 – Jan 2021
VY Labs Technologies (Synaptic) Head of Data Science

Built the data science function at Synaptic, a company selling AI-powered intelligence on private companies to institutional investors. The challenge: most of the interesting signal about private companies is unstructured, noisy, and scattered — employee reviews, job postings, news, regulatory filings, web footprints.

Built NLP pipelines to classify and extract signal from Glassdoor reviews at scale — sentiment, topic modeling, and forward-looking indicators of company trajectory. Built a graph ML system to map competitive relationships across thousands of private companies, identifying clusters, tracking market structure shifts, and flagging emerging competitors before they become obvious.

Also worked on time-series signals as alternate economic indicators: NYC 311 noise complaints as a proxy for urban economic recovery post-COVID, US electricity demand tracking as a leading industrial activity signal. The underlying idea throughout: any trace of human behavior at scale, if measured carefully, tells you something real about what's happening in the economy.

2016 – 2017
MusicMuni Labs Co-Founder

Co-founded MusicMuni Labs to commercialize research on AI for Indian classical music. Built Riyaz — an AI-powered guru for Indian classical vocal practice. The app listens as you sing, understands the raga you're practicing, and gives real-time feedback on pitch accuracy, tonal quality, and adherence to the raga's grammar.

Reached 1M+ installs and 10K daily active users. Led the full stack: ML model development, product design, content creation, and growth. The core technical challenge was building a pitch analysis and singing assessment system that worked robustly on a smartphone, in real time, without internet — for a musical tradition where the margin for error is measured in microtones.

2011 – 2016

Five years of doctoral research at one of the world's leading music technology labs, under Prof. Xavier Serra, within the ERC-funded CompMusic project — a landmark effort to build computational tools for non-Western music traditions.

My focus was Indian classical music: how do you teach a computer to understand a raga? A raga is not a scale — it's a complex melodic grammar, a set of rules about which notes to use, how to ornament them, how phrases unfold over time. Formalizing this computationally, from raw audio, was the central problem.

Built computational models for tonic identification (the reference pitch everything else is measured against), raga recognition using phrase-level melodic representations, melodic similarity for large-scale pattern discovery, and automated methods for mining melodic motifs across hundreds of hours of archival recordings. Published 22 papers across ISMIR, ICASSP, Journal of New Music Research, and related venues.

Also core member of the CAMUT project, working on commercial exploitation of CompMusic technologies. Co-developed Dunya, a web platform for exploring and analyzing the CompMusic music corpora. Contributed to Essentia, the open-source audio analysis library (now widely used across the industry).

2012 – 2013
Aynur Labs (Stringwars.co) R&D Consultant

Built real-time chord recognition and pitch tracking algorithms for mobile guitar learning applications. The constraint: accurate, low-latency signal processing running entirely on-device. Developed and optimized these systems for integration into consumer guitar learning apps.

2011

Built a robust song version identification system capable of detecting remixes and cover versions across large catalogs. Also built systems for automatic song structure detection (verse, chorus, bridge segmentation) and music hook identification — finding the most memorable or commercially significant segments of a track.

2010 – 2011
Digital Audio Processing Lab, IIT Bombay Researcher

Research with Prof. Preeti Rao on Music Information Retrieval. Worked on pattern matching-based approaches for tempo estimation from audio recordings of Indian classical and folk music.

Dec 2008 – Jan 2010
Anveshan Telecom Senior Engineer

Built noise reduction, automatic volume control, and echo cancellation systems for VOIP and wire-line networks. Implemented Group 3 FAX MODEMs and G.766 FAX protocols for digital circuit multiplexing equipment.

Jul 2008 – Dec 2008
ITTIAM Systems Engineer

Platform-specific optimization of Windows Media Audio (WMA) decoder on ARM Cortex-A8. First job out of IIT Kanpur — learned how much performance lives in the gap between algorithm and hardware.