Projects
Here are some projects I have worked on for school, work, or fun.
Trained multi-modal text-vision models for detecting previously unseen objects in video frames based on natural language queries.
Fine-tuned 7B-parameter LLMs (LLaMA) and BERT-based models (XLM-R), and used zero-shot LLM prompting, for Hindi-English hate speech detection.
Video Prediction with convolution-based SimVP and semantic segmentation with U-Net for modeling the physics of colliding objects.
Causal Inference x Machine Learning for forecasting counterfactual outcomes to answer what-if questions.
Collaborative Filtering Recommender System in PySpark on a dataset with >179M user interactions.
Statistical analysis of job data (Hypothesis Testing), Salary Prediction (Regression), and Job Title Prediction (Classification).
Novel method that uses language models and machine learning to predict COVID-19 cases and emerging variants.
Statistical Analysis of contact mixing patterns to devise interventions to curb the spread of infectious diseases.
CV- and NLP-based pipeline to extract regulatory inquiries from image-based PDFs and generate automated responses. Reduced the manual workload of typing over 1,200 responses each month.
Studied the relationship between tweets and COVID-19 cases in India and analyzed the significant themes associated with the latent dimensions.
Python service to scan the text content and metadata of emails to identify Business Email Compromise (BEC). We identified a 90-year-old customer who had been compromised for over a year.