|

Data Science Student & Machine Learning Enthusiast

Vedant Vardhaan

About Me

I am a Data Science student at UC San Diego with a passion for machine learning, AI, and software development. My goal is to create meaningful solutions that leverage data to solve complex problems.

Machine Learning

Experience with TensorFlow, Scikit-learn, Neural Networks, NLP, and Generative AI models. Skilled in developing predictive algorithms and data pipelines.

Data Analytics

Proficient in Pandas, NumPy, Matplotlib, Seaborn, Statistical Modeling, Time Series Analysis, and Monte Carlo Simulations.

Full Stack Development

Experience in React.js, React Native, Node.js, Express.js, RESTful APIs, PostgreSQL, and MongoDB for creating seamless applications.

My Resume

My professional journey, education, and relevant skills in the field of data science and software development.

Education

Sept 2023 - Mar 2027 (expected)

Bachelor of Science in Data Science

University of California - San Diego (UCSD)

(Junior Standing)

GPA: 3.96 | Honors: Provost Honors (6 consecutive quarters)

UCSD Logo

Relevant Coursework:

Data Structures & Algorithms, Statistical Methods, Linear Algebra, Calculus for Science & Engineering, Introduction to Data Science, Managing Diverse Teams, Statistical Analysis, Data Visualization, Probabilistic Modeling, Machine Learning

Experience

Oct 2025 - Present

Project Lead – Mangrove Monitoring

Engineers for Exploration (E4E) @ UC San Diego

E4E Logo
  • Leading an interdisciplinary team to develop an AI-driven system that predicts mangrove presence by fusing drone imagery and Sentinel-2 satellite data, advancing coastal ecosystem monitoring through remote sensing and machine learning.
  • Overseeing data pipeline design, feature fusion, and model integration, collaborating with data and ML engineers to build scalable, high-performance workflows using Python, TensorFlow, NumPy, and Google Earth Engine APIs.
  • Driving the project roadmap and research direction, bridging geospatial analytics, computer vision, and environmental informatics to deliver a deployable predictive model for the Scripps Institution of Oceanography.
Jun 2025 - Sep 2025

Technology Consultant Intern

PricewaterhouseCoopers Services LLP (PwC India)

PwC Logo
  • Built agentic AI chatbot for natural language–based AWS/Azure onboarding and infrastructure provisioning across 15+ cloud services.
  • Designed multi-agent orchestration with LangGraph, Gemini 2.5 Pro, and LangChain, enabling modular service integration and cutting manual configuration time by 60%.
  • Implemented secure credential management and automated baseline security (IAM, GuardDuty, CloudTrail, Config) with Azure Key Vault, accelerating migration timelines by 40% while ensuring compliance.
Jul 2025 - Present

Assistant Projects Director (Projects Mentor)

Data Science Student Society (DS3), UC San Diego

DS3 Logo
  • Lead the ideation and selection of 10-12+ quarterly data science projects by sourcing high-quality datasets, defining problem statements, and aligning scope with industry-relevant methodologies.
  • Mentor multiple project teams through full data pipelines, providing guidance on data preprocessing, exploratory analysis, feature engineering, model selection, and deployment strategies.
  • Collaborate with senior leadership in weekly technical meetings to coordinate project timelines, establish evaluation metrics, and oversee execution for end-of-quarter showcases.
Mar 2025 - Jun 2025

Tutor/Instructional Assistant

Halicioglu Data Science Institute, UC San Diego

HDSI Logo
  • Conduct weekly discussion sections and office hours for DSC 40A, reinforcing key concepts in empirical risk minimization, optimization, regression, classification, and discrete probability.
  • Provide targeted tutoring to clarify theoretical machine learning principles, ensuring students develop rigorous problem-solving and mathematical reasoning skills.
  • Evaluate assignments and exams with a focus on consistency and fairness, offering structured feedback to enhance comprehension.
Nov 2024 - Present

Engineering Manager & Software Developer

Computer Science and Engineering Society (CSES Open-Source)

CSES Logo
  • Lead development of TritonSpend, a cross-platform financial management app built with React Native, Node.js, and PostgreSQL.
  • Oversee sprint planning, developer onboarding, code reviews, and architectural decisions to ensure scalable and secure feature delivery (e.g., budget tracking, real-time analytics, AI-based insights).
  • Implement CI/CD best practices, perform QA testing, and maintain technical documentation to streamline open-source contributions.
Oct 2024 - Jan 2025

Quantitative Analyst

Triton Quantitative Trading (TQT)

Triton Quantitative Trading Logo
  • Built a hybrid LSTM/GRU forecasting model with Monte Carlo simulations using Geometric Brownian Motion for probabilistic stock predictions.
  • Integrated technical indicators, VADER-based sentiment analysis, and risk metrics (Sharpe Ratio) for multi-modal input.
  • Deployed as a modular Streamlit app with yFinance ingestion, sklearn preprocessing, and REST API support.
Jun 2024 - Sept 2024

Data Analyst Research Intern

Indian Institute of Technology (IIT) - Guwahati

IIT Guwahati Logo
  • Developed a Convolutional Neural Network Image-processing model with ResNet50 for SAR image classification, achieving 80% accuracy in segmenting land cover across agriculture, barren land, grassland, and urban areas.
  • Applied advanced speckle filters (Lee and Gamma MAP) to enhance SAR image quality by effectively reducing noise.
  • Conducted extensive model evaluation, using precision, recall, F1-scores, and confusion matrices to ensure robust classification performance.

Technical Skills

Programming Languages

Python
Java
JavaScript
R
SQL
HTML
CSS
TypeScript

Cloud & Infrastructure

AWS
EC2
S3
RDS
VPC
IAM
CloudTrail
GuardDuty
Azure
Azure Key Vault
Config
Docker
CI/CD

Machine Learning & AI

TensorFlow
Scikit-learn
Neural Networks
NLP
Generative AI
Hugging Face
LangChain
OpenCV
ChromaDB
Gemini Pro
Agentic AI
MCP
Computer Vision
Remote Sensing
Google Earth Engine
LSTM
GRU
CNN
ResNet50
Random Forest
KNN
GridSearchCV
Cross-validation
RAG
PyPDFLoader
VADER
Sentiment Analysis
SAR Image Processing
Speckle Filtering
LangGraph

Data Science & Analytics

Pandas
NumPy
Matplotlib
Seaborn
Statistical Modeling
Time Series Analysis
Monte Carlo Simulations
Data Preprocessing
Feature Engineering
Data Visualization
Exploratory Data Analysis
yFinance
Technical Indicators
Risk Metrics
Sharpe Ratio
Precision/Recall
F1-Score
Confusion Matrix
Geospatial Analytics
Environmental Informatics

Development

React.js
React Native
Node.js
Express.js
PostgreSQL
MongoDB
RESTful APIs
FastAPI
Streamlit
Tkinter
MySQL
GUI Development
Cross-platform
PIL
Image Processing
Audio Analysis
Sprint Planning
Code Reviews
QA Testing
Technical Documentation
D3.js
TopoJSON
Scrollama
Data Visualization
Interactive Storytelling

My Projects

A showcase of my technical projects in machine learning, data science, and software development.

Reinforcement Learning Poker Bot preview

Reinforcement Learning Poker Bot

Nov 2025 - Dec 2025

Comprehensive comparison of game-theoretic and deep RL approaches for Texas Hold'em poker, implementing MCCFR, DQN, and NFSP algorithms.

  • • Implemented MCCFR (95.3% win rate, +119.26 BB/100), DQN with round-specific models, and NFSP v2 with pot-aware reward shaping (93.9% win rate, +87.51 BB/100)
  • • Built evaluation framework testing agents against Random and OddsAgentV21 with statistical analysis (confidence intervals, p-values, BB/100 metrics)
  • • Developed GPU-accelerated training pipelines using TensorFlow/PyTorch with RLCard, including visualization tools and model persistence
Python TensorFlow PyTorch Reinforcement Learning MCCFR DQN NFSP RLCard
Steam Game Recommendation System preview

Steam Game Recommendation System

Nov 2025 - Dec 2025

ML system that recommends Steam games to users by combining user preferences, game statistics, and semantic text embeddings from reviews.

  • • Achieved 92.79% AUC-ROC and 95.17% precision by combining statistical features, user encoding, and SBERT semantic embeddings, with 20.9% improvement over baseline
  • • Tested multiple approaches incrementally on ~10,000 games and ~25,000 reviews, showing how each feature type (statistics, user patterns, text semantics) contributes to performance
  • • Handled cold-start scenarios achieving 85.32% AUC-ROC for new users with limited history, making it production-ready for real-world deployment
Python Machine Learning Recommendation Systems SBERT Logistic Regression NLP AUC-ROC
PantryPal recipe recommender preview

PantryPal – Smart Recipe Recommender

Oct 2025 - Dec 2025

Streamlit app that suggests recipes from text or pantry photos using image recognition, fuzzy matching, and nutrition-aware ranking.

  • • Combines text input + EfficientNetB0 ingredient detection with OpenCV preprocessing
  • • Fuzzy ingredient and recipe matching with duplicate handling and quick delete actions
  • • Scores and ranks recipes by match overlap and nutrition, served via Streamlit UI
Python PyTorch EfficientNetB0 OpenCV Streamlit RapidFuzz pandas
TubeScope dashboard preview

TubeScope – Trending Lifecycle Analytics

Oct 2025 - Dec 2025

ML-driven dashboard predicting which YouTube trending videos sustain virality beyond a day, with survival analysis visuals.

  • • Automated daily YouTube Data API pulls to build time-stamped trending snapshots
  • • Random Forest classifier optimized for recall plus Kaplan-Meier survival curves
  • • Streamlit UI highlighting top viral candidates and feature importance insights
Python scikit-learn Random Forest YouTube Data API pandas Streamlit lifelines
The Seismic Lottery cover

The Seismic Lottery: When Infrastructure Matters More Than Magnitude 🌍

May 2025 - Jun 2025

Interactive data visualization exploring earthquake impacts based on infrastructure resilience and preparedness.

  • • Built 3D interactive globe with D3.js for real-time earthquake visualization with dynamic tooltips
  • • Developed educational storyline with scroll-based storytelling and energy calculator tools
  • 🏆 Won Best Project Award & People's Choice Award among 42 teams
HTML5 CSS3 JavaScript D3.js TopoJSON
Watts the Problem cover

Watts the Problem?

Mar 2025

Power outage prediction using machine learning techniques.

  • • Built end-to-end ML pipeline with data preprocessing and feature engineering
  • • Optimized Random Forest using GridSearchCV with cross-validation
  • • Achieved 74.3% accuracy with precision, recall, and F1-score metrics
Python Scikit-learn GridSearchCV Random Forest Cross-validation
MarketScope cover

MarketScope: Intelligent Stock Forecasting App

Oct 2024 - Jan 2025

Hybrid LSTM/GRU model with Monte Carlo simulations for stock price forecasting.

  • • Built hybrid LSTM/GRU model with Monte Carlo simulations for stock forecasting
  • • Integrated technical indicators, sentiment analysis, and risk metrics
  • • Deployed modular Streamlit app with yFinance and REST API
Python LSTM/GRU Monte Carlo Streamlit NLP
Blood Report Analysis Chatbot cover

Blood Report Analysis Chatbot

Oct 2024

AI-powered RAG chatbot for medical report analysis.

  • • Built RAG system using LangChain and ChromaDB for document retrieval
  • • Integrated Hugging Face embeddings and PyPDFLoader for PDF processing
  • • Developed interactive Streamlit UI with real-time chat functionality
RAG LangChain ChromaDB Hugging Face PyPDFLoader Streamlit
SAR Image Classification cover

SAR Image Classification with CNN

Jun 2024 - Sept 2024

CNN-based SAR image classification with speckle filtering for land cover analysis.

  • • Developed CNN model with ResNet50 for SAR image classification achieving 80% accuracy
  • • Applied Lee and Gamma MAP speckle filters to enhance SAR image quality
  • • Conducted extensive evaluation using precision, recall, F1-scores, and confusion matrices
Python TensorFlow CNN ResNet50 SAR Processing
KNN Project cover

Advanced Image Processing & KNN Classification

Jan 2024 - Mar 2024

Comprehensive image processing suite with KNN classification.

  • • Developed image processing operations using Python, NumPy, and PIL
  • • Implemented K-Nearest Neighbors for image categorization
  • • Built efficient image handling system with PIL-NumPy integration
Python NumPy PIL KNN
CampusShelf cover

CampusShelf

2024

Library management system with book lending tracking.

  • • Developed a Python-Tkinter desktop application for library management
  • • Implemented MySQL database integration for data persistence
  • • Created intuitive UI for book lending and tracking operations
Python Tkinter MySQL GUI
Song Recommender cover

Song Recommender

2024

Music recommendation system using audio features and artist data.

  • • Built a recommendation engine analyzing audio features
  • • Implemented search functionality across artist discographies
  • • Developed similarity matching algorithms for song suggestions
Python ML Audio Analysis NLP

Achievements

Highlights of impact, leadership, and recognition across projects.

PantryPal award preview

PantryPal – DS3 Project Showcase

Best Project

Fall 2025 · Data Science Student Society

  • • Mentored and led a 4-person team across planning, technical direction, and iterative execution.
  • • Built image-to-ingredient + fuzzy recipe matching pipeline showcased live.
  • • Won Best Project, Presentation, and Website awards at the Fall 2025 showcase.
TubeScope dashboard award preview

TubeScope – DS3 Project Showcase

Best Project · 3rd

Fall 2025 · Data Science Student Society Showcase

  • • Mentored and led a 4-person team through planning, technical direction, and iterative execution.
  • • Delivered a Streamlit dashboard predicting sustained virality with survival analysis visuals.
  • • Automated daily YouTube Data API pulls and highlighted feature importance for creators and marketers.
DSC 106 showcase awards preview

The Seismic Lottery – DSC 106 Showcase

Best Project People's Choice

Spring 2025 · DSC 106 (82 students)

  • • Won Best Project and People’s Choice for an interactive earthquake impact visualization.
  • • Built 3D globe, magnitude/casualty toggles, and energy calculator with D3.js + TopoJSON.
  • • Led narrative design and scroll-based interactions to show how infrastructure shapes outcomes.

Get In Touch

Feel free to contact me for collaborations, opportunities, or just to say hello!

Contact Information

Email

vvardhaan@ucsd.edu

Location

La Jolla, California

Connect With Me

Send Me a Message