Full-Stack Data Developer

Yashwanth Reddy Boddireddy

Where Data Meets Development

Transforming complex data into intuitive solutions through full-stack development and data science

Download CV

2+

Years Experience

15+

Projects Completed

10+

Technologies Mastered

About Me

My Journey

From Electrical Engineer to full-stack data development, my path has been driven by a passion for solving complex problems with data and code.

Yashwanth Reddy Boddireddy

At the intersection of AI innovation and data science, I transform complex problems into impactful solutions that drive measurable business outcomes.

Software Engineer specializing in AI applications and machine learning, helping organizations leverage data for competitive advantage. With experience at Accenture and Headstarter AI, I focus on developing intelligent systems that enhance user experiences.

I combine technical expertise with business acumen, following a systematic approach: understanding requirements, designing data-driven architectures, and implementing scalable solutions with measurable results.

Expected May 2025

Master of Science in Data Science, Statistics @ New Jersey Institute of Technology (NJIT), Newark, NJ

Data Science

Statistics

Machine Learning

Deep Learning

Software Engineering

07/2024 – Present

Web Resource Data Scientist @ VMware

Partnered with product and engineering teams to identify and frame 10+ high-impact business problems across SaaS product telemetry and cloud infrastructure, leading to a 15% improvement in customer retention via targeted insights
Designed and automated data pipelines using PySpark and SQL to ingest and process ~3 TB of telemetry and log data daily from VMware's vSphere and NSX platforms, reducing data availability latency from 12 hours to under 2 hours
Built and deployed machine learning models (e.g., random forest, logistic regression, anomaly detection) to predict VM resource exhaustion and detect anomalous user behavior, increasing proactive alert accuracy by 28%
Engineered 100+ features from product usage logs, cloud performance metrics, and customer support data, and conducted hyperparameter tuning using MLflow and Optuna to improve F1-score by 22% across models
Delivered containerized ML inference services (Flask + Docker) deployed via Jenkins and Kubernetes, reducing model deployment cycles from 5 days to under 1 day and enabling real-time scoring for 50K+ daily API calls
Monitored deployed models for concept drift and performance decay using Grafana and Prometheus dashboards; retrained models monthly using scheduled Airflow workflows, maintaining <5% prediction error across quarters
Created interactive dashboards and reports in Tableau and Power BI to visualize model predictions and business KPIs, which were used weekly by senior leadership to guide roadmap and operational decisions

PySpark

SQL

Machine Learning

Docker

Kubernetes

MLflow

Airflow

Tableau

Power BI

07/2024 – 09/2024

Software Engineering Fellow @ Headstarter AI

Built 5+ AI apps and APIs using NextJS, OpenAI, Pinecone and Stripe API
Successfully led 4+ engineering fellows to deliver projects from design to deployment
Enhanced team productivity through effective leadership and collaboration

NextJS

OpenAI

Pinecone

Stripe API

Team Leadership

10/2021 – 08/2023

Sr. Data Scientist @ Accenture

Led a cross-functional initiative to design and launch a recommendation engine for personalized content across HBO Max, resulting in a 19% increase in average user session duration and a 12% boost in monthly active users (MAU) within the first quarter of deployment
Architected and deployed a real-time churn prediction system using ensemble models (LightGBM, XGBoost) on a Spark-based pipeline, enabling targeted retention campaigns that reduced churn by 8.5% YoY in key demographic segments
Directed the end-to-end experimentation pipeline, including A/B testing frameworks and causal inference techniques (e.g., uplift modeling, propensity scoring), to evaluate content previews and marketing placements across digital platforms — increasing click-through rates by 2%
Managed large-scale data acquisition and enrichment pipelines using Airflow and AWS Glue to process over 5 TB of daily user interaction logs, integrating data from third-party ad platforms, streaming analytics, and CRM tools for unified audience profiling
Developed NLP-based models (topic modeling, sentiment analysis) on viewer feedback and closed-caption text to inform editorial decisions and improve trailer targeting, contributing to a 30% lift in trailer-to-watch conversion for new releases
Collaborated with product, engineering, and data governance teams to define data standards and deploy modular, reusable ML components using Databricks and MLflow, cutting model delivery timelines by 30% across business units
Mentored a team of 3 junior data scientists and analysts, conducting regular peer code reviews, technical deep dives, and knowledge-sharing sessions to elevate team productivity and ensure reproducibility and scalability in deployed solutions

Machine Learning

Spark

AWS

NLP

Databricks

MLflow

Team Leadership

A/B Testing

Skills & Expertise

Technical Proficiency

A comprehensive overview of my technical skills in data science, engineering, MLOps, and development technologies.

Core Competencies

Data Science & Analysis

Data Engineering

MLOps & DevOps

AI & Machine Learning

Frontend Development

Backend Development

Projects

Featured Work

A showcase of my projects spanning data visualization, full-stack applications, and data analysis solutions.

Interactive Data Dashboard

A real-time dashboard for monitoring key business metrics with interactive visualizations.

React

D3.js

Node.js

Socket.io

Demo Code

E-commerce Analytics Platform

Full-stack application for e-commerce businesses to track sales, customer behavior, and inventory.

Next.js

MongoDB

Express

Chart.js

Demo Code

Predictive Sales Analysis

Machine learning model to predict future sales based on historical data and market trends.

Python

Pandas

Scikit-learn

Matplotlib

Demo Code

Customer Segmentation Tool

Data analysis tool that segments customers based on purchasing behavior and demographics.

Python

Clustering

Tableau

SQL

Demo Code

Inventory Management System

Full-stack application for tracking inventory levels, orders, and supplier information.

React

Node.js

PostgreSQL

Express

Demo Code

Financial Performance Dashboard

Interactive dashboard visualizing financial KPIs and performance metrics for executive decision-making.

React

D3.js

TypeScript

REST API

Demo Code

AI Mock Interviewer

Next.js-based AI Interviewer using GPT-4 with speech recognition and analytics dashboard for interview performance tracking.

Next.js

OpenAI GPT-4

Speech Recognition

Analytics

Demo Code

T20 World Cup Cricket Analytics

Power BI dashboard for T20 player selection with 90% match-winning probability using data-driven analysis.

Power BI

Python

Pandas

Data Analysis

Demo Code

Wine Quality Prediction

MLOps-based wine quality prediction system with 97% accuracy, featuring MLflow integration and AWS deployment.

MLOps

MLflow

AWS

CI/CD

Demo Code

Contact

Get In Touch

Have a project in mind or interested in working together? I'd love to hear from you. Let's create something amazing.

Contact Information

Feel free to reach out through any of these channels

yashwanthreddy7178@gmail.com

Ready to discuss your project?

Connect with me

Send a Message

Fill out the form below and I'll get back to you as soon as possible