I am a data science professional with a robust foundation in computer science, currently pursuing a Master’s in Data Analytics at San Jose State University. With over four years of industry experience, I have consistently delivered impactful solutions — from creating dynamic visualizations to engineering scalable data systems. My journey has been marked by roles of increasing responsibility, including leading teams, optimizing processes, and leveraging cutting-edge tools to solve complex challenges.
Previously at Microsoft via Accenture, I spearheaded the development of 50+ Power BI reports and predictive models, enhancing decision-making efficiency and operational effectiveness across departments.
My academic projects are where real-world chaos meets technical curiosity — and sparks fly. I’ve analyzed large-scale crash data to uncover patterns in public safety, built real-time energy forecasting systems using distributed streaming architectures, and developed personality-aware emotion generation models using neural networks. I’ve also designed a mock interview platform that leverages large language models and real-time speech analysis, simulated Uber-inspired systems for ride-sharing and food delivery, and engineered malware detection pipelines that combine traditional machine learning with generative AI for explainable threat reasoning.
✨ Always learning and ready for the next data challenge!
Seasoned Senior Data Analyst with 4+ years of experience driving business strategies,backend development, building data pipelines, and data analysis. Proficient in the latest AI/ML technologies, with a proven ability to drive business outcomes through data insights.
San Jose, CA
San Jose, CA
Hyderabad, India
Hyderabad, India
Hyderabad, India
Hyderabad, India
Built an end-to-end AI mock interview platform enabling company-specific, role-aware, multi-turn interviews powered by fine-tuned LLaMA-3, Mistral-7B, Gemma-7B, and Qwen2.5-3B models. Integrated RAG (Pinecone + SBERT), Gemini NER, and Whisper for voice-based Q&A. Achieved 2× more specific feedback vs. baseline and <1.12s response latency using Qwen2.5-3B (PPL 4.64, LLM-Judge 3.9 of 5). Full-stack deployment via React, FastAPI, MongoDB Atlas, BigQuery, and Cloud Run with analytics dashboards.
Developed a hybrid LightGBM + LLM reasoning pipeline using RAG to classify PE malware with high interpretability. Achieved F1 = 0.916, Recall = 0.917, and ROC-AUC ≈ 0.976. Produced human-readable threat explanations, reducing false negatives by ~20% over ML-only baselines.
Engineered a personality-aware emotion prediction model using BERT/DeBERTa embeddings, BiGRU context modeling, and GNN fusion with OCEAN traits + VAD signals. Achieved +6 macro-F1 over transformer baselines and mood regression MSE = 0.053.
Designed a Kafka-based distributed system with SQL caching and ML models for scalable fare estimation and dynamic pricing.
Built an ML-based anomaly detection system with Random Forest, KNN, XGBoost, and Streamlit dashboard for real-time results.
Developed a containerized platform using Django, React, Kubernetes, and AWS with secure auth, order tracking, and MongoDB integration.
Real-time energy prediction with Apache Kafka, Spark, Cassandra, and RF. Added privacy with LSH, K-anonymity, and machine unlearning.
Processed over 50,000 records with BigQuery and visualized flight delay patterns using Neo4j, Seaborn, and Matplotlib.
Analyzed 155K crash records and built interactive dashboards to showcase accident trends, damages, and fatalities.
Below are the details to reach out to me!
San Jose, CA
vaishunan4@gmail.com
vaishnavi.nanduri@sjsu.edu