By Anansh Sinha
Hello, I’m Anansh Sinha. I have a Bachelor of Science degree in Statistics and Computer Science from Chandigarh University, the backbone of Data Analytics and Data Science. My journey with data began during college and evolved into a key element of my professional role.
Currently I am working as an IT & Operations Executive, where I focus on structuring, analyzing, and visualizing data to enable smarter, data-driven decisions across teams.
My primary skills are Python (and relevant libraries such as pandas, numpy, matplotlib, scikit-learn, etc.), Data Visualizations (with tools such as PowerBI and Tableau) , SQL,and Excel.
Below are selected projects that showcase my analytical skillset. Each project includes visualizations and a link to the GitHub repository for in-depth exploration.
The process of leaning never stops. In order to suppliment my skills as a data analyst, I am working on two main technologies for now:
A subsequent project using both is in works. I will be mapping, analysing and preparing data for transportation in Delhi, India, with a focus on analysing the current situation of the transport scene in the NCR, and provide relevant suggestions for any sort of improvement.
Performed an end-to-end exploratory and predictive analysis of sales data using Python and Power BI, aimed at understanding key drivers of sales success and assisting decision-makers in refining marketing and outreach strategies.
ABC Pvt. Ltd. (name anonymized) manufactures a Dual Fuel System (DFS) for diesel generators, which improves operational efficiency by enabling the engine to use both diesel and natural gas as an energy source. The product is relevant for regions with stringent emission norms, offering a cleaner and more compliant energy solution.
The analysis was mainly done in the Delhi NCR region, where regulations in respect to air pollution (e.g., GRAP) significantly influence generator operations, especially during the high-pollution months (October to January). Sales operations were divided among three salespersons. While all shared comparable experience levels, differences in performance pointed to other influencing factors.
The client base was diverse, including:
Tested multiple classification models to predict sales success. Below are ROC-AUC results:
Gradient Boosting was the best test performer. However, Logistic Regression had more consistent performance across folds, highlighting a balance of accuracy and generalization. Feature importance confirmed that Location, Client Type, and Amount per kVA were critical in predicting outcomes.
Analyzed SpaceX Falcon 9 missions using Python, Plotly Dash, and scikit-learn to explore launch behavior and predict landing outcomes. This project was part of the Capstone Project for completion of the IBM Data Science Professional Certificate (V3). Here are some of the observations:
Predictive Modeling:
Conclusion:
Landing outcome prediction is highly feasible using publicly available data. Launch cost estimation is also possible to a certain degree using features like orbit type, payload mass and other such relevant factors.
Conducted an end-to-end analysis using Python and regression modeling to predict academic performance for B.Sc. students (Computer Science, Statistics, and Mathematics). The goal was to recognize patterns that could guide early academic intervention and improve learning outcomes.
Data was collected anonymously from students in the Department of Mathematics, Chandigarh University, who were in their 5th semester of courses with common subjects, ensuring privacy and minimizing response bias. The study considered living situations, past educational background, attendance, and self-reported engagement levels to assess predictors of CGPA.
Lasso regularization improved model interpretability and minimized multicollinearity, while still capturing over 91% variance. Cross-validation confirmed its robustness with low error margins.