Tanmoy Saha
ID: CCE654651
About Tanmoy Saha
Analytical and results-driven Data Analyst with hands-on experience in data cleaning, visualization, statistical analysis, and predictive modeling. Developed a predictive model with 85% accuracy during an internship at MedTourEasy, improving donor forecasting efficiency. Proficient in Python, SQL, Power BI, and Excel, with practical knowledge of feature engineering, hypothesis testing, and machine learning.
Employment History
2025
MedTourEasy
1st Aug 2025 to 1st Sep 2025
Data Analyst Intern
Roles & Responsibilities
Engineered a predictive model, Predict Blood Donations, achieving 85% prediction accuracy using Python and real-world datasets.
Optimized the data preprocessing pipeline, reducing data cleaning time by 30% through automated feature engineering and efficient missing-value handling.
Visualized donor behavior patterns using Matplotlib and Seaborn, increasing the efficiency of extracting actionable insights by 40%.
Designed and delivered interactive dashboards for stakeholders, improving the effectiveness of data-driven decision-making by 25%.
Experimented with TPOT (AutoML) and Logistic Regression models, achieving an ROC-AUC of 0.7891 and improving predictive reliability across test samples.
Education
2025
Data Science & Machine Learning (Ongoing)
Scaler Academy
1st Mar 2024 to 1st Dec 2025
2018
BTech in Computer Science and Engineering
RCC Institute of Information Technology
1st Aug 2014 to 2nd Nov 2018
Expertise
SQL
5/5
NumPy
5/5
Jupyter Notebook
5/5
Google Colab
5/5
Pandas
5/5
SVM
4/5
Recommendation Systems
4/5
K-NN
4/5
Ensemble Learning
4/5
LightGBM
4/5
XGBoost
4/5
Gradient Boosting
4/5
Random Forest
4/5
Decision Trees
4/5
Linear Regression
4/5
Interactive Dashboards
4/5
Data Storytelling
4/5
Clustering (K-Means, Hierarchical)
4/5
HTML
4/5
MySQL
4/5
SQL Case Studies
4/5
Data Warehousing
4/5
ETL
4/5
BigQuery
4/5
SciPy
4/5
Scikit-Learn
4/5
Data transformation
4/5
A/B Testing
4/5
Git
4/5
Excel
4/5
GitHub
4/5
Descriptive Statistics
4/5
Inferential Statistics
4/5
Hypothesis Testing
4/5
Regression Analysis
4/5
Correlation Analysis
4/5
Probability
4/5
Data Cleaning
4/5
Feature Engineering
4/5
Outlier Detection
4/5
Statistical Modeling
4/5
Tableau
4/5
Python
4/5
Matplotlib
4/5
Seaborn
4/5
Plotly
4/5
ggplot2
4/5
API Data Extraction
3/5
Keras (basic)
3/5
TensorFlow (basic)
3/5
Regex
3/5
CSS
3/5
PHP
3/5
Cloud Databases
3/5
Data Pipeline Development
3/5
Oracle SQL
3/5
Time Series Analysis
3/5
Power BI
3/5
Logistic Regression
3/5
Dash
3/5
PowerShell
0/5
Languages
Bengali
Verbal
5/5
Written
5/5
English
Verbal
4/5
Written
4/5

