Data Scientist

Building intelligence
from data.

Mechatronics Engineer turned Data Scientist — specializing in |

Python SQL scikit-learn Keras Tableau Power BI
Shakil Ansari
01

About

I'm a data science practitioner with a multidisciplinary foundation in Mechatronics Engineering and academic training in Life Sciences (Botany). This unique blend allows me to approach problems at the intersection of engineering, biology, and computation with a systems-thinking mindset.


My journey into data science has been fueled by deep curiosity and a drive to build practical, impactful solutions. I've built end-to-end projects in machine learning, natural language processing, and time series forecasting — including a fake news detector, an RNN-based lyrics generator, and a stock price predictor.

My background in education and research has sharpened my ability to communicate complex concepts clearly — whether teaching high school mathematics or analyzing planetary data at Abyom SpaceTech.


I'm proficient in Python, SQL, scikit-learn, Keras, Pandas, Tableau, and Power BI. Currently expanding through training at the Boston Institute of Analytics and the IBM Data Science Professional Certificate.


I believe in lifelong learning, cross-disciplinary thinking, and building systems that balance precision with creativity. Long term, I aim to apply AI to challenges in education, automation, and ecological sustainability.

02

Projects

Dialogue Summarization
Abstractive Dialogue Summarization
Fine-tuned FLAN-T5 on DialogSum to generate abstractive summaries from conversational text using prompt-based modeling.
Recipe Generation
Indian Recipe Generation
Trained a language model to generate Indian food recipes from natural queries using domain-specific prompt engineering.
Movie Recommendation
Movie Recommendation
Collaborative filtering model using IMDb ratings to recommend movies tailored to user preferences.
Stellar Object Classifier
Stellar Object Classifier
Classified celestial objects using supervised ML models and neural networks on astronomical datasets.
Stock Price Forecasting
Stock Price Forecasting
ARIMA/SARIMA models trained on Yahoo finance time series for accurate stock price prediction and analysis.
Lyrics Generator
Lyrics Generator (LSTM)
RNN-LSTM trained on lyrics to generate creative text sequences — AI meets art.
Fake News Detection
Fake News Detection
Deep learning model using GloVe embeddings and LSTM to classify news articles as fake or true.
03

Skills

Python
SQL
Machine Learning
Deep Learning
NLP
Data Visualization
Time Series
Tableau
Power BI
Git
04

Experience

Educator
Vidhyanjali Rising Point, Delhi
Delivered science & math lessons aligned with CBSE curriculum, fostering curiosity and analytical thinking in students.
Research Trainee
Abyom SpaceTech
Researched planetary material diversity using NASA data, contributing to innovative space research projects.
Engineer Intern
Northern Indian Railways
Developed Axle Box Temperature Sensor for loco maintenance, improving safety and reliability in railway operations.
05

Education

B.Tech — Mechatronics Engineering
Delhi Institute of Tool Engineering · GPA 8.0/10.0 · 2024
Interdisciplinary foundation in mechanical systems, electronics, CS, and automation. Final project: Kinetic Tiles for Energy Harvesting.
B.Sc — Life Sciences (Botany)
IGNOU · GPA 6.0/10.0 · 2018–2021
Coursework in Ecology, Genetics, Cytology, Plant Physiology. Strong foundation in biological systems.
Data Science & AI Certification
Boston Institute of Analytics · 2024
Hands-on projects in machine learning, NLP, and forecasting using Python & Tableau.
IBM Data Science Professional Certificate
Coursera · 2025
Python, SQL, data analysis, visualization, machine learning, and project methodology.
06

Blog

May 2025
How I Built a Movie Recommendation System
A step-by-step guide to building a collaborative filtering recommendation engine using Python and real-world IMDb data.
Read more
April 2025
Getting Started with Time Series Forecasting
An introduction to time series analysis and forecasting with ARIMA/SARIMA models.
Read more
March 2025
NLP for Beginners: Text Generation with LSTM
Explore the basics of NLP and how to generate creative text using LSTM neural networks.
Read more
07

Get in Touch

I'm always open to discussing data science, collaboration opportunities, or just connecting. Feel free to reach out through any of the platforms below.

Email
ansarishakeel006@gmail.com


Based in
Delhi, India


Currently
Open to Data Science roles & collaborations