Shakil Ansari

Shakil Ansari

Entry-Level Data Scientist | Mechatronics & Botany
NLP · Time Series · Forecasting · Deep Learning

Python SQL scikit-learn Keras Tableau Power BI

About Me

I’m a data science practitioner with a multidisciplinary foundation in Mechatronics Engineering and academic training in Life Sciences (Botany). This unique blend allows me to approach problems at the intersection of engineering, biology, and computation with a systems-thinking mindset.

My journey into data science has been fueled by deep curiosity and a drive to build practical, impactful solutions. I’ve built end-to-end projects in machine learning, natural language processing, and time series forecasting — including a fake news detector using LSTM & GloVe embeddings, an RNN-based lyrics generator, and a stock price predictor using ARIMA/SARIMA. I enjoy exploring both statistical rigor and neural architectures to solve real-world problems.

My background in education and research has sharpened my ability to communicate complex concepts clearly — whether it’s teaching high school mathematics and physics or analyzing planetary data during my time as a research trainee at Abyom SpaceTech. These experiences have helped me cultivate strong analytical thinking, pattern recognition, and an instinct for structure and clarity in data.

I’m proficient in tools like Python, SQL, scikit-learn, Keras, Pandas, Tableau, and Power BI, and regularly work with Git and Jupyter notebooks. I’m currently expanding my knowledge through formal training at the Boston Institute of Analytics and the IBM Data Science Professional Certificate.

I believe in lifelong learning, cross-disciplinary thinking, and building systems that balance precision with creativity. Whether I’m writing code, training a model, designing a dashboard, or mentoring a student, I approach every task with intensity and curiosity. In the long term, I aim to apply AI to solve challenges in education, automation, and ecological sustainability.

Projects

Abstractive Dialogue Summarization using Transformers Abstractive Dialogue Summarization using Transformers Abstractive Dialogue Summarization using Transformers

Fine-tuned FLAN-T5 on DialogSum to generate abstractive summaries from conversational text using prompt-based modeling.

gpt2_indian_food_recipe.png Domain-Specific NLP: Indian Recipe Generation

Trained a language model to generate Indian food recipes from natural queries using domain-specific prompt engineering.

Movie Recommendation Movie Recommendation

Collaborative filtering model using IMDb ratings dataset to recommend movies tailored to user preferences.

Stellar Object Classifer Stellar Object Classifier

Classified celestial objects using supervised ML models and neural networks on astronomical datasets.

Lyrics Generator Lyrics Generator (LSTM)

RNN-LSTM trained on lyrics to generate creative text sequences, exploring the intersection of AI and art.

Fake News Detection Fake News Detection (GloVe + LSTM)

Developed a deep learning model using GloVe word embeddings and LSTM networks to classify news articles as fake or true. Trained on a real-world news dataset, achieving high accuracy in detecting misinformation.

Stock Price Forecasting Stock Price Forecasting Stock Price Forecasting

ARIMA/SARIMA models trained on Yahoo finance time series for accurate stock price prediction and analysis.

Skills

Python SQL Machine Learning Deep Learning NLP Data Visualization Time Series Tableau Power BI Git

Experience

Educator
Vidhyanjali Rising Point, Delhi

Delivered science & math lessons aligned with CBSE curriculum, fostering curiosity and analytical thinking in students.

Research Trainee
Abyom SpaceTech

Researched planetary material diversity using NASA data, contributing to innovative space research projects.

Engineer Intern
Northern Indian Railways

Developed Axle Box Temperature Sensor for loco maintenance, improving safety and reliability in railway operations.

Education

B.Tech – Mechatronics Engineering
Delhi Institute of Tool Engineering, India | GPA: 8.0/10.0 | 2024 – 2024

Interdisciplinary foundation in mechanical systems, electronics, computer science, and automation.
Final-year project: Kinetic Tiles for Energy Harvesting.

B.Sc – Life Sciences (Botany)
Indira Gandhi National Open University | GPA: 6.0/10.0 | 2018 – 2021

Coursework included Ecology, Genetics, Cytology, Plant Physiology. Built a strong foundation in biological systems and environmental interaction.

Certification – Data Science & AI
Boston Institute of Analytics | 2024

Weekend training program with hands-on projects in machine learning, NLP, and forecasting using Python & Tableau.

IBM Data Science Professional Certificate
Coursera | 2025

Completed modules on Python, SQL, data analysis, visualization, machine learning, and project methodology.

Blog

How I Built a Movie Recommendation System
May 2025
A step-by-step guide to building a collaborative filtering recommendation engine using Python and real-world IMDb data. Includes code, evaluation, and deployment tips.
Read more
Getting Started with Time Series Forecasting
April 2025
An introduction to time series analysis and forecasting with ARIMA/SARIMA models. Learn how to prepare data, select models, and interpret results.
Read more
NLP for Beginners: Text Generation with LSTM
March 2025
Explore the basics of natural language processing and how to generate creative text using LSTM neural networks. Includes code snippets and practical advice.
Read more

Contact & Links