Tech Background

I'm Jatin Mehra

View My Work

Name: Jatin Mehra

Profile: Data Scientist

Email: jatinmehra119@gmail.com

Phone: (+91) 9910364780

GitHub: Jatin-Mehra119

Languages, Frameworks, and Tools
  • Language: Python, C++, C, Java
  • Database: MySQL
  • Frameworks: Pandas, Scikit-learn, Numpy, Streamlit, Tensorflow
  • Visualization: Matplotlib, Seaborn
  • API Integration: FastAPI, FlaskAPI
  • Tools: Jupyter Notebook, Visual Studio, Anaconda, Power BI, Microsoft Excel, Tableau

About Me

Results-driven Data Scientist with expertise in machine learning, natural language processing (NLP), and automation. Currently pursuing a Master of Science in Data Science, I specialize in building AI-powered solutions that enhance business efficiency and automate workflows. My portfolio showcases projects in predictive analytics, AI-driven automation, chatbots, and financial forecasting. Proficient in Python, Scikit-learn, Pandas, NumPy, SQL, Streamlit, and cloud computing.

Resume

My professional background

Summary

Jatin Mehra

Results-driven Data Scientist with expertise in machine learning, NLP, and automation. Specializing in AI-powered solutions that enhance business efficiency and automate workflows.

Education

Master of Science in Data Science

2025 - Present

Chandigarh University, Punjab, India

Bachelor of Computer Applications

2022 - 2025

Chandigarh University, Punjab, India

CGPA: 7.89

Professional Experience

Sales Operations Analyst

Dec 2022 - Present

NB ENTERPRISER (India Today Group), Delhi, India

  • Designed daily inventory analytical reports using Excel, improving decision-making.
  • Built KPI dashboards in Tableau, enhancing strategic insights for leadership.
  • Automated bank statement reconciliation for 1000+ transactions/month, reducing manual work by 80% and 5x faster processing using pdfplumber, OpenPyXL, and Pandas.
  • Developed the WMPL SAP Report Generator, a Python & Streamlit tool for automated SAP-compatible Excel reports.

Technical Skills

Programming
  • Python
  • SQL
  • Java
ML & AI
  • Scikit-learn
  • PyTorch
  • Hugging Face Transformers
  • Pandas & NumPy
Web & API
  • FastAPI
  • Streamlit
  • Flask
DevOps & Cloud
  • Git & Docker
  • Google Cloud Platform
  • Hugging Face Spaces
  • GitHub Actions

Projects

Explore my work across various domains of data science and AI

  • All Projects
  • Generative AI
  • Machine Learning
  • Data Analysis
  • NLP
  • Computer Vision

CrawlGPT 🤖

A powerful web content crawler with LLM-powered RAG (Retrieval Augmented Generation) capabilities. CrawlGPT extracts content from URLs, processes it through intelligent summarization, and enables natural language interactions using modern LLM technology.

Docker Sentence Transformers FAISS Playwright SQLAlchemy Web Scraping (Crawl4ai)

🎥 AI-Powered YouTube Video Summarizer & Fact-Checker

This Web APP extracts captions from YouTube videos, generates summary, text embeddings, and allows users to search within podcast transcripts. It also refines the context and fact-checks claims using AI models and web crawlers.

GROQ(ollama 3.1) Docker Sentence Transformers FAISS FastAPI Web Scraping (Crawl4ai)

PDF Insight Pro: RAG APP

PDF Insight Pro is a Streamlit-based web application that allows users to upload PDF documents and interact with them using AI-driven insights. The application processes PDFs to extract text and uses a language model to answer user queries about the content of the documents. Users can adjust model parameters, manage their uploaded documents, and interact with the AI to gain insights from the PDFs.

Android(Java) Python Docker PyPDF2 Groq Streamlit

Plagiarism detector using Fine-tuned smolLM2 135M LLM

The smolLM2 135M Ins. MODEL was fine-tuned on the MIT Plagiarism Detection Dataset for improved performance in identifying textual similarities. This model provides binary classification outputs, indicating if two given documents are plagiarized or original.

Pytorch Transformers scikit-learn PyMuPDF huggingface-hub LLM Fine Tuning

Automated Essay Scoring System

Developed a state-of-the-art AI model for automated essay evaluation as part of the Kaggle AES competition. The project aimed to reduce manual grading effort and enhance the feedback process for students and educators.

Pytorch Transformers scikit-learn PyMuPDF huggingface-hub LLM Fine Tuning

Crypto Intelligence Pro

Crypto Intelligence Pro is an advanced market analysis tool for cryptocurrencies. It provides real-time price trends, sentiment analysis, technical indicators, and AI-powered market forecasts.

Web Scraping(Crawl4ai) Pandas plotly Deployed on Google Cloud Platform Technical Analysis GenAI

Return of Equity analysis(CN regions)

This project aims to predict the Return on Equity (ROE) of companies based on various financial metrics using a machine learning model. The project includes data analysis, model training, and a web application for making predictions.

Heart Disease Predictor

This app provides predictions on the presence or absence of heart disease based on input parameters.

Streamlit Pandas Numpy scikit-learn A/B Testing Data Analysis

BrisT1D Blood Glucose Prediction Competition

Forecasted blood glucose levels one hour ahead using LightGBM regression, targeting participants with type 1 diabetes. Preprocessed data, addressed missing values, and engineered cyclical time-based features for time-series prediction.

LightGBM Hyperparameter Tuning Numpy scikit-learn Pandas MLflow

Email Spam/Ham Classifier

This application leverages the power of machine learning to determine whether an entered email or text is spam or ham.

SMS Spam Detection

Spam or Ham?

Loan Approval Prediction

Built a machine learning pipeline to predict loan approvals using advanced ensemble models like XGBoost, CatBoost, and LightGBM. Deployed a FastAPI application for real-time predictions, achieving a 96% ROC-AUC score.

Flight Price Prediction

The objective of the study is to analyse the flight booking dataset obtained from "Ease My Trip" website and to conduct various statistical hypothesis tests in order to get meaningful information from it.

Paris House Price Prediction

This project highlights my skills in data cleaning, visualization, feature engineering, preprocessing data using custom transformers, and machine learning algorithms.

Cars Dataset- Analysis and Model Training

We are required to model the price of cars with the available independent variables. It will be used by the management to understand how exactly the prices vary with the independent variables.

Gas Turbine Electricity Prediction with LSTM Neural Networks

Developed a deep learning solution for predicting gas turbine electricity output using LSTM neural networks. The model processes time-series data to forecast power generation with high accuracy (RMSE < 370), outperforming traditional prediction methods and reducing the need for manual monitoring.

TensorFlow LSTM Time Series Deep Learning

Cat/Dog Image Classifier

In this project I trained a model by using transfer learning and data augmentation to improve generalization. It uses MobileNetV2 as the base model with pre-trained weights from ImageNet then fine-tuned to enhance its accuracy.

TensorFlow Transfer Learning MobileNetV2 Image Classification Data Augmentation Streamlit

Bike Rentals Dataset

This repository focuses on optimizing bike rental availability during peak hours and days using machine learning techniques.

Churn Modeling

This repository contains scripts for building and evaluating a model to predict customer churn.

Apps

Interactive applications I've developed

CrawlGPT 🤖

A powerful web content crawler with LLM-powered RAG (Retrieval Augmented Generation) capabilities. CrawlGPT extracts content from URLs, processes it through intelligent summarization, and enables natural language interactions using modern LLM technology.

Try it live

Crypto Intelligence Pro

Crypto Intelligence Pro is an advanced market analysis tool for cryptocurrencies. It provides real-time price trends, sentiment analysis, technical indicators, and AI-powered market forecasts.

Try it live

PDF Insight Pro:RAG APP

PDF Insight Pro is a web application that allows users to upload PDF documents and interact with them using AI-driven insights.

Try it live

APP for Plagiarism Detection using LLM

The app leverages a custom fine-tuned version of the SmolLM (135M parameters) that has been trained on the MIT Plagiarism Detection Dataset for improved performance in identifying textual similarities. This model provides binary classification outputs, indicating if two given documents are plagiarized or original.

Try it live

Essay Scorer Pro

Essay Scorer Pro is a Streamlit-based web application that uses AI to score essays on a scale of 1 to 6. Users can input essay text directly or upload a CSV file containing essays to get predictions. The model used for scoring is based on a pre-trained model from Hugging Face.

Try it live

Cat/Dog Classifier APP

This app is a part of my project. It can classify Cats and Dogs images accurately.

Try it live

Heart Disease Predictor

A Web App that predicts whether the person has heart disease or not using Machine Learning.

Try it live

APP for Detecting SPAM/HAM Email

A Web App that predicts whether the entered text/E-mail is Spam or not, Using Logistic Regression Algorithm

Try it live

Blogs

Introduction to Machine Learning

Learn the fundamentals of machine learning, including its definition, types, and real-world applications.

End-to-End Machine Learning Project

A step-by-step journey through an End-to-End Machine Learning project, from data collection and preprocessing to model deployment and evaluation..

Understanding Linear Regression Algorithm

In this blog, we will first look at the mathematical jargons then we will use these jargons to train a linear regression model without using any library like scikit-learn.

Understanding Gradient Descent-A beginners Guide

If you’ve ever dipped your toes into the world of machine learning or optimization, you’ve likely encountered the term “gradient descent.” This powerful algorithm is a cornerstone in the field of artificial intelligence...

Contact

Connect with me

Address

West Delhi, Delhi, India

Phone Number

+91 9910364780

Email

jatinmehra119@gmail.com