Shashank Raj, Developer in Pune, Maharashtra, India
Shashank is available for hire
Hire Shashank

Shashank Raj

Data Scientist and Developer

Pune, Maharashtra, India

Toptal member since May 26, 2026

Bio

Shashank is a results-oriented data scientist and AI engineer with 14+ years of experience specializing in building intelligent systems, scalable ML workflows, and production-grade AI pipelines. He's an expert in deploying agentic frameworks, generative AI, and RAG architectures. He transforms complex data into actionable business insights, driving efficiency gains of up to 85% through automation and predictive modeling.

Portfolio

EPAM Systems
Python, Go, SQL, SQLAlchemy, Pandas, FastAPI, Docker, Kubernetes, Git, Linux...
Syngenta
Python, Git, Windows, Linux, SQL, SQLAlchemy, Scikit-learn, TensorFlow...
LatentBridge
Data Science, Machine Learning, SQL, SQLAlchemy, Git, Python, Windows, Linux...

Experience

  • Windows - 18 years
  • Linux - 12 years
  • Python - 8 years
  • SQL - 8 years
  • Data Science - 5 years
  • Machine Learning - 5 years
  • SQLAlchemy - 5 years
  • TensorFlow - 4 years

Preferred Environment

Windows, Linux, Python, Go, Generative Artificial Intelligence (GenAI)

The most amazing...

...solution I've developed is DevTrack, a local-first developer automation platform that converts a problem statement into a fully planned sprint.

Work Experience

Lead Software Engineer

2025 - PRESENT
EPAM Systems
  • Developed and maintained production-grade Python code supporting MLOps/LLMOps workflows across model training, deployment, and monitoring.
  • Built AI-driven applications using LangChain, LangGraph, and LangSmith. Designed and deployed Agentic AI architectures incorporating MCP and Google A2A protocols.
  • Developed and integrated RAG pipelines with prompt engineering to solve complex business challenges using Generative AI.
  • Architected distributed microservices systems for large-scale batch and stream processing analytics pipelines.
Technologies: Python, Go, SQL, SQLAlchemy, Pandas, FastAPI, Docker, Kubernetes, Git, Linux, Scikit-learn, TensorFlow, Artificial Intelligence (AI), Machine Learning Operations (MLOps), Large Language Models (LLMs), Natural Language Processing (NLP), Amazon Web Services (AWS), LangChain, LangGraph, Retrieval-augmented Generation (RAG)

Senior Analyst, Run and Support

2021 - 2026
Syngenta
  • Orchestrated the MLOps migration to KServe and Kubeflow, saving millions in infrastructure costs and improving operational efficiency by 30% through advanced hyperparameter tuning.
  • Developed data models for agricultural tech initiatives, improving decision-making reliability and scaling ML systems by 40%.
  • Established ML monitoring frameworks with automated scripts, reducing production errors by 30%.
  • Led cross-functional collaboration on data-driven agricultural solutions, saving 10+ weekly hours through 50% automation.
Technologies: Python, Git, Windows, Linux, SQL, SQLAlchemy, Scikit-learn, TensorFlow, Data Science, Machine Learning, Artificial Intelligence (AI), Machine Learning Operations (MLOps), Natural Language Processing (NLP), Amazon Web Services (AWS)

Senior Manager, Data Science

2025 - 2025
LatentBridge
  • Architected a production-ready natural language to SQL engine for client-facing applications, reducing research time from 4+ hours to minutes and boosting query efficiency by 85% through intelligent metadata preprocessing and adaptive caching.
  • Engineered an agentic AI research application with multi-LLM orchestration and LangGraph, cutting manual investigation time by 80%.
  • Implemented NLTK and SpaCy-based intent recognition engines, improving query accuracy by 40% and optimizing human-AI task distribution.
  • Designed scalable agentic RAG frameworks with Mixtral and Azure vector embeddings, enhancing information extraction efficiency by 45% across business functions.
Technologies: Data Science, Machine Learning, SQL, SQLAlchemy, Git, Python, Windows, Linux, Artificial Intelligence (AI), Machine Learning Operations (MLOps), Large Language Models (LLMs), Natural Language Processing (NLP), Amazon Web Services (AWS), LangChain, LangGraph, Retrieval-augmented Generation (RAG)

Lead Data Engineer

2024 - 2025
Daas Labs
  • Designed a scalable ETL system using Python and PySpark, processing 500+ GB of daily data and improving throughput by 50% to enable real-time AI model training.
  • Built a text-to-SQL prototype for complex enterprise schemas, with metadata caching that increased query accuracy by 35%.
  • Implemented an MLflow-based model deployment framework with versioning, reducing deployment downtime by 40%.
  • Optimized data transformation workflows with caching and parallel processing, improving computational efficiency by 45%.
Technologies: Python, JavaScript, Git, Windows, Linux, SQL, SQLAlchemy, AWS ECS Fargate, Artificial Intelligence (AI), Machine Learning Operations (MLOps), Large Language Models (LLMs), Natural Language Processing (NLP), Amazon Web Services (AWS)

Senior Manager, Data and Analytics

2023 - 2024
Exalogic Consulting
  • Developed advanced time series forecasting and enterprise planning algorithms, reducing insight generation time from one week to minutes (75% efficiency gain).
  • Engineered a cloud-agnostic migration strategy to Kubernetes and Docker microservices, enhancing scalability by 30%.
  • Built integration engines for automated data workflows, reducing ML deployment time by 35%.
  • Implemented GDPR/CCPA-compliant data governance frameworks, reducing compliance risks by 25%.
Technologies: Python, Git, Windows, Linux, SQL, SQLAlchemy, Scikit-learn, TensorFlow

Experience

DevTrack

https://www.devtrack.cloud
DevTrack is a developer automation tool that monitors your Git activity, watches your repository for commits and scheduled time intervals, prompts you for work updates, asks you what you're working on at key moments, and processes them with AI (using natural language processing to understand your updates). It updates your tasks (automatically updates Azure DevOps, GitHub, and other systems), and creates daily/weekly reports of your work.

The system works as your personal developer assistant, running locally on your machine, learning your communication style, and helping you stay organized without manual data entry.

Education

2019 - 2021

Master's Degree in Data Science

BITS Pilani - Pune, MH, India

Skills

Libraries/APIs

SQLAlchemy, Pandas, Scikit-learn, TensorFlow

Tools

Git

Languages

Python, Go, SQL, JavaScript

Platforms

Windows, Linux, Amazon Web Services (AWS), Docker, Kubernetes

Frameworks

LangGraph

Other

Data Science, FastAPI, Artificial Intelligence (AI), Machine Learning Operations (MLOps), Large Language Models (LLMs), Natural Language Processing (NLP), LangChain, Retrieval-augmented Generation (RAG), Machine Learning, AWS ECS Fargate, Generative Artificial Intelligence (GenAI)

Collaboration That Works

How to Work with Toptal

Toptal matches you directly with global industry experts from our network in hours—not weeks or months.

1

Share your needs

Discuss your requirements and refine your scope in a call with a Toptal domain expert.
2

Choose your talent

Get a short list of expertly matched talent within 24 hours to review, interview, and choose from.
3

Start your risk-free talent trial

Work with your chosen talent on a trial basis for up to two weeks. Pay only if you decide to hire them.

Top talent is in high demand.

Start hiring