Bhoumik Shah, Developer in Bengaluru, Karnataka, India
Bhoumik is available for hire
Hire Bhoumik

Bhoumik Shah

Verified Expert  in Engineering

Operations Research Scientist and Developer

Location
Bengaluru, Karnataka, India
Toptal Member Since
September 13, 2022

Bhoumik is a machine learning and operations research specialist with six years in AI, notably in ads moderation, linehaul cost optimization, and McKinsey's routing tool. He's excelled in cost reduction and system efficiency, adept with jsprit, OR-Tools, and ML model development. With both a bachelor's and master's degree in mechanical engineering, with a focus on operations research, Bhoumik also holds certifications in deep learning and related fields.

Portfolio

Amazon India
Artificial Intelligence (AI), H20, Machine Learning, Gradient Boosting, Spark...
Amazon India
Operations Research, Optimization, Vehicle Routing, Network Optimization...
McKinsey & Company
Supply Chain, Supply Chain Optimization, Supply Chain Management (SCM)...

Experience

Availability

Part-time

Preferred Environment

OR-Tools, Jsprit, PyCharm, IntelliJ IDEA, H20, Spark, Deep Learning, PyTorch

The most amazing...

...thing I've designed is Amazon's ad system, which moderated over 5 billion ads and reduced the need for human checks by 60%

Work Experience

Applied Scientist

2021 - 2023
Amazon India
  • Developed automated moderation systems to streamline the review of more than 5 billion ads annually.
  • Led a team of three in enhancing current machine learning systems, cutting down the number of ads requiring manual moderation by 60%.
  • Achieved a $500 million annual reduction in shipping invoice estimation errors by training a gradient boosting machine (GBM) model using H2O to handle a large training dataset of 250 million samples.
Technologies: Artificial Intelligence (AI), H20, Machine Learning, Gradient Boosting, Spark, PyTorch, Generative Pre-trained Transformers (GPT), Natural Language Processing (NLP), Data Science, Deep Learning, Large Language Models (LLMs), Amazon Web Services (AWS), Data Analysis, Predictive Modeling, OpenAI GPT-3 API, Technical Leadership, Haystack, Retrieval-augmented Generation (RAG), Trend Analysis, Advertising Technology (Adtech), Machine Learning Operations (MLOps), Neural Networks, Algorithms, Retail & Wholesale, Inventory, CSV File Processing, Full-stack, Web Development, Bedrock, Generative Pre-trained Transformer 3 (GPT-3), Consulting, Leadership, SQL, Product Management, PySpark, Data Science Product Manager, GitLab, AI Programming, AI Model Training, Pandas, NumPy, NoSQL, FastAPI, Neo4j, Graph Databases, Python 3, Models, Generative Artificial Intelligence (GenAI), Generative AI, Scikit-learn, Classifier Development, Supervised Learning, Teamwork, Regression, Data Engineering, Web Scraping, Spark ML, Optical Character Recognition, Workflow Automation, TensorFlow, eCommerce, Analytics, Sentiment Analysis, Modeling, Data Collection, Big Data, C#, Document Parsing, Minimum Viable Product (MVP)

Operations Research Scientist

2019 - 2021
Amazon India
  • Built a dispatch time optimizer for the Amazon India network to deliver 10% of shipments one day faster. Used concepts of multi-agent learning and mixed-integer linear programming to solve the NP-complete problem.
  • Used concepts of vehicle routing optimization and built a line-haul route planner tool for the Amazon line-haul network. It combined multiple lanes to increase vehicle utilization. This resulted in an annual freight cost reduction of $10 million.
  • Created a workforce scheduling tool using constraint programming to automate and optimize human resources deployment at the Amazon sort center, which resulted in an 8% reduction in the workforce.
Technologies: Operations Research, Optimization, Vehicle Routing, Network Optimization, Python, Java, Gurobi, CPLEX, Amazon Web Services (AWS), Data Analysis, Data Science, Time Series, Predictive Modeling, Technical Leadership, Machine Learning, Trend Analysis, Machine Learning Operations (MLOps), Neural Networks, Algorithms, Retail & Wholesale, Inventory, CSV File Processing, Consulting, SQL, Product Management, PySpark, Data Science Product Manager, GitLab, AI Model Training, Pandas, NumPy, NoSQL, FastAPI, Python 3, Models, Location Services and Maps, Spatial Databases, Classifier Development, Teamwork, Regression, Data Engineering, Web Scraping, Software Architecture, Workflow Automation, eCommerce, Generative Pre-trained Transformers (GPT), Analytics, Grocery Delivery, Modeling, Data Collection, Big Data, Document Parsing, Minimum Viable Product (MVP)

Knowledge Analyst

2017 - 2019
McKinsey & Company
  • Developed a state-of-the-art tool to solve vehicle routing problems with real-life constraints using heuristic algorithms. Deployed the solution as a web application and implemented it in over 15 client scenarios spanning various industries.
  • Collaborated with multiple global clients for topology and production planning optimization.
  • Developed a routing algorithm for solving ship container routing for a leading petrochemical producer.
Technologies: Supply Chain, Supply Chain Optimization, Supply Chain Management (SCM), Data Analysis, Data Science, Time Series, Forecasting, Predictive Modeling, Technical Leadership, Machine Learning, Trend Analysis, Finance, Neural Networks, Algorithms, Retail & Wholesale, Inventory, CSV File Processing, Consulting, SQL, Product Management, Data Science Product Manager, GitLab, AI Programming, AI Model Training, Pandas, NumPy, NoSQL, FastAPI, Python 3, Models, Location Services and Maps, Spatial Databases, Classifier Development, Supervised Learning, Teamwork, Regression, Data Engineering, Web Scraping, Spark ML, Software Architecture, Workflow Automation, Analytics, Grocery Delivery, Modeling, Data Collection, Big Data

Routing Optimization

I built a state-of-the-art routing optimization tool using jsprit, OR-Tools, and a proprietary algorithm to optimize outbound routing for an eCommerce giant. The tool resulted in an annual cost reduction of USD 10 million.

Automatic Grading of Handwritten Tests

https://qn-a-eval.vercel.app/
The automatic grading system is an innovative tool designed to assess handwritten answer sheets with high precision. Utilizing state-of-the-art optical character recognition (OCR) technology, the system deciphers handwritten text and efficiently converts it into digital format. Beyond this conversion, smart post-processing ensures accuracy, addressing common issues associated with handwriting variations. Once the answers are digitized, the system employs large language models (LLMs) to evaluate the content, ensuring a comprehensive understanding of the answer's context and relevance. The integration of these technologies provides a swift and accurate grading process, reducing manual effort and increasing consistency in evaluation.

Ads Moderation System

Developed automated moderation systems to streamline the review of over 5 billion ads per year. Also, I led a team of three in enhancing current machine learning systems, cutting down the volume of ads requiring manual moderation by 60%.

Virtual Companion

http://t.me/mysaraAi_bot
The Telegram virtual companion bot is a groundbreaking chatbot leveraging the power of large language models (LLMs) to foster human-like interactions. With the integration of a long-term memory feature, the bot not only engages in real-time conversations but also personalizes interactions over time based on previous exchanges. Hosted on AWS Lambda for efficient serverless deployment, this bot ensures seamless and scalable interactions. Its exceptional conversational capabilities have resonated with users, amassing over 5,000 active users within just the first week of its launch.

YT Chat

https://yt-chat.com/
I built a chatbot that enables users to chat with any YouTube channel. I built both the front and the back end.

Stack:
• Front end: React, Tailwind, and shadcn
• Authentication: Clerk
• Database: MongoDB
• Back end: Python with FastAPI
• RAG: LlamaIndex, Pinecone vector database, and OpenAI

Real-time Transcription for Medical Industry

A full-stack, real-time transcription service. The product records conversations between doctors and patients in real time.

Stack:
• Front end: React, Tailwind, and shadcn
• Back end: Python and FastAPI
• Transcription: AssemblyAI
• Summarisation: OpenAI
2012 - 2017

Master's Degree in Mechanical Engineering

Indian Institute of Technology Bombay - Mumbai, India

AUGUST 2018 - PRESENT

Deep Learning

Coursera

JULY 2018 - PRESENT

Structuring Machine Learning Projects

Coursera

MAY 2018 - PRESENT

Introduction to Data Science in Python

Coursera

Libraries/APIs

Pandas, NumPy, Scikit-learn, OpenCV, Spark ML, TensorFlow, PyTorch, React, Telegram Bot API, PySpark, Java Natural Language Processing (JNLP), Google Speech-to-Text API

Tools

GitLab, PyCharm, IntelliJ IDEA, ChatGPT, Haystack, Azure OpenAI Service, Gurobi, CPLEX, Amazon SageMaker

Frameworks

Bedrock, Next.js, Flask, LlamaIndex, Selenium, Spark, Tailwind CSS, Scrapy

Languages

Python, SQL, Python 3, Java, JavaScript, C#

Paradigms

Data Science, Spatial Databases

Industry Expertise

Retail & Wholesale

Platforms

Amazon Web Services (AWS), Docker, Azure Functions, H20, AWS Lambda, Azure

Storage

NoSQL, Neo4j, Graph Databases, Amazon DynamoDB, MongoDB

Other

OR-Tools, Jsprit, Operations Research, Supply Chain, Supply Chain Optimization, Supply Chain Management (SCM), Machine Learning, Optimization, Vehicle Routing Problem (VRP), Vehicle Routing, Natural Language Processing (NLP), Artificial Intelligence (AI), Data Analysis, Technical Leadership, Retrieval-augmented Generation (RAG), Neural Networks, Algorithms, Inventory, CSV File Processing, Generative Pre-trained Transformer 3 (GPT-3), Consulting, Leadership, Data Science Product Manager, AI Programming, AI Model Training, FastAPI, Models, Location Services and Maps, Generative Artificial Intelligence (GenAI), LoRa, Hugging Face, Generative AI, Classifier Development, Supervised Learning, Teamwork, Regression, Pinecone, Data Engineering, Artificial General Intelligence (AGI), Data Scraping, Chatbots, Open-source LLMs, Workflow Automation, eCommerce, Analytics, APIs, Vectorization, Sentiment Analysis, Modeling, Data Collection, Big Data, Minimum Viable Product (MVP), Simulations, Machine Learning Operations (MLOps), Large Language Models (LLMs), LangChain, Time Series, Predictive Modeling, OpenAI GPT-3 API, Prompt Engineering, OpenAI GPT-4 API, Full-stack Development, AI Chatbots, SaaS, Trend Analysis, Advertising Technology (Adtech), Full-stack, Product Management, Chatbot Conversation Design, Web Scraping, PDF Scraping, Software Architecture, Embeddings from Language Models (ELMo), Fine-tuning, Computer Vision, Optical Character Recognition, Large Language Model Operations (LLMOps), Grocery Delivery, Document Parsing, Deep Learning, Convolutional Neural Networks (CNN), Recurrent Neural Networks (RNNs), Network Optimization, Generative Pre-trained Transformers (GPT), Gradient Boosting, OCR, OpenAI, Forecasting, Finance, Web Development, Legal Technology (Legaltech), Travel, ShadCn, Speech to Text, Speech to Text AI

Collaboration That Works

How to Work with Toptal

Toptal matches you directly with global industry experts from our network in hours—not weeks or months.

1

Share your needs

Discuss your requirements and refine your scope in a call with a Toptal domain expert.
2

Choose your talent

Get a short list of expertly matched talent within 24 hours to review, interview, and choose from.
3

Start your risk-free talent trial

Work with your chosen talent on a trial basis for up to two weeks. Pay only if you decide to hire them.

Top talent is in high demand.

Start hiring