Bhoumik Shah, Developer in Bengaluru, Karnataka, India
Bhoumik is available for hire
Hire Bhoumik

Bhoumik Shah

Verified Expert  in Engineering

Operations Research Scientist and Developer

Bengaluru, Karnataka, India

Toptal member since September 13, 2022

Bio

Bhoumik is a machine learning and operations research specialist with six years in AI, notably in ads moderation, linehaul cost optimization, and McKinsey's routing tool. He's excelled in cost reduction and system efficiency, adept with jsprit, OR-Tools, and ML model development. With both a bachelor's and master's degree in mechanical engineering, with a focus on operations research, Bhoumik also holds certifications in deep learning and related fields.

Portfolio

New Taiwan Trading Corp DBA NTFOODS
Machine Learning, Python, Data Modeling, Chatbot Development, ChatGPT, Food...
Mercatus Center at George Mason University - Main
Machine Learning, Web Scraping, Java Natural Language Processing (JNLP)...
Gold Coin Group LLC
Computer Vision, Optical Character Recognition, Python, OpenCV, TensorFlow...

Experience

Availability

Part-time

Preferred Environment

OR-Tools, Jsprit, PyCharm, IntelliJ IDEA, H20, Spark, Deep Learning, PyTorch

The most amazing...

...thing I've designed is Amazon's ad system, which moderated over 5 billion ads and reduced the need for human checks by 60%

Work Experience

ML & Analytics Expert | Food Industry

2024 - 2024
New Taiwan Trading Corp DBA NTFOODS
  • Developed end-to-end recommendation engine for a platform that sells raw materials to restaurants.
  • Scrapped restaurants' menus and used items in the menu to generate personalized recommendations for raw materials.
  • Deployed recommendation engine as a service on AWS using AWS Lambda, DynamoDB, and Step Functions.
Technologies: Machine Learning, Python, Data Modeling, Chatbot Development, ChatGPT, Food, Grocery Delivery, E-commerce marketing, Analytics Development

AI Consultant

2024 - 2024
Mercatus Center at George Mason University - Main
  • Built a heuristic algorithm to identify and extract relevant information and amendments from policy documents published by US federal agencies. Due to large size of documents (300+ pages), it could not be done directly LLMs.
  • Used LLMs to make amendments into federal regulations based on published policy by multiple federal agencies.
  • Built a front-end application using Streamlit to compare changes in regulation.
Technologies: Machine Learning, Web Scraping, Java Natural Language Processing (JNLP), Artificial General Intelligence (AGI), Deep Learning, SQL, NLP, Artificial Intelligence, LLM, AI Prompts, Large Data Sets, Finance, Capital Markets, Entity Extraction, AI Agents

AI/ML Engineer

2024 - 2024
Gold Coin Group LLC
  • Developed an end-to-end application for scrapping scanned images of handwritten mails from an internal website and extracting data from it.
  • Used AWS textract, and OpenAI along with Donut model to do OCR of handwritten images and extract information in structured format.
  • Implemented web scraping solution using Selenium and Beautiful Soup.
Technologies: Computer Vision, Optical Character Recognition, Python, OpenCV, TensorFlow, Pandas, Machine Learning, Artificial Intelligence, Deep Learning, Workflow Automation, NLP, OCR, Keras, AI Prompts, Large Data Sets, Finance, Capital Markets

Applied Scientist

2021 - 2023
Amazon India
  • Developed automated moderation systems to streamline the review of more than 5 billion ads annually.
  • Led a team of three in enhancing current machine learning systems, cutting down the number of ads requiring manual moderation by 60%.
  • Achieved a $500 million annual reduction in shipping invoice estimation errors by training a gradient boosting machine (GBM) model using H2O to handle a large training dataset of 250 million samples.
Technologies: Artificial Intelligence, H20, Machine Learning, Gradient Boosting, Spark, PyTorch, Generative Pre-trained Transformers (GPT), NLP, Data Science, Deep Learning, LLM, AWS, Data Analysis, Predictive Modeling, OpenAI GPT-3 API, Technical Leadership, Haystack, RAG, Trend Analysis, Advertising Technology (Adtech), Machine Learning Operations (MLOps), Neural Network, Algorithms, Retail & Wholesale, Inventory, CSV File Processing, Full-stack, Web Development, Bedrock, GPT-3, Design Consulting, Leadership, SQL, Product Management, PySpark, Data Science, Git, Artificial Intelligence, AI Model Training, Pandas, NumPy, NoSQL, FastAPI, Neo4j, Graph Databases, Python, Models, Generative Artificial Intelligence (GenAI), Scikit-Learn, Classifier Development, Supervised Learning, Teamwork, Regression, Data Engineering, Web Scraping, Apache, Optical Character Recognition, Workflow Automation, TensorFlow, E-commerce marketing, Analytics Development, Sentiment Analysis, Modeling, Data Collection, Big Data Architecture, C#, Document Parsing, Minimum Viable Product (MVP), Automation, Excel Development, AI Prompts, Large Data Sets, Java, Entity Extraction, AI Agents

Operations Research Scientist

2019 - 2021
Amazon India
  • Built a dispatch time optimizer for the Amazon India network to deliver 10% of shipments one day faster. Used concepts of multi-agent learning and mixed-integer linear programming to solve the NP-complete problem.
  • Used concepts of vehicle routing optimization and built a line-haul route planner tool for the Amazon line-haul network. It combined multiple lanes to increase vehicle utilization. This resulted in an annual freight cost reduction of $10 million.
  • Created a workforce scheduling tool using constraint programming to automate and optimize human resources deployment at the Amazon sort center, which resulted in an 8% reduction in the workforce.
Technologies: Operations Research, Optimization, Vehicle Routing, Network Optimization, Python, Java, Gurobi, CPLEX, AWS, Data Analysis, Data Science, Time Series, Predictive Modeling, Technical Leadership, Machine Learning, Trend Analysis, Machine Learning Operations (MLOps), Neural Network, Algorithms, Retail & Wholesale, Inventory, CSV File Processing, Design Consulting, SQL, Product Management, PySpark, Data Science, Git, AI Model Training, Pandas, NumPy, NoSQL, FastAPI, Python, Models, Location Services and Maps, Spatial Data Scientists, Classifier Development, Teamwork, Regression, Data Engineering, Web Scraping, Software Architecture, Workflow Automation, E-commerce marketing, Generative Pre-trained Transformers (GPT), Analytics Development, Grocery Delivery, Modeling, Data Collection, Big Data Architecture, Document Parsing, Minimum Viable Product (MVP), Automation, Excel Development

Knowledge Analyst

2017 - 2019
McKinsey & Company
  • Developed a state-of-the-art tool to solve vehicle routing problems with real-life constraints using heuristic algorithms. Deployed the solution as a web application and implemented it in over 15 client scenarios spanning various industries.
  • Collaborated with multiple global clients for topology and production planning optimization.
  • Developed a routing algorithm for solving ship container routing for a leading petrochemical producer.
Technologies: Supply Chain, Supply Chain Optimization, Supply Chain Management, Data Analysis, Data Science, Time Series, Forecasting, Predictive Modeling, Technical Leadership, Machine Learning, Trend Analysis, Finance, Neural Network, Algorithms, Retail & Wholesale, Inventory, CSV File Processing, Design Consulting, SQL, Product Management, Data Science, Git, Artificial Intelligence, AI Model Training, Pandas, NumPy, NoSQL, FastAPI, Python, Models, Location Services and Maps, Spatial Data Scientists, Classifier Development, Supervised Learning, Teamwork, Regression, Data Engineering, Web Scraping, Apache, Software Architecture, Workflow Automation, Analytics Development, Grocery Delivery, Modeling, Data Collection, Big Data Architecture, Automation, Excel Development, Large Data Sets, Java

Automatic Grading of Handwritten Tests

https://qn-a-eval.vercel.app/
The automatic grading system is an innovative tool designed to assess handwritten answer sheets with high precision. Utilizing state-of-the-art optical character recognition (OCR) technology, the system deciphers handwritten text and efficiently converts it into digital format. Beyond this conversion, smart post-processing ensures accuracy, addressing common issues associated with handwriting variations. Once the answers are digitized, the system employs large language models (LLMs) to evaluate the content, ensuring a comprehensive understanding of the answer's context and relevance. The integration of these technologies provides a swift and accurate grading process, reducing manual effort and increasing consistency in evaluation.

Routing Optimization

I built a state-of-the-art routing optimization tool using jsprit, OR-Tools, and a proprietary algorithm to optimize outbound routing for an eCommerce giant. The tool resulted in an annual cost reduction of USD 10 million.

Ads Moderation System

Developed automated moderation systems to streamline the review of over 5 billion ads per year. Also, I led a team of three in enhancing current machine learning systems, cutting down the volume of ads requiring manual moderation by 60%.

Virtual Companion

http://t.me/mysaraAi_bot
The Telegram virtual companion bot is a groundbreaking chatbot leveraging the power of large language models (LLMs) to foster human-like interactions. With the integration of a long-term memory feature, the bot not only engages in real-time conversations but also personalizes interactions over time based on previous exchanges. Hosted on AWS Lambda for efficient serverless deployment, this bot ensures seamless and scalable interactions. Its exceptional conversational capabilities have resonated with users, amassing over 5,000 active users within just the first week of its launch.

YT Chat

I built a chatbot that enables users to chat with any YouTube channel. I built both the front and the back end.

Stack:
• Front end: React, Tailwind, and shadcn
• Authentication: Clerk
• Database: MongoDB
• Back end: Python with FastAPI
• RAG: LlamaIndex, Pinecone vector database, and OpenAI

Real-time Transcription for Medical Industry

A full-stack, real-time transcription service. The product records conversations between doctors and patients in real time.

Stack:
• Front end: React, Tailwind, and shadcn
• Back end: Python and FastAPI
• Transcription: AssemblyAI
• Summarisation: OpenAI
2012 - 2017

Master's Degree in Mechanical Engineering

Indian Institute of Technology Bombay - Mumbai, India

AUGUST 2018 - PRESENT

Deep Learning

Coursera

JULY 2018 - PRESENT

Structuring Machine Learning Projects

Coursera

MAY 2018 - PRESENT

Introduction to Data Science in Python

Coursera

Libraries/APIs

Pandas, NumPy, Scikit-Learn, OpenCV, Apache, TensorFlow, PyTorch, React.js, Telegram API, PySpark, Java Natural Language Processing (JNLP), Google Speech-to-Text API, Keras

Tools

ChatGPT, Git, Excel Development, AI Prompts, PyCharm, IntelliJ IDEA, Haystack, Azure OpenAI Service, Gurobi, CPLEX, SageMaker, Shadcn

Languages

Python, SQL, Python, Java, JavaScript, C#

Frameworks

Bedrock, Next.js, Flask, LlamaIndex, Selenium, Spark, Tailwind CSS, Scrapy

Paradigms

Spatial Data Scientists, Automation

Industry Expertise

Retail & Wholesale

Platforms

AWS, Docker, Azure Functions, H20, AWS Lambda, Azure Design

Storage

NoSQL, Neo4j, Graph Databases, AWS, MongoDB

Other

OR-Tools, Jsprit, Operations Research, Supply Chain, Supply Chain Optimization, Supply Chain Management, Machine Learning, Optimization, Vehicle Routing Problem (VRP), Vehicle Routing, Data Science, NLP, Artificial Intelligence, LLM, Data Analysis, Prompt Engineering, Technical Leadership, RAG, Neural Network, Algorithms, Inventory, CSV File Processing, GPT-3, Design Consulting, Leadership, Data Science, Artificial Intelligence, AI Model Training, FastAPI, Models, Location Services and Maps, Generative Artificial Intelligence (GenAI), LoRa, Hugging Face, Classifier Development, Supervised Learning, Teamwork, Regression, Pinecone, Data Engineering, Artificial General Intelligence (AGI), Data Scraping, Chatbot Development, Open-source LLMs, Workflow Automation, E-commerce marketing, Analytics Development, APIs, Vectorization, Sentiment Analysis, Modeling, Data Collection, Big Data Architecture, Minimum Viable Product (MVP), Large Data Sets, Entity Extraction, Simulations, Machine Learning Operations (MLOps), LangChain, Time Series, Predictive Modeling, OpenAI GPT-3 API, GPT-4, Full-stack, AI Chatbots, SaaS, Trend Analysis, Finance, Advertising Technology (Adtech), Full-stack, Product Management, Chatbot Development, Web Scraping, PDF Scraping, Software Architecture, Embeddings from Language Models (ELMo), Fine-tuning, Computer Vision, Optical Character Recognition, Large Language Model Operations (LLMOps), Grocery Delivery, Document Parsing, Gemini, Capital Markets, AI Agents, Deep Learning, Convolutional Neural Networks (CNNs), Recurrent Neural Networks (RNNs), Network Optimization, Generative Pre-trained Transformers (GPT), Gradient Boosting, OCR, OpenAI, Forecasting, Web Development, Legal Technology (Legaltech), Travel, Speech to Text, Speech to Text AI, Data Modeling, Food

Collaboration That Works

How to Work with Toptal

Toptal matches you directly with global industry experts from our network in hours—not weeks or months.

1

Share your needs

Discuss your requirements and refine your scope in a call with a Toptal domain expert.
2

Choose your talent

Get a short list of expertly matched talent within 24 hours to review, interview, and choose from.
3

Start your risk-free talent trial

Work with your chosen talent on a trial basis for up to two weeks. Pay only if you decide to hire them.

Top talent is in high demand.

Start hiring