Bhoumik Shah, Developer in Bengaluru, Karnataka, India
Bhoumik is available for hire
Hire Bhoumik

Bhoumik Shah

Verified Expert  in Engineering

Bio

Bhoumik is a machine learning and operations research specialist with six years in AI, notably in ads moderation, linehaul cost optimization, and McKinsey's routing tool. He's excelled in cost reduction and system efficiency, adept with jsprit, OR-Tools, and ML model development. With both a bachelor's and master's degree in mechanical engineering, with a focus on operations research, Bhoumik also holds certifications in deep learning and related fields.

Portfolio

Mercatus Center at George Mason University - Main
Machine Learning, Web Scraping, Java Natural Language Processing (JNLP)...
Gold Coin Group LLC
Computer Vision, Optical Character Recognition, Python, OpenCV, TensorFlow...
Amazon India
Artificial Intelligence (AI), H20, Machine Learning, Gradient Boosting, Spark...

Experience

Availability

Part-time

Preferred Environment

OR-Tools, Jsprit, PyCharm, IntelliJ IDEA, H20, Spark, Deep Learning, PyTorch

The most amazing...

...thing I've designed is Amazon's ad system, which moderated over 5 billion ads and reduced the need for human checks by 60%

Work Experience

AI Consultant

2024 - 2024
Mercatus Center at George Mason University - Main
  • Built a heuristic algorithm to identify and extract relevant information and amendments from policy documents published by US federal agencies. Due to large size of documents (300+ pages), it could not be done directly LLMs.
  • Used LLMs to make amendments into federal regulations based on published policy by multiple federal agencies.
  • Built a front-end application using Streamlit to compare changes in regulation.
Technologies: Machine Learning, Web Scraping, Java Natural Language Processing (JNLP), Artificial General Intelligence (AGI), Deep Learning, SQL, Natural Language Processing (NLP), Artificial Intelligence (AI), Large Language Models (LLMs), AI Prompts, Large Data Sets, Finance, Capital Markets

AI/ML Engineer

2024 - 2024
Gold Coin Group LLC
  • Developed an end-to-end application for scrapping scanned images of handwritten mails from an internal website and extracting data from it.
  • Used AWS textract, and OpenAI along with Donut model to do OCR of handwritten images and extract information in structured format.
  • Implemented web scraping solution using Selenium and Beautiful Soup.
Technologies: Computer Vision, Optical Character Recognition, Python, OpenCV, TensorFlow, Pandas, Machine Learning, Artificial Intelligence (AI), Deep Learning, Workflow Automation, Natural Language Processing (NLP), OCR, Keras, AI Prompts, Large Data Sets, Finance, Capital Markets

Applied Scientist

2021 - 2023
Amazon India
  • Developed automated moderation systems to streamline the review of more than 5 billion ads annually.
  • Led a team of three in enhancing current machine learning systems, cutting down the number of ads requiring manual moderation by 60%.
  • Achieved a $500 million annual reduction in shipping invoice estimation errors by training a gradient boosting machine (GBM) model using H2O to handle a large training dataset of 250 million samples.
Technologies: Artificial Intelligence (AI), H20, Machine Learning, Gradient Boosting, Spark, PyTorch, Generative Pre-trained Transformers (GPT), Natural Language Processing (NLP), Data Science, Deep Learning, Large Language Models (LLMs), Amazon Web Services (AWS), Data Analysis, Predictive Modeling, OpenAI GPT-3 API, Technical Leadership, Haystack, Retrieval-augmented Generation (RAG), Trend Analysis, Advertising Technology (Adtech), Machine Learning Operations (MLOps), Neural Networks, Algorithms, Retail & Wholesale, Inventory, CSV File Processing, Full-stack, Web Development, Bedrock, Generative Pre-trained Transformer 3 (GPT-3), Consulting, Leadership, SQL, Product Management, PySpark, Data Science Product Manager, GitLab, AI Programming, AI Model Training, Pandas, NumPy, NoSQL, FastAPI, Neo4j, Graph Databases, Python 3, Models, Generative Artificial Intelligence (GenAI), Scikit-learn, Classifier Development, Supervised Learning, Teamwork, Regression, Data Engineering, Web Scraping, Spark ML, Optical Character Recognition, Workflow Automation, TensorFlow, eCommerce, Analytics, Sentiment Analysis, Modeling, Data Collection, Big Data, C#, Document Parsing, Minimum Viable Product (MVP), Automation, Microsoft Excel, AI Prompts, Large Data Sets, Java

Operations Research Scientist

2019 - 2021
Amazon India
  • Built a dispatch time optimizer for the Amazon India network to deliver 10% of shipments one day faster. Used concepts of multi-agent learning and mixed-integer linear programming to solve the NP-complete problem.
  • Used concepts of vehicle routing optimization and built a line-haul route planner tool for the Amazon line-haul network. It combined multiple lanes to increase vehicle utilization. This resulted in an annual freight cost reduction of $10 million.
  • Created a workforce scheduling tool using constraint programming to automate and optimize human resources deployment at the Amazon sort center, which resulted in an 8% reduction in the workforce.
Technologies: Operations Research, Optimization, Vehicle Routing, Network Optimization, Python, Java, Gurobi, CPLEX, Amazon Web Services (AWS), Data Analysis, Data Science, Time Series, Predictive Modeling, Technical Leadership, Machine Learning, Trend Analysis, Machine Learning Operations (MLOps), Neural Networks, Algorithms, Retail & Wholesale, Inventory, CSV File Processing, Consulting, SQL, Product Management, PySpark, Data Science Product Manager, GitLab, AI Model Training, Pandas, NumPy, NoSQL, FastAPI, Python 3, Models, Location Services and Maps, Spatial Databases, Classifier Development, Teamwork, Regression, Data Engineering, Web Scraping, Software Architecture, Workflow Automation, eCommerce, Generative Pre-trained Transformers (GPT), Analytics, Grocery Delivery, Modeling, Data Collection, Big Data, Document Parsing, Minimum Viable Product (MVP), Automation, Microsoft Excel

Knowledge Analyst

2017 - 2019
McKinsey & Company
  • Developed a state-of-the-art tool to solve vehicle routing problems with real-life constraints using heuristic algorithms. Deployed the solution as a web application and implemented it in over 15 client scenarios spanning various industries.
  • Collaborated with multiple global clients for topology and production planning optimization.
  • Developed a routing algorithm for solving ship container routing for a leading petrochemical producer.
Technologies: Supply Chain, Supply Chain Optimization, Supply Chain Management (SCM), Data Analysis, Data Science, Time Series, Forecasting, Predictive Modeling, Technical Leadership, Machine Learning, Trend Analysis, Finance, Neural Networks, Algorithms, Retail & Wholesale, Inventory, CSV File Processing, Consulting, SQL, Product Management, Data Science Product Manager, GitLab, AI Programming, AI Model Training, Pandas, NumPy, NoSQL, FastAPI, Python 3, Models, Location Services and Maps, Spatial Databases, Classifier Development, Supervised Learning, Teamwork, Regression, Data Engineering, Web Scraping, Spark ML, Software Architecture, Workflow Automation, Analytics, Grocery Delivery, Modeling, Data Collection, Big Data, Automation, Microsoft Excel, Large Data Sets, Java

Routing Optimization

I built a state-of-the-art routing optimization tool using jsprit, OR-Tools, and a proprietary algorithm to optimize outbound routing for an eCommerce giant. The tool resulted in an annual cost reduction of USD 10 million.

Automatic Grading of Handwritten Tests

https://qn-a-eval.vercel.app/
The automatic grading system is an innovative tool designed to assess handwritten answer sheets with high precision. Utilizing state-of-the-art optical character recognition (OCR) technology, the system deciphers handwritten text and efficiently converts it into digital format. Beyond this conversion, smart post-processing ensures accuracy, addressing common issues associated with handwriting variations. Once the answers are digitized, the system employs large language models (LLMs) to evaluate the content, ensuring a comprehensive understanding of the answer's context and relevance. The integration of these technologies provides a swift and accurate grading process, reducing manual effort and increasing consistency in evaluation.

Ads Moderation System

Developed automated moderation systems to streamline the review of over 5 billion ads per year. Also, I led a team of three in enhancing current machine learning systems, cutting down the volume of ads requiring manual moderation by 60%.

Virtual Companion

http://t.me/mysaraAi_bot
The Telegram virtual companion bot is a groundbreaking chatbot leveraging the power of large language models (LLMs) to foster human-like interactions. With the integration of a long-term memory feature, the bot not only engages in real-time conversations but also personalizes interactions over time based on previous exchanges. Hosted on AWS Lambda for efficient serverless deployment, this bot ensures seamless and scalable interactions. Its exceptional conversational capabilities have resonated with users, amassing over 5,000 active users within just the first week of its launch.

YT Chat

I built a chatbot that enables users to chat with any YouTube channel. I built both the front and the back end.

Stack:
• Front end: React, Tailwind, and shadcn
• Authentication: Clerk
• Database: MongoDB
• Back end: Python with FastAPI
• RAG: LlamaIndex, Pinecone vector database, and OpenAI

Real-time Transcription for Medical Industry

A full-stack, real-time transcription service. The product records conversations between doctors and patients in real time.

Stack:
• Front end: React, Tailwind, and shadcn
• Back end: Python and FastAPI
• Transcription: AssemblyAI
• Summarisation: OpenAI
2012 - 2017

Master's Degree in Mechanical Engineering

Indian Institute of Technology Bombay - Mumbai, India

AUGUST 2018 - PRESENT

Deep Learning

Coursera

JULY 2018 - PRESENT

Structuring Machine Learning Projects

Coursera

MAY 2018 - PRESENT

Introduction to Data Science in Python

Coursera

Libraries/APIs

Pandas, NumPy, Scikit-learn, OpenCV, Spark ML, TensorFlow, PyTorch, React, Telegram Bot API, PySpark, Java Natural Language Processing (JNLP), Google Speech-to-Text API, Keras

Tools

ChatGPT, GitLab, Microsoft Excel, AI Prompts, PyCharm, IntelliJ IDEA, Haystack, Azure OpenAI Service, Gurobi, CPLEX, Amazon SageMaker, Shadcn

Languages

Python, SQL, Python 3, Java, JavaScript, C#

Frameworks

Bedrock, Next.js, Flask, LlamaIndex, Selenium, Spark, Tailwind CSS, Scrapy

Paradigms

Spatial Databases, Automation

Industry Expertise

Retail & Wholesale

Platforms

Amazon Web Services (AWS), Docker, Azure Functions, H20, AWS Lambda, Azure

Storage

NoSQL, Neo4j, Graph Databases, Amazon DynamoDB, MongoDB

Other

OR-Tools, Jsprit, Operations Research, Supply Chain, Supply Chain Optimization, Supply Chain Management (SCM), Machine Learning, Optimization, Vehicle Routing Problem (VRP), Vehicle Routing, Data Science, Natural Language Processing (NLP), Artificial Intelligence (AI), Large Language Models (LLMs), Data Analysis, Prompt Engineering, Technical Leadership, Retrieval-augmented Generation (RAG), Neural Networks, Algorithms, Inventory, CSV File Processing, Generative Pre-trained Transformer 3 (GPT-3), Consulting, Leadership, Data Science Product Manager, AI Programming, AI Model Training, FastAPI, Models, Location Services and Maps, Generative Artificial Intelligence (GenAI), LoRa, Hugging Face, Classifier Development, Supervised Learning, Teamwork, Regression, Pinecone, Data Engineering, Artificial General Intelligence (AGI), Data Scraping, Chatbots, Open-source LLMs, Workflow Automation, eCommerce, Analytics, APIs, Vectorization, Sentiment Analysis, Modeling, Data Collection, Big Data, Minimum Viable Product (MVP), Large Data Sets, Simulations, Machine Learning Operations (MLOps), LangChain, Time Series, Predictive Modeling, OpenAI GPT-3 API, OpenAI GPT-4 API, Full-stack Development, AI Chatbots, SaaS, Trend Analysis, Finance, Advertising Technology (Adtech), Full-stack, Product Management, Chatbot Conversation Design, Web Scraping, PDF Scraping, Software Architecture, Embeddings from Language Models (ELMo), Fine-tuning, Computer Vision, Optical Character Recognition, Large Language Model Operations (LLMOps), Grocery Delivery, Document Parsing, Gemini, Capital Markets, Deep Learning, Convolutional Neural Networks (CNNs), Recurrent Neural Networks (RNNs), Network Optimization, Generative Pre-trained Transformers (GPT), Gradient Boosting, OCR, OpenAI, Forecasting, Web Development, Legal Technology (Legaltech), Travel, Speech to Text, Speech to Text AI

Collaboration That Works

How to Work with Toptal

Toptal matches you directly with global industry experts from our network in hours—not weeks or months.

1

Share your needs

Discuss your requirements and refine your scope in a call with a Toptal domain expert.
2

Choose your talent

Get a short list of expertly matched talent within 24 hours to review, interview, and choose from.
3

Start your risk-free talent trial

Work with your chosen talent on a trial basis for up to two weeks. Pay only if you decide to hire them.

Top talent is in high demand.

Start hiring