Bhoumik Shah
Verified Expert in Engineering
Operations Research Scientist and Developer
Bengaluru, Karnataka, India
Toptal member since September 13, 2022
Bhoumik is a machine learning and operations research specialist with six years of experience in AI, notably in ads moderation, linehaul cost optimization, and McKinsey's routing tool. He has excelled in cost reduction and system efficiency and is adept with jsprit, OR-Tools, and ML model development. Bhoumik holds bachelor's and master's degrees in mechanical engineering with a focus on operations research, along with certifications in deep learning and related fields.
Preferred Environment
OR-Tools, Jsprit, PyCharm, IntelliJ IDEA, H2O, Spark, Deep Learning, PyTorch
The most amazing...
...thing I've designed is Amazon's ad system, which moderated over 5 billion ads and reduced the need for human checks by 60%
Work Experience
AI Consultant
Mercatus Center at George Mason University - Main
- Built a heuristic algorithm to identify and extract relevant information and amendments from policy documents published by US federal agencies. Because the documents were large (300+ pages), this could not be done directly with LLMs.
- Used LLMs to apply amendments to federal regulations based on policies published by multiple federal agencies.
- Built a front-end application using Streamlit to compare changes in regulation.
AI/ML Engineer
Gold Coin Group LLC
- Developed an end-to-end application for scraping scanned images of handwritten mail from an internal website and extracting data from them.
- Used AWS Textract and OpenAI, along with the Donut model, to perform OCR on handwritten images and extract information in a structured format.
- Implemented a web scraping solution using Selenium and Beautiful Soup.
Applied Scientist
Amazon India
- Developed automated moderation systems to streamline the review of more than 5 billion ads annually.
- Led a team of three in enhancing current machine learning systems, cutting down the number of ads requiring manual moderation by 60%.
- Achieved a $500 million annual reduction in shipping invoice estimation errors by training a gradient boosting machine (GBM) model using H2O to handle a large training dataset of 250 million samples.
Operations Research Scientist
Amazon India
- Built a dispatch time optimizer for the Amazon India network to deliver 10% of shipments one day faster. Used concepts of multi-agent learning and mixed-integer linear programming to solve the NP-complete problem.
- Used concepts of vehicle routing optimization and built a line-haul route planner tool for the Amazon line-haul network. It combined multiple lanes to increase vehicle utilization. This resulted in an annual freight cost reduction of $10 million.
- Created a workforce scheduling tool using constraint programming to automate and optimize human resources deployment at the Amazon sort center, which resulted in an 8% reduction in the workforce.
Knowledge Analyst
McKinsey & Company
- Developed a state-of-the-art tool to solve vehicle routing problems with real-life constraints using heuristic algorithms. Deployed the solution as a web application and implemented it in over 15 client scenarios spanning various industries.
- Collaborated with multiple global clients for topology and production planning optimization.
- Developed a routing algorithm for solving ship container routing for a leading petrochemical producer.
Experience
Routing Optimization
Automatic Grading of Handwritten Tests
https://qn-a-eval.vercel.app/
Ads Moderation System
Virtual Companion
http://t.me/mysaraAi_bot
YT Chat
Stack:
• Front end: React, Tailwind, and shadcn
• Authentication: Clerk
• Database: MongoDB
• Back end: Python with FastAPI
• RAG: LlamaIndex, Pinecone vector database, and OpenAI
Real-time Transcription for Medical Industry
Stack:
• Front end: React, Tailwind, and shadcn
• Back end: Python and FastAPI
• Transcription: AssemblyAI
• Summarisation: OpenAI
Education
Master's Degree in Mechanical Engineering
Indian Institute of Technology Bombay - Mumbai, India
Certifications
Deep Learning
Coursera
Structuring Machine Learning Projects
Coursera
Introduction to Data Science in Python
Coursera
Skills
Libraries/APIs
Pandas, NumPy, Scikit-learn, OpenCV, Spark ML, TensorFlow, PyTorch, React, Telegram Bot API, PySpark, Java Natural Language Processing (JNLP), Google Speech-to-Text API, Keras
Tools
ChatGPT, GitLab, Microsoft Excel, AI Prompts, PyCharm, IntelliJ IDEA, Haystack, Azure OpenAI Service, Gurobi, CPLEX, Amazon SageMaker, Shadcn
Languages
Python, SQL, Python 3, Java, JavaScript, C#
Frameworks
Bedrock, Next.js, Flask, LlamaIndex, Selenium, Spark, Tailwind CSS, Scrapy
Paradigms
Spatial Databases, Automation
Industry Expertise
Retail & Wholesale
Platforms
Amazon Web Services (AWS), Docker, Azure Functions, H2O, AWS Lambda, Azure
Storage
NoSQL, Neo4j, Graph Databases, Amazon DynamoDB, MongoDB
Other
OR-Tools, Jsprit, Operations Research, Supply Chain, Supply Chain Optimization, Supply Chain Management (SCM), Machine Learning, Optimization, Vehicle Routing Problem (VRP), Vehicle Routing, Data Science, Natural Language Processing (NLP), Artificial Intelligence (AI), Large Language Models (LLMs), Data Analysis, Prompt Engineering, Technical Leadership, Retrieval-augmented Generation (RAG), Neural Networks, Algorithms, Inventory, CSV File Processing, Generative Pre-trained Transformer 3 (GPT-3), Consulting, Leadership, Data Science Product Manager, AI Programming, AI Model Training, FastAPI, Models, Location Services and Maps, Generative Artificial Intelligence (GenAI), LoRa, Hugging Face, Classifier Development, Supervised Learning, Teamwork, Regression, Pinecone, Data Engineering, Artificial General Intelligence (AGI), Data Scraping, Chatbots, Open-source LLMs, Workflow Automation, eCommerce, Analytics, APIs, Vectorization, Sentiment Analysis, Modeling, Data Collection, Big Data, Minimum Viable Product (MVP), Large Data Sets, Simulations, Machine Learning Operations (MLOps), LangChain, Time Series, Predictive Modeling, OpenAI GPT-3 API, OpenAI GPT-4 API, Full-stack Development, AI Chatbots, SaaS, Trend Analysis, Finance, Advertising Technology (Adtech), Full-stack, Product Management, Chatbot Conversation Design, Web Scraping, PDF Scraping, Software Architecture, Embeddings from Language Models (ELMo), Fine-tuning, Computer Vision, Optical Character Recognition, Large Language Model Operations (LLMOps), Grocery Delivery, Document Parsing, Gemini, Capital Markets, Deep Learning, Convolutional Neural Networks (CNNs), Recurrent Neural Networks (RNNs), Network Optimization, Generative Pre-trained Transformers (GPT), Gradient Boosting, OCR, OpenAI, Forecasting, Web Development, Legal Technology (Legaltech), Travel, Speech to Text, Speech to Text AI