Amanbir Singh, Developer in Delhi, India
Amanbir is available for hire
Hire Amanbir

Amanbir Singh

Verified Expert  in Engineering

Data Scientist and Back-end Developer

Location
Delhi, India
Toptal Member Since
September 13, 2021

Amanbir has 10 years of experience in data science, analytics, and back-end engineering. He has worked at a large multilateral organization and with early-stage tech startups. Amanbir excels at working with clients in tackling complex business problems and has deep expertise in machine learning, data analysis, and building scalable web apps.

Portfolio

ATS Software
Artificial Intelligence (AI), Machine Learning, Python, MySQL, GPT...
Monsoon CreditTech
Python, Pandas, Django, Angular, Docker, Kubernetes, Machine Learning...
IISD Experimental Lakes Area Inc - Main
Machine Learning, Data Science, Python, PostgreSQL, Amazon Web Services (AWS)...

Experience

Availability

Part-time

Preferred Environment

Python, Data Analytics, Data Science, Machine Learning, Pandas, Generative Pre-trained Transformers (GPT), OpenAI GPT-3 API, Minimum Viable Product (MVP), Generative Pre-trained Transformer 3 (GPT-3), OpenAI GPT-4 API, User Interface (UI), Product Management, Large Language Models (LLMs)

The most amazing...

...data science project I've worked on is building an automated machine learning platform for credit risk assessment from the ground up.

Work Experience

ML Developer

2023 - PRESENT
ATS Software
  • Worked on a computer vision model to extract information from unstructured PDF files (including drawings, tables, etc.).
  • Operated on NER models to extract information from natural language and unstructured text.
  • Used GPT-4 for postprocessing of AI pipeline to improve the performance. Also included rule-based postprocessing to improve pipeline performance.
  • Deployed the entire platform on AWS SageMaker and integrated with the client's stack.
  • Trained multimodal models to improve NER performance.
Technologies: Artificial Intelligence (AI), Machine Learning, Python, MySQL, GPT, Amazon SageMaker, Computer Vision, Named-entity Recognition (NER), Object Detection, Text Detection, Generative AI, Supervised Learning

Head of Product and Engineering

2016 - PRESENT
Monsoon CreditTech
  • Led the development of the SaaS AutoML platform as an architect and product manager; made wireframes, wrote user and functional requirements, decided on back-end architecture, and ran sprints using Django, Angular, Jenkins, and Docker.
  • Architected AutoML libraries used internally. The platform generated machine learning models optimized for lending.
  • Acted as a product manager and architect for developer tools used by our internal data science team to speed up model development and deployment.
  • Managed client engagements with 15 banks and NBFCs; built and deployed models to identify risky borrowers at the time of application. Increased revenue for the client by 20% and more.
  • Hired and managed a team of 10+ data scientists and software developers. Conducted one on ones, set targets for the team, and mentored junior members.
  • Built an auto-deployment process for machine learning models that supported multiple and multistage models.
Technologies: Python, Pandas, Django, Angular, Docker, Kubernetes, Machine Learning, Data Science, Machine Learning Operations (MLOps), XGBoost, Jupyter Notebook, SQL, Data Analytics, Data Visualization, Data Mining, Web Scraping, Data Reporting, Artificial Intelligence (AI), Agile, Data Analysis, Time Series, Time Series Analysis, Optimization, Financial Modeling, Amazon Web Services (AWS), MySQL, Azure, Scikit-learn, Statistics, Statistical Analysis, Real-time Data, Predictive Analytics, APIs, Banking & Finance, Architecture, Leadership, Automation Scripting, Scripting, AWS Lambda, REST APIs, Amazon S3 (AWS S3), HTML, Decision Trees, Data Scientist, Natural Language Processing (NLP), Recommendation Systems, Regression, PDF Scraping, Scraping, Back-end, Software Architecture, Azure ML Studio, Git, Amazon DynamoDB, PostgreSQL, Non-performing Loans (NPL), Data Scraping, TypeScript, NumPy, MongoDB, Serverless, Predictive Modeling, Customer Segmentation, Visualization, Django REST Framework, Full-stack Development, API Integration, AI Design, Automation, Full-stack, CSS, Flask, Solution Architecture, Software Development, PyPDF2, openpyxl, Microservices, Advisory, Technology Strategy & Architecture, Databases, Web Development, CTO, DevOps, Google Cloud Platform (GCP), JavaScript, Object-relational Mapping (ORM), Technical Leadership, Database Architecture, Agile Software Development, Data Structures, Amazon SageMaker, ETL, Minimum Viable Product (MVP), Requirements Analysis, Startups, Mathematics, Task Scheduling, Regular Expressions, Sockets, Linear Regression, Data-driven Decision-making, Decision Modeling, Neural Networks, Programming, Integration, User Interface (UI), Cloud, Models, Exploratory Data Analysis, EDA, Modeling, Data Cleaning, Unstructured Data Analysis, Large Data Sets, Data Gathering, Spreadsheets, Machine Learning Automation, Amazon Elastic Container Service (Amazon ECS), Data Processing, Product Management, Amazon EC2, Back-end Development, Azure Cosmos DB, GitHub, Azure Functions, Azure Blobs, Scrapy, Large Language Models (LLMs), Regression Modeling, Language Models, FastAPI, Containerization, Vertex, System Architecture, Product Roadmaps, Product Strategy, Team Leadership, Project Management, Azure Machine Learning, Pytest, Unit Testing, Statistical Modeling, NoSQL, Object-oriented Programming (OOP), Research, Cloud Computing, Unsupervised Fraud Detection, Unsupervised Learning, Supervised Learning, Open-source LLMs

Data Scientist | ML Expert

2023 - 2024
IISD Experimental Lakes Area Inc - Main
  • Developed a model using meteorological data to predict the date of ice melt for a lake. The prediction was within a day of the actual ice melting date.
  • Used boosting, bagging, and other algorithms to improve performance.
  • Created a dashboard using React to show model predictions and performance.
Technologies: Machine Learning, Data Science, Python, PostgreSQL, Amazon Web Services (AWS), React, Gradient Boosting, Scikit-learn, Google Earth, Statistical Modeling, Object-oriented Programming (OOP), Cloud Computing, Generative AI, Supervised Learning

Data Scientist

2023 - 2023
Independent Research Group
  • Created a simulation to model the interactions between different economic actors (firms, employees, non-economic participants, etc.).
  • Ran a Markov chain simulation to understand the effect with different initial states and interventions.
  • Created output visualizations and statistics to test hypotheses.
Technologies: Data Science, Agent-based Modeling, R, Python, Markov Chain Monte Carlo (MCMC) Algorithms, Monte Carlo Simulations, Simulations, Unsupervised Learning

AI/ML Developer

2023 - 2023
America Interpretation
  • Developed a real-time translation API to convert speech to speech across any language.
  • Built a back end in Django to handle streaming audio data and return translated audio data and transcription. The back end also addressed meeting creation and meeting joining.
  • Created a front end in React and used RecordRTC to capture audio. Established a WebSocket connection to allow for audio streaming to the back end.
  • Deployed both front and back end on Azure services.
  • Integrated with multiple translation and speech generation services.
Technologies: Python, Artificial Intelligence (AI), Machine Learning, Text to Speech (TTS), Speech to Text, Natural Language Processing (NLP), React, Azure Text to Speech, Elementor, Django, Azure, OpenAI, WebSockets, TypeScript, JavaScript, RecordRTC, Voice Recognition, Language Models, Prompt Engineering, System Architecture, New Product Development, Object-oriented Programming (OOP), Cloud Computing, Generative AI

AI/ML Expert/Consultant

2023 - 2023
Harbor
  • Did prompt engineering to improve LLM model predictions.
  • Compared open-source LLMs against closed models.
  • Self-hosted open-source LLMs on the company's infrastructure.
  • Built a prompt testing framework in Python to compare and improve prompts.
Technologies: OpenAI GPT-3 API, GPT, OpenAI GPT-4 API, Generative Pre-trained Transformers (GPT), Generative Pre-trained Transformer 3 (GPT-3), AIOps, Machine Learning Operations (MLOps), Natural Language Processing (NLP), Graphics Processing Unit (GPU), AI Design, Amazon SageMaker, Hugging Face, ChatGPT, Amazon EC2, Back-end Development, GitHub, LangChain, Pinecone, Large Language Models (LLMs), OpenAI, LlamaIndex, Language Models, Prompt Engineering, Containerization, System Architecture, Project Management, Retrieval-augmented Generation (RAG), Llama 2, NoSQL, Object-oriented Programming (OOP), Research, Cloud Computing, Generative AI, Open-source LLMs

AI/ML Engineer

2023 - 2023
Grown Unknown, LLC
  • Developed prompts to generate customized parental advice using OpenAI APIs.
  • Added context to the prompts to tailor the tone of the outputs.
  • Compared OpenAI with other options and created a plan for future product development.
Technologies: Python, Machine Learning, Language Models, OpenAI GPT-4 API, OpenAI GPT-3 API, GPT, Data Scientist, Language Learning, Generative Systems, Natural Language Processing (NLP), ChatGPT, Large Language Models (LLMs), OpenAI, Prompt Engineering, System Architecture, Generative AI

Machine Learning Expert

2023 - 2023
AmpVis Ltd.
  • Advised the client on building the MVP, including all technical steps needed.
  • Decided on team structures to handle different product decisions.
  • Consulted on hiring decisions for other technical roles.
Technologies: Python, Machine Learning, Artificial Intelligence (AI), Data Science, APIs, Google Vision API, Amazon Rekognition, Programming, Cloud, Models, Data Scientist, Generative Systems, Deep Learning, Large Language Models (LLMs), Product Roadmaps, Product Strategy

Data Scientist

2023 - 2023
NewCloud Medical LLC
  • Built a Looker Studio dashboard to show data and summary statistics based on filters.
  • Added visualizations in the Looker Studio to generate insights from the data.
  • Created dashboard views that dynamically update based on selected fields.
Technologies: Python, PDF Scraping, Scraping, Databases, Looker, Programming, Language Models, GPT, Data Cleaning, Data Scientist, Spreadsheets, Data Processing, Large Language Models (LLMs)

Research Coordinator

2015 - 2016
JustJobs Network
  • Set up an internal data management system to track versions of datasets.
  • Led research on vocational training and skill-building programs in India. Led data collection and analysis; published a findings report.
  • Designed a training module on statistics and R, which was used for the training of new hires.
Technologies: Python, R, Data Analytics, Data Visualization, Data Mining, Web Scraping, Data Reporting, Data Analysis, Statistics, Statistical Analysis, Automation Scripting, Scripting, Data Scientist, Regression, Scraping, Git, Predictive Modeling, Visualization, Automation, Mathematics, Linear Regression, Data-driven Decision-making, Decision Modeling, Programming, Models, Exploratory Data Analysis, EDA, Modeling, Data Cleaning, Unstructured Data Analysis, Data Gathering, Spreadsheets, Data Processing, Regression Modeling, Project Management, Statistical Modeling, Research, Supervised Learning

Consultant

2014 - 2015
World Bank Group
  • Supervised statewide data collection for 4,500 surveys at the individual and household levels.
  • Built models to identify factors that affected education and labor market outcomes for adolescents.
  • Participated in the dissemination of research findings.
Technologies: R, Data Science, Data Analytics, Data Visualization, Data Mining, Data Reporting, Data Analysis, Statistics, Statistical Analysis, Automation Scripting, Scripting, Regression, Git, Predictive Modeling, Visualization, ETL, Mathematics, Linear Regression, Data-driven Decision-making, Decision Modeling, Programming, Models, Exploratory Data Analysis, EDA, Modeling, Data Cleaning, Unstructured Data Analysis, Data Gathering, Spreadsheets, Data Processing, Regression Modeling, Project Management, Statistical Modeling, Research, Unsupervised Learning, Supervised Learning

Senior Research Associate

2012 - 2014
Centre for Microfinance Research
  • Managed two randomized control trials studying the effect of financial access in India.
  • Trained and supervised a field team of 30 members for 1,700 individual surveys across four districts.
  • Designed and implemented six electronic questionnaires using Open Data Kit and SurveyCTO and built the back end for the survey data.
Technologies: STATA, Survey Design, Open Data Kit, Data Visualization, Data Mining, Data Reporting, Data Analysis, Causal Inference, Statistics, Statistical Analysis, Automation Scripting, Regression, Visualization, Mathematics, Linear Regression, Data-driven Decision-making, Models, Exploratory Data Analysis, EDA, Modeling, Data Cleaning, Unstructured Data Analysis, Data Gathering, Spreadsheets, Data Processing, Regression Modeling, Project Management, Statistical Modeling, Research, Unsupervised Learning, Supervised Learning

AutoML Platform for Lenders

https://monsoonfintech.com/thoth/
Built an AutoML platform that takes data from lenders and produces state-of-the-art machine-learning models. Supports traditional financial data and alternate (SMS, mobile, etc.) data.

The platform produced models for new applications and to help with collections for running loans. This was offered as a SaaS product.

Custom Machine Learning Models for Lenders

https://monsoonfintech.com/
Managed a team of developers and data scientists to build models for lenders. This included models that predicted the risk of loan applications, recommendation engines for financial products, and marketing models to reach out to identify target customers.

Built and delivered models to the largest lenders in India. This led to a 30% reduction in delinquencies and increased loan approvals by 25%.

Report for the World Bank

https://documents.worldbank.org/en/publication/documents-reports/documentdetail/866381523450216235/a-window-of-opportunity-a-diagnostic-of-adolescent-girls-and-young-women-s-socio-economic-empowerment-in-jharkhand-india
Worked closely with the World Bank to identify critical challenges, along with key reforms, that adolescent girls in Jharkhand, India were facing.

My role included experimental design, data collection, analysis, and modeling. I also worked on the dissemination of the report and communication with key stakeholders.

Languages

Python, HTML, R, SQL, TypeScript, CSS, JavaScript

Frameworks

Django, Django REST Framework, Bootstrap, Material UI, LlamaIndex, Angular, Flask, Scrapy

Libraries/APIs

Pandas, XGBoost, Scikit-learn, REST APIs, NumPy, Beautiful Soup, Sockets, Google Vision API, Amazon Rekognition, React, RecordRTC

Tools

Amazon SageMaker, ChatGPT, Git, Spreadsheets, Amazon Elastic Container Service (Amazon ECS), GitHub, Azure Machine Learning, Pytest, STATA, Open Data Kit, Azure ML Studio, Looker, Named-entity Recognition (NER)

Paradigms

Data Science, Automation, Object-relational Mapping (ORM), Object-oriented Programming (OOP), Agile, Microservices, Agile Software Development, ETL, Requirements Analysis, Unit Testing, DevOps, Agent-based Modeling

Platforms

Jupyter Notebook, AWS Lambda, Amazon EC2, Docker, Amazon Web Services (AWS), Azure, Azure Functions, Kubernetes, Google Cloud Platform (GCP)

Storage

MySQL, Amazon S3 (AWS S3), PostgreSQL, MongoDB, Databases, Database Architecture, Azure Cosmos DB, Azure Blobs, NoSQL, Amazon DynamoDB

Industry Expertise

Project Management, Banking & Finance

Other

Machine Learning, Data Analytics, Data Mining, Web Scraping, Artificial Intelligence (AI), Data Analysis, Statistics, Statistical Analysis, Predictive Analytics, APIs, Architecture, Automation Scripting, Scripting, Decision Trees, Data Scientist, Natural Language Processing (NLP), Regression, PDF Scraping, Scraping, Back-end, Software Architecture, Non-performing Loans (NPL), Data Scraping, Predictive Modeling, Customer Segmentation, Visualization, Full-stack Development, API Integration, Software Development, PyPDF2, Advisory, Technology Strategy & Architecture, Web Development, CTO, Technical Leadership, Generative Pre-trained Transformers (GPT), OpenAI GPT-3 API, Minimum Viable Product (MVP), Startups, Regular Expressions, Linear Regression, Data-driven Decision-making, Programming, Integration, Models, GPT, Exploratory Data Analysis, EDA, Modeling, Data Cleaning, Unstructured Data Analysis, Large Data Sets, Data Gathering, Machine Learning Automation, Data Processing, Back-end Development, Regression Modeling, Large Language Models (LLMs), OpenAI, Prompt Engineering, System Architecture, Product Roadmaps, Product Strategy, New Product Development, Team Leadership, Statistical Modeling, Unsupervised Learning, Supervised Learning, Machine Learning Operations (MLOps), Data Visualization, Data Reporting, Time Series, Time Series Analysis, Real-time Data, Leadership, Recommendation Systems, Serverless, AI Design, Full-stack, Solution Architecture, Data Structures, Generative Pre-trained Transformer 3 (GPT-3), Mathematics, Task Scheduling, OpenAI GPT-4 API, Decision Modeling, Neural Networks, Cloud, Language Models, Language Learning, Generative Systems, Product Management, LangChain, Speech to Text, Voice Recognition, FastAPI, Containerization, Retrieval-augmented Generation (RAG), Llama 2, Research, Cloud Computing, Unsupervised Fraud Detection, Generative AI, Open-source LLMs, Survey Design, SaaS, Optimization, Financial Modeling, Causal Inference, openpyxl, User Interface (UI), Deep Learning, AIOps, Graphics Processing Unit (GPU), Hugging Face, Pinecone, Text to Speech (TTS), Azure Text to Speech, Elementor, WebSockets, Vertex, Gradient Boosting, Google Earth, Markov Chain Monte Carlo (MCMC) Algorithms, Monte Carlo Simulations, Simulations, Computer Vision, Object Detection, Text Detection

2008 - 2012

Bachelor's Degree in Economics and Statistics

Carnegie Mellon University - Pittsburgh, PA, USA

Collaboration That Works

How to Work with Toptal

Toptal matches you directly with global industry experts from our network in hours—not weeks or months.

1

Share your needs

Discuss your requirements and refine your scope in a call with a Toptal domain expert.
2

Choose your talent

Get a short list of expertly matched talent within 24 hours to review, interview, and choose from.
3

Start your risk-free talent trial

Work with your chosen talent on a trial basis for up to two weeks. Pay only if you decide to hire them.

Top talent is in high demand.

Start hiring