Gaurav Singh, Developer in London, United Kingdom
Gaurav is available for hire
Hire Gaurav

Gaurav Singh

Verified Expert  in Engineering

Machine Learning Engineer and Developer

Location
London, United Kingdom
Toptal Member Since
October 7, 2020

Gaurav is a talented machine learning and NLP scientist with a Ph.D. from University College London. Gaurav's research focused on information extraction from unstructured text using deep learning, mainly under scarce training data constraints. He sped up convergence in training deep neural networks, improving generalization and robustness to adversarial noise, and developed an automated approach for finding promising materials from the scientific literature for making energy devices.

Portfolio

OctoML
PyTorch, Python 3, Amazon SageMaker
SimpliCapital LLC
Amazon SageMaker, Python, Data Science, Parallelization, Large Data Sets...
Binance
Amazon SageMaker, Amazon EC2, Deep Learning, Natural Language Processing (NLP)...

Experience

Availability

Part-time

Preferred Environment

Amazon Web Services (AWS), Deep Learning, NumPy, Git, Pandas, Scikit-learn, PyTorch, Python 3, Azure, SQL

The most amazing...

...thing I've developed as an ML/NLP scientist is an automated approach for finding promising materials from the scientific literature for making energy devices.

Work Experience

Deep Learning LLM Scientist and Deployment Specialist

2023 - 2023
OctoML
  • Audited the platform that was built to allow people to use LLMs—both public and private—via easy-to-use and quick-to-set-up APIs.
  • Tracked bugs and made a report to inform the company to get them fixed.
  • Tested various features of the Octoml.ai website.
Technologies: PyTorch, Python 3, Amazon SageMaker

SageMaker Expert

2023 - 2023
SimpliCapital LLC
  • Analyzed the problem and developed a strategy for solving the problem under given constraints for the customer. Clarified the problem and fixed major issues in the company's previous solution.
  • Built a new state-of-the-art after extensive experimentation that performed as per the expectations of the customer. Worked with the engineering team to fix the AWS architectural issues so the model could work without significant delays in prod.
  • Prepared the results to be presented to the investors of the company.
Technologies: Amazon SageMaker, Python, Data Science, Parallelization, Large Data Sets, Feature Engineering, Machine Learning, Regression Modeling, Classification Algorithms, Amazon S3 (AWS S3), Amazon Machine Learning, Leadership

Lead Data Scientist

2022 - 2023
Binance
  • Built a machine learning-based system to extract information from users' uploaded ID images to perform cheaper and faster KYC.
  • Developed a social media monitoring system that could detect upcoming trends, identify and summarize customer feedback, create alerts for customer complaints, and identify new coins that are getting attention from users, etc.
  • Built a fraud smart contract detection system based on the code and external factors such as the outflow and inflow of money into the contract, the website and the promised return, and the reputation of founders on social media, etc.
Technologies: Amazon SageMaker, Amazon EC2, Deep Learning, Natural Language Processing (NLP), GPT, Generative Pre-trained Transformers (GPT), Computer Vision, Predictive Modeling, Statistical Modeling, Machine Learning Operations (MLOps), Data Modeling, PySpark, Amazon S3 (AWS S3), Amazon Machine Learning, Leadership

Data Scientist

2022 - 2022
Helium Billboard Partners, LLC
  • Developed a machine learning model to identify where to deploy a given helium Node.js to maximize the payout given the various resource constraints.
  • Built a pipeline to extract useful information from blockchain and various other sources, such as Google Maps API for geolocation data.
  • Presented the results in weekly meetings to management and others.
Technologies: Data Science, Blockchain, Big Data, Data Visualization, Data Analysis, Data Analytics, Geospatial Analytics

Applied Scientist

2020 - 2022
Amazon UK
  • Worked on information extraction from structured and semi-structured sources on the web to populate the KG of Alexa via automation.
  • Built and published state-of-the-art approaches for superior information extraction from web tables and aligning them to our knowledge graph.
  • Worked on and improved the semantic question understanding and aggregate fact generation for Alexa.
Technologies: Python 3, PyTorch, Machine Learning, GPT, Generative Pre-trained Transformers (GPT), Natural Language Processing (NLP), CI/CD Pipelines, Machine Learning Operations (MLOps), Data Modeling, PySpark, Amazon S3 (AWS S3), Amazon Machine Learning

Senior NLP Research Scientist

2019 - 2020
MediaTek Research UK
  • Developed an approach for natural language understanding on a device with various constraints such as memory and power.
  • Developed algorithms for generating artificial data for training deep learning models that would otherwise require expensive and time-consuming labeled data collection processes.
  • Created tools and scripts to allow easy model-training, graph plotting, and the transfer of scripts to GPU servers.
Technologies: Deep Learning, GPT, Natural Language Processing (NLP), Generative Pre-trained Transformers (GPT), PyTorch, Python 3

Senior Research Associate

2017 - 2019
Cochrane
  • Created a state-of-the-art approach for identifying (biomedical) scientific papers that are useful for a systematic review from a long list with a high recall/precision.
  • Built a state-of-the-art machine learning algorithm for tagging biomedical paper abstracts with labels denoting the PICO (population, intervention, outcome) characteristics of the trial described in the paper.
  • Developed APIs in Flask and Python to provide the SD teams at IoE-UCL and Cochrane to use SOTA text classification models in their workflow.
Technologies: GPT, Natural Language Processing (NLP), Generative Pre-trained Transformers (GPT), Deep Learning, PyTorch, Pandas, Flask, Python, PySpark, Amazon S3 (AWS S3)

Researcher

2014 - 2015
Yahoo! Labs
  • Developed a new machine learning algorithm for user profile completion for inactive users with sparse user profiles using yahoo-news and yahoo-videos.
  • Improved news and video recommendation for cold-start users i.e., users that have liked or disliked very few items, with cutting edge state-of-the-art recommendation system algorithms.
  • Developed an approach for zero-shot (unseen) text classification to apply never-before-seen tags to URLs for bookmarking based on the contents of the webpage hosted at the URL.
Technologies: Recommendation Systems, Information Retrieval, MATLAB, Statistical Modeling, Data Modeling, PySpark

Software Engineer

2011 - 2012
vwo.com
  • Served as a full-stack developer on building the UI and backend of the WYSWYG website editing tool.
  • Implemented data mining techniques in Python to extract insights from user session data such as user-session clustering and pattern mining.
  • Created a new knowledge base for the company to reduce customer support requirements. Performed customer support for clients.
Technologies: Python, JavaScript, PHP

Relation Extraction using Explicit Context Conditioning

https://arxiv.org/abs/1902.09271
Relation Extraction (RE) aims to label relations between groups of marked entities in raw text. Most current RE models learn context-aware representations of the target entities used to establish a relation between them. This works well for intra-sentence RE and we call them first-order relations. However, this methodology can sometimes fail to capture complex and long dependencies. To address this, we hypothesize that at times, two target entities can be explicitly connected via a context token. We refer to such indirect relations as second-order relations and describe an efficient implementation for computing them. These second-order relation scores are then combined with first-order relation scores. Our empirical results show that the proposed method leads to state-of-the-art performance over two biomedical datasets.

Constructing Artificial Data for Fine-tuning for Low-resource Biomedical Text Tagging

https://arxiv.org/abs/1910.09255
Biomedical text tagging systems are plagued by the dearth of labeled training data. There have been recent attempts at using pre-trained encoders to deal with this issue. A pre-trained encoder provides a representation of the input text, which is then fed to task-specific layers for classification. The entire network is fine-tuned on the labeled data from the target task. Unfortunately, a low-resource biomedical task often has too few labeled instances for satisfactory fine-tuning. Also, if the label space is large, it contains few or no labeled instances for the majority of labels. Most biomedical tagging systems treat labels as indexes, ignoring the fact that these labels are often concepts expressed in natural language, e.g., `Appearance of a lesion on brain imaging.' To address these issues, we proposed constructing extra labeled instances using label-text (i.e., label's name) as input for the corresponding label-index (i.e., label's index). In fact, we proposed a number of strategies for manufacturing multiple artificial labeled instances from a single label.

Structured Multi-label Biomedical Text Tagging via Attentive Neural Tree Decoding

https://arxiv.org/abs/1810.01468
We proposed a model for tagging unstructured texts with an arbitrary number of terms drawn from a tree-structured vocabulary (i.e., an ontology). We treated this as a special case of sequence-to-sequence learning. The decoder begins at the root node of an ontological tree and recursively elects to expand child nodes as a function of the input text, the current node, and the latent decoder state. In our experiments, the proposed method outperformed state-of-the-art approaches on the important task of automatically assigning MeSH terms to biomedical abstracts.

Languages

Python 3, Python, SQL, Python 2, JavaScript, C++, PHP, Snowflake

Libraries/APIs

PyTorch, Scikit-learn, Matplotlib, NumPy, PySpark, Pandas

Paradigms

Data Science, Agile, Management, Compiler Design, Object-oriented Programming (OOP)

Platforms

Linux, Jupyter Notebook, Amazon Web Services (AWS), Azure, Amazon EC2, Blockchain

Storage

Database Management Systems (DBMS), SQLite, Amazon S3 (AWS S3), Databases

Other

Deep Learning, Natural Language Understanding (NLU), Natural Language Processing (NLP), Scientific Data Analysis, Machine Learning, Statistics, Data Mining, Information Retrieval, Recommendation Systems, Artificial Intelligence (AI), GPT, Generative Pre-trained Transformers (GPT), Machine Learning Operations (MLOps), Data Modeling, Model Development, Amazon Machine Learning, Team Leadership, Predictive Modeling, Leadership, Software Development, Web Programming, Algorithms, Data Structures, NLU, Solution Architecture, Cloud, Computer Vision, CI/CD Pipelines, Statistical Modeling, Parallelization, Large Data Sets, Feature Engineering, Regression Modeling, Classification Algorithms, Big Data, Data Visualization, Data Analysis, Data Analytics, Geospatial Analytics

Frameworks

Flask, Hadoop, Spark

Tools

Git, MATLAB, Amazon SageMaker

2015 - 2019

Ph.D. in Natural Language Processing

University College London - London, England

2012 - 2014

Master's Degree in Machine Learning

Pierre and Marie Curie University - Paris

2007 - 2011

Engineer's Degree in Computer Science

Delhi College of Engineering - Delhi

SEPTEMBER 2020 - PRESENT

Architecting on AWS

Amazon Web Services

Collaboration That Works

How to Work with Toptal

Toptal matches you directly with global industry experts from our network in hours—not weeks or months.

1

Share your needs

Discuss your requirements and refine your scope in a call with a Toptal domain expert.
2

Choose your talent

Get a short list of expertly matched talent within 24 hours to review, interview, and choose from.
3

Start your risk-free talent trial

Work with your chosen talent on a trial basis for up to two weeks. Pay only if you decide to hire them.

Top talent is in high demand.

Start hiring