Surbhi Gupta
Verified Expert in Engineering
Data Scientist and Machine Learning Developer
Jalpaiguri, West Bengal, India
Toptal member since November 2, 2021
Surbhi, previously a CTO at a GenAI startup and assistant professor at MUJ, is a generative AI, ML, and NLP expert with 5+ years of experience. She has designed and developed ML-based end-to-end solutions for startups at Toptal and Fortune 500 clients at Utopia. Her expertise includes ML, deep learning, NLP, computer vision, LLMs, GPT, AI, MLOps, and AWS. Surbhi solved problems in EAM, marketing, finance, chatbot, and crypto industries. She published research in robotics and optimization.
Portfolio
Experience
- Artificial Intelligence (AI) - 5 years
- Robotics - 5 years
- Machine Learning - 5 years
- Python - 5 years
- Generative Pre-trained Transformers (GPT) - 4 years
- Computer Vision - 4 years
- Natural Language Processing (NLP) - 4 years
- Deep Learning - 3 years
Availability
Preferred Environment
Python, TensorFlow, Scikit-learn, OpenCV, Hugging Face, OpenAI, PyTorch, Amazon Web Services (AWS), Generative Pre-trained Transformers (GPT)
The most amazing...
...generative AI solution I've developed interacts with users to identify their brand's purpose and generates BVP and marketing content with text and images.
Work Experience
LLM Expert
Freelancing
- Developed an innovative application to automatically curate and update newsletters. Leveraged LLMs to rewrite news articles, ensuring that the content resonated with the unique characteristics and interests of the target audience.
- Used AI to generate dynamic weather reports for cities nationwide. Taking daily weather forecasts as input, the application wrote interesting weather reports tailored to each city's specific climate conditions.
- Orchestrated the development of a robust back-end infrastructure hosted on AWS, utilizing a diverse array of services including Step Functions, EventBridge Scheduler, Bedrock, DynamoDB, Amplify, and Lambda functions.
- Implemented automated workflow processes using AWS Step Functions, enabling efficient content aggregation, transformation, and distribution.
- Integrated Google Programmable Search Engine API, News APIs, and Google Maps API into the application to augment its functionality and provide users with comprehensive and up-to-date information.
Co-founder and CTO
Freelance Client
- Secured significant investment capital for a company in the SAFE round-through, effective investor engagement, and a clear explanation of the technology.
- Developed the first version of the product, meeting all key functionality requirements.
- Led a technical due diligence process assessed by a renowned AI company and investors.
- Spearheaded the establishment of a talented team through effective interviewing and assessment methods.
- Utilized OpenAI models, effective prompt-engineering strategies, and few-shot learning to pioneer AI-led conversations with human users. Extracted valuable insights from these interactions and generated impactful marketing propositions.
- Designed a feedback mechanism to collect training data from field experts for fine-tuning the models.
GPT-3 Expert
Alec Beglarian
- Developed an MVP for email generation using OpenAI GPT-3 APIs.
- Developed the MVP on the AWS cloud platform, which is integrated with a database, storage, Lambda functions, Amazon SES, etc.
- Fine-tuned the OpenAI model for data correction to be used for email generation.
ML Developer
SimpliCapital LLC
- Deployed machine learning models with AWS cloud platform, using services like lambda functions, Amazon SageMaker, Amazon SNS, Amazon S3, etc.
- Improved machine learning model performance for prediction of finance data.
- Created Amazon SageMaker training and inference pipelines for ML models.
AI Specialist | NLP Python Developer
Toptal Client
- Improved the NLP solution to identify business prospects in financial data.
- Used BERT-based POS tagging to extract important features from large documents.
- Made the solution interpretable by identifying words used to mark sentences relevant to business prospects.
- Used a Hugging Face transformer model for semantic similarity analysis.
AI Specialist | Python and ML Developer
Daylight
- Used OpenAI for solving Q&A and document query problems for a chatbot.
- Provided an open-source alternative to the OpenAI document query solution with better accuracy.
- Identified groups of chat clusters using agglomerative clustering based on semantic similarity.
AI Specialist | Python and ML Developer
Freelance
- Performed stance detection and topic modeling on social media data, using unsupervised and semi-supervised methods.
- Fine-tuned a pre-trained seq2seq transformer model for custom summarization tasks.
- Used NLP performance evaluation metrics like BERTscore and ROUGE score for NLP tasks and achieved a score of 0.89 BERTscore on the summarization task.
Senior Data Science Engineer
Utopia
- Developed an end-to-end machine learning solution for information extraction from scanned documents and diagrams that brought a good deal with a Fortune 100 company. Deployed the project to the client as a cloud application.
- Built a solution to identify equipment classes from descriptions and tags, which was used to deliver services to several clients.
- Created a solution that enables identifying valid values from product descriptions in material master data used to deliver services to various clients.
- Developed a solution to identify different shapes, tables, and text in diagram images. This required the application of several computer vision, image processing, deep learning, and machine learning techniques.
Assistant Professor
Manipal University Jaipur
- Lectured subjects like robotics and mechatronics system design, including topics on machine learning, artificial intelligence, and sensors.
- Conducted laboratory experiments to give students hands-on experience on MATLAB, control systems, and sensors.
- Conducted term papers and online quizzes and evaluated the performance of the students.
Senior Research Fellow
CSIR-Central Scientific Instruments Organisation
- Optimized the design of a minimally invasive surgical robotic arm and formulated the kinematic control for trajectory tracking by its end-effector. Published two papers on this work.
- Improved the design of a passive bipedal robot and underactuated it in simulation to walk stably on steep slopes of zero to 30 degrees. Published two papers on this work.
- Taught subjects like industrial control and robotics to diploma-level students.
Experience
A Brief Review of Dynamics and Control of Underactuated Biped Robots
The article is available at the following link: https://www.tandfonline.com/doi/full/10.1080/01691864.2017.1308270
Split Compound Words
https://github.com/droid-surbhi/split-compound-wordsFake Vs. Real News Classification
https://www.kaggle.com/surbhig/classification-fake-vs-news-95-accuracyOptimization Using Meta-heuristics
https://github.com/droid-surbhi/OptimizationDesign Optimization of Minimally Invasive Surgical Robot
https://doi.org/10.1016/j.asoc.2015.03.032LighVe: Music Synced Lights
Kinematic Control of An Articulated Minimally Invasive Surgical Robotic Arm
https://ieeexplore.ieee.org/document/7853054Clustering Utilities
https://github.com/droid-surbhi/clusteringResume Classification
https://github.com/droid-surbhi/resume_classification• It uses latent Dirichlet allocation for topic modeling and counts vectorizer for vectorization.
• It also visualizes groups using Word Cloud.
Algorithm Design
https://github.com/droid-surbhi/algorithm_designMotion Planning
https://github.com/droid-surbhi/motion_planning/blob/main/simpleMotion.ipynbVisiting Card Creator
https://chatgpt.com/g/g-jjCYHUH5o-visiting-card-creatorCharacter Imitation by AI: Professor Dumbledore
https://chatgpt.com/g/g-LI8gD4kNS-professor-dumbledoreEducation
Master's Degree in Mechatronics
Indian Institute of Engineering Science and Technology - Kolkata, India
Bachelor's Degree in Electronics and Communication Engineering
Bundelkhand University - Jhansi, India
Certifications
AWS Certified Cloud Practitioner
Amazon Web Services
Introduction to Containers
AWS
Introduction to AWS Elastic Beanstalk
AWS
Sequence Models
DeepLearning.AI
Introduction to Tensorflow for Artificial Intelligence, Machine Learning, and Deep Learning
DeepLearning.AI | via Coursera
Git Complete: The Definitive, Step-by-step Guide to Git
Udemy
Neural Networks and Deep Learning
DeepLearning.AI | via Coursera
6.00.1x: Introduction to Computer Science and Programming Using Python
MITx
Control of Mobile Robots
Coursera
Machine Learning
Coursera
Skills
Libraries/APIs
Pandas, PyTorch, LSTM, TensorFlow, Scikit-learn, OpenCV, Keras, Matplotlib, AWS Amplify, Vue, NumPy, React, Beautiful Soup, Node.js
Tools
Git, ChatGPT, Named-entity Recognition (NER), MATLAB, You Only Look Once (YOLO), Amazon SageMaker, Confluence, OpenAI Gym, Jupyter
Languages
Python, SQL, C++, JavaScript, TypeScript
Frameworks
Streamlit
Platforms
AWS Lambda, Amazon Web Services (AWS), Docker, AWS Elastic Beanstalk
Storage
Amazon S3 (AWS S3), Cloud Deployment, Google Cloud
Other
Robotics, Artificial Intelligence (AI), Machine Learning, Deep Learning, Computer Vision, Natural Language Processing (NLP), Optical Character Recognition (OCR), Neural Networks, Tesseract, Data Science, Research, Entity Extraction, Classification, Image Recognition, Data Analysis, OpenAI, Machine Vision, Large Language Models (LLMs), Generative Pre-trained Transformers (GPT), Chatbots, Fine-tuning, Retrieval-augmented Generation (RAG), OpenAI GPT-4 API, Embeddings from Language Models (ELMo), University Teaching, Control Systems, Underactuation, Convolutional Neural Networks (CNNs), Object Detection, Transfer Learning, Text Detection, Machine Learning Operations (MLOps), Graphics Processing Unit (GPU), Transformers, BERT, Sequence Models, Algorithms, Time Series Analysis, Time Series, Generative Pre-trained Transformer 3 (GPT-3), GPU Computing, Team Leadership, Cloud, Code Review, Source Code Review, Technical Hiring, Interviewing, Minimum Viable Product (MVP), Open-source LLMs, Image Processing, Mechatronics, Optimization, Metaheuristics, Robot Operating System (ROS), Gated Recurrent Unit (GRU), Containers, Technical Writing, Publication, Simulations, Mathematics, Clustering, Web Scraping, Semantics, Generative Adversarial Networks (GANs), Hugging Face, Unsupervised Learning, Topic Modeling, Data Visualization, DALL-E, OpenAI GPT-3 API, User Feedback, Few-shot Learning, Motion Planning, ChatGPT API, Amazon Bedrock, Data Scraping, PDF Scraping, Recursion Testing
How to Work with Toptal
Toptal matches you directly with global industry experts from our network in hours—not weeks or months.
Share your needs
Choose your talent
Start your risk-free talent trial
Top talent is in high demand.
Start hiring