
Muhammad Khubaib Raza
Verified Expert in Engineering
Machine Learning Engineer and Developer
Lahore, Punjab, Pakistan
Toptal member since October 13, 2022
Khubaib is a full-stack machine learning engineer who specializes in natural language processing (NLP). He's a problem solver with over six years of experience developing end-to-end machine learning systems using NLP and MLOps and scaling them to run in production environments. He has worked on a wide range of projects involving chatbot development, artificial intelligence, and NLP. Khubaib holds a master's degree in data science covering NLP, machine learning, and deep learning.
Portfolio
Experience
- Deep Learning - 6 years
- Data Science - 4 years
- Machine Learning - 4 years
- Web Development - 4 years
- Natural Language Processing (NLP) - 4 years
- Generative Pre-trained Transformers (GPT) - 4 years
- Amazon Web Services (AWS) - 2 years
- Azure - 2 years
Availability
Preferred Environment
Python 3, Transformers, PyTorch, Pandas, PySpark, Scikit-learn, Generative Pre-trained Transformers (GPT), AutoML, ChatGPT, OpenAI
The most amazing...
...thing I've achieved is participating in NextGrid's GPT-3 Hackathon and finishing in 3rd place.
Work Experience
Head of Engineering
Metric
- Shaped the product vision from a non-technical position, including alignment for product-market fit.
- Managed a team of back-end, front-end, and Android developers using the Kanban methodology to deliver the work.
- Contributed to the product's technical and active development.
- Installed the application on 40,000+ devices. It's used by people from 30 different business sectors. The customer uses this application to track business insights.
- Conducted website crawling and data scraping to assist in company research and marketing analytics.
Machine Learning Engineer
Self-employed
- Designed, prototyped, developed, and deployed (Docker) systems based on machine learning models, especially in the NLP field.
- Conducted information extraction from raw data, such as PDF documents.
- Created crawlers for downloading data from various sources.
- Worked on named entity recognition in medical data, performing manual annotation on the data.
Senior NLP Engineer
matix
- Worked on different NLP projects, with a focus on classification and information extraction from documents.
- Implemented back-end services that use Asterisk and Twilio.
- Used Vosk API and Google Speech-to-Text (STT) to stream speech data and convert it into text.
Experience
Automatic Description Generation From Images
The apartment photo database includes the price of the apartment, floor, latitude, longitude, nearby places, and whether it has parking. First, we had to eliminate duplicate photos from one apartment, so we would not detect items repeatedly. In the second step, the photos are then classified into room types. In the third step, count the number of bedrooms, baths, and other rooms, as well as detect the amenities. The fourth step is to create a dataset for description generation, and last but not least, to generate the description from the transformer model.
ChatGPT Powered Voice-based Customer Support
The system's functionality enables customers to call a business's Twilio phone number and ask a query, which ChatGPT then processes to provide personalized responses based on the customer's input. The ChatGPT model has been trained to understand and respond to a wide range of customer queries.
Information Extraction From Invoices
Education
Master's Degree in Data Science
Information Technology University - Lahore, Pakistan
Bachelor's Degree in Computer Science
Government College University - Lahore, Pakistan
Certifications
Machine Learning in Production
Coursera
AWS Machine Learning Engineer Scholarship Program
Udacity
Getting Started with AWS Machine Learning
Coursera
Deep Learning Nanodegree
Udacity
Introduction to Data Science in Python
Coursera
Skills
Libraries/APIs
Node.js, Google Speech-to-Text API, Python API, Pandas, Matplotlib, Twilio API, Sockets, API Development, PyTorch, Scikit-learn, PySpark, Rasa NLU
Tools
ChatGPT, Zapier, Amazon SageMaker, Amazon Elastic Container Registry (ECR), Amazon Elastic Container Service (ECS), BigQuery, Seaborn, Amazon Cognito, Amazon Textract, AutoML
Languages
Python, Python 3
Platforms
Google Cloud Platform (GCP), Azure, Amazon Web Services (AWS), AWS Lambda, Amazon EC2, Twilio, Firebase, Mixpanel, Docker
Frameworks
Flask
Paradigms
ETL, Object-oriented Programming (OOP)
Storage
Databases, Amazon S3 (AWS S3), Google Cloud, MongoDB, Amazon DynamoDB
Other
Natural Language Processing (NLP), Deep Learning, Artificial Intelligence (AI), Generative Pre-trained Transformers (GPT), AI Chatbots, Model Tuning, API Integration, Web Development, Machine Learning Operations (MLOps), Web App Deployment, OpenAI, OpenAI GPT-4 API, OpenAI GPT-3 API, Text Analytics, BERT, Large Language Models (LLMs), Machine Learning, Big Data, Information Retrieval, Data Science, Model Development, Chatbots, Data Structures, Algorithms, Data Warehousing, Custom Solutions, Android App Design, Model Deployment, Generative Pre-trained Transformer 3 (GPT-3), Computer Vision, Classification, Amazon API Gateway, System Architecture, Transformers, FastAPI, Scripting, CI/CD Pipelines, Optical Character Recognition (OCR), Transformer Models
How to Work with Toptal
Toptal matches you directly with global industry experts from our network in hours—not weeks or months.
Share your needs
Choose your talent
Start your risk-free talent trial
Top talent is in high demand.
Start hiring