Raza Abbas, Developer in Karachi Central, Sindh, Pakistan

Raza Abbas

Verified Expert in Engineering

Data Scientist and Software Developer

Karachi Central, Sindh, Pakistan

Toptal member since February 19, 2022

Bio

Raza is a data scientist with a core interest in natural language processing. He recently earned his master's degree in data science, graduating summa cum laude with a 4.0/4.0 CGPA. He has worked full-time at top AI companies, contributing to their data science projects.

Portfolio

Securiti.ai
Data Science, Natural Language Processing (NLP)...
Afiniti
Data Science, Natural Language Processing (NLP)...
IBM
Python, Data Science

Experience

  • Machine Learning - 4 years
  • Data Science - 4 years
  • Natural Language Processing (NLP) - 4 years
  • Python - 4 years
  • Generative Pre-trained Transformers (GPT) - 4 years
  • Artificial Intelligence (AI) - 4 years
  • Data Analysis - 4 years
  • Java - 3 years

Availability

Part-time

Preferred Environment

Windows, macOS, Python 3, Slack, Jupyter Notebook, Google Colaboratory (Colab), Java, IntelliJ IDEA

The most amazing...

...project I've done is analyzing the effects of social network analysis on Twitter interactions and generating synthetic data while preserving privacy.

Work Experience

Data Scientist

2019 - PRESENT
Securiti.ai
  • Developed personal data detection modules and a supporting framework.
  • Built ML and DL-based models for classification and detection across various use cases.
  • Contributed to DL models that produce synthetic data for customer requests while complying with privacy guidelines.
Technologies: Data Science, Natural Language Processing (NLP), Generative Pre-trained Transformers (GPT), Machine Learning

Analyst Software Engineer | Data Science

2017 - 2019
Afiniti
  • Developed an in-house data analysis tool for caller-agent interactions.
  • Contributed to ad-hoc data analysis queries for customer requests.
  • Helped develop predictive modeling to enhance efficacy across multiple fronts.
Technologies: Data Science, Natural Language Processing (NLP), Generative Pre-trained Transformers (GPT), Machine Learning

Advanced Analytics Intern

2017 - 2017
IBM
  • Contributed to multiple natural language processing POCs for different industries, built on top of IBM services.
  • Designed and developed approaches to leverage IBM Discovery services for in-house data gathering and analysis efforts.
  • Worked on ad-hoc customer-centric chatbot use cases.
Technologies: Python, Data Science

Experience

Visual Analytics and Data Exploration Tool for Afiniti

Worked on an in-house data analysis and predictive analytics tool for Afiniti, built to handle caller-agent data in a more purposeful and data-centric manner. I worked mainly on the predictive analytics part.

Synthetic Data Generation

Wrote pipelines and DL-based models for synthetic data generation that adhere to differential privacy guidelines. The project handled the generation of sensitive tabular data while ensuring no data leaks.
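
As a rough illustration of what differentially private synthetic data can look like, the sketch below uses a much simpler classical technique than the DL-based models described above: the Laplace mechanism applied to the histogram of one categorical column, followed by sampling from the noisy histogram. It is not the project's method. The function names, the `categories` domain, and the `epsilon` value are all hypothetical.

```python
import random
from collections import Counter


def laplace_noise(scale, rng):
    # The difference of two exponential draws with mean `scale`
    # follows a Laplace(0, scale) distribution.
    return rng.expovariate(1.0 / scale) - rng.expovariate(1.0 / scale)


def synthesize_column(values, categories, epsilon, n_samples, seed=0):
    """Sample a synthetic categorical column from a noisy histogram.

    `categories` is assumed to be a fixed, publicly known domain; deriving it
    from the private data would itself leak information.
    """
    rng = random.Random(seed)
    counts = Counter(values)
    # Adding or removing one record changes a single count by 1, so the
    # sensitivity is 1 and the Laplace scale is 1 / epsilon.
    noisy = [
        max(counts[c] + laplace_noise(1.0 / epsilon, rng), 0.0)
        for c in categories
    ]
    if sum(noisy) == 0:
        noisy = [1.0] * len(categories)  # fall back to uniform sampling
    # Clamping and sampling are post-processing steps on the noisy counts.
    return rng.choices(categories, weights=noisy, k=n_samples)


if __name__ == "__main__":
    real = ["a", "a", "b"]  # toy stand-in for a sensitive column
    print(synthesize_column(real, ["a", "b", "c"], epsilon=1.0, n_samples=10))
```

A smaller `epsilon` adds more noise, which strengthens the privacy guarantee but makes the synthetic column less faithful to the original distribution.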

Chicago Crimes Data Analysis | Academic Project

https://drive.google.com/file/d/0Bzf4mLCCJmpBSmtjZUdXN0NpU1k/view?resourcekey=0-X0GkGmQFB_lPzVLC3Bb-sg
Analyzed the Chicago Crimes dataset to uncover hidden patterns. The project achieved a top position in the IBA Data Analytics competition. The linked project report gives an overview of the analysis.

Calibrating Classifiers to Penalize Overconfident Predictions

This project was my master's thesis. In this work, I probed the shortcomings of contemporary regularization methods and proposed a regularizer that penalizes overconfident predictions only in proportion to the magnitude of their incorrectness. This enabled us to train well-calibrated classifiers that are sensitive to misclassifications and whose confidences vary significantly across test samples. As a result, it is possible to detect a misclassification by examining the class-conditional confidences.
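
The thesis formulation is not given here, so the following is only a minimal sketch of one possible reading of the idea: a cross-entropy loss plus a penalty that is zero for correct predictions and, for incorrect ones, grows with the gap between the confidence in the wrong class and the confidence in the true class. The function `penalized_loss` and its `weight` parameter are hypothetical and should not be taken as the regularizer actually proposed.

```python
import math


def softmax(logits):
    # Subtract the maximum logit for numerical stability.
    m = max(logits)
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]


def penalized_loss(logits, true_class, weight=1.0):
    """Cross-entropy plus a hypothetical overconfidence penalty."""
    probs = softmax(logits)
    cross_entropy = -math.log(probs[true_class])
    predicted = max(range(len(probs)), key=probs.__getitem__)
    if predicted == true_class:
        penalty = 0.0  # correct predictions are not penalized
    else:
        # Penalize by how far the wrong-class confidence exceeds
        # the confidence assigned to the true class.
        penalty = probs[predicted] - probs[true_class]
    return cross_entropy + weight * penalty


if __name__ == "__main__":
    print(penalized_loss([2.0, 0.0], true_class=0))  # correct prediction
    print(penalized_loss([2.0, 0.0], true_class=1))  # confident and wrong
```

Under this reading, a confidently wrong prediction costs more than a hesitantly wrong one, while correct predictions are left to the cross-entropy term alone.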

Education

2018 - 2020

Master's Degree in Data Science

National University of Computer and Emerging Sciences - Islamabad, Pakistan

Skills

Libraries/APIs

Scikit-learn

Tools

Slack, IntelliJ IDEA

Languages

Python, Java, R, Python 3

Platforms

Windows, macOS, Jupyter Notebook

Other

Data Science, Natural Language Processing (NLP), Machine Learning, Data Analysis, Generative Pre-trained Transformers (GPT), Artificial Intelligence (AI), Social Network Analysis, Time Series Analysis, Google Colaboratory (Colab), Deep Learning
