Ahmed Bahaaeldin Mohammed, Developer in Cairo, Cairo Governorate, Egypt
Ahmed is available for hire
Hire Ahmed

Ahmed Bahaaeldin Mohammed

Verified Expert  in Engineering

Data Scientist and Software Developer

Cairo, Cairo Governorate, Egypt

Toptal member since November 12, 2021

Bio

Ahmed is a passionate data scientist interested in solving challenging business problems. He has been working on time series and NLP-related issues for nearly three years at VOIS. He designs novel deep learning architectures that suit the clients' needs, accompanied by statistical inference and modeling. Ahmed helped startups launch MVP products rapidly using state-of-the-art AI models.

Portfolio

Zewail University
Python 3, Natural Language Toolkit (NLTK), Jupyter Notebook, PyTorch...
Microsoft
Azure, Python 3, Natural Language Processing (NLP), Torch AI, OpenAI...
VoiceOps
Python, Deep Learning, Generative Pre-trained Transformers (GPT)...

Experience

  • Generative Pre-trained Transformers (GPT) - 4 years
  • Deep Learning - 4 years
  • Natural Language Processing (NLP) - 4 years
  • Natural Language Understanding (NLU) - 4 years
  • Python 3 - 4 years
  • Machine Learning - 3 years
  • PyTorch - 3 years
  • Recommendation Systems - 3 years

Availability

Full-time

Preferred Environment

Visual Studio Code (VS Code)

The most amazing...

...project I've developed was novel architecture to solve time-series problems combining text input.

Work Experience

Teaching Assistant

2021 - PRESENT
Zewail University
  • Designed the course flow for the assignments and code review.
  • Created Jupyter notebook tutorials to summarize basic NLP skills.
  • Explained classical approaches for solving NLP problems.
Technologies: Python 3, Natural Language Toolkit (NLTK), Jupyter Notebook, PyTorch, Artificial Intelligence (AI)

Applied Scientist II

2023 - 2025
Microsoft
  • Worked on content moderation for Microsoft Bing comments where I developed complex models for the classification of toxicity of each comment.
  • Worked on large-scale solutions for thousands of requests per minute.
  • Constructed a knowledge base representing the relationship of the topic of the article with the reflected toxicity of people (e.g., to detect what triggers people the most).
Technologies: Azure, Python 3, Natural Language Processing (NLP), Torch AI, OpenAI, OpenAI GPT-4 API, Generative Pre-trained Transformers (GPT), Scikit-learn, NumPy, Pandas, SQL, Artificial Intelligence (AI)

ML Engineer

2021 - 2022
VoiceOps
  • Created a novel text-to-text Longformer model that takes calls from call centers and transforms them properly to be shown. Types of ransformations include redaction, punctuation, and diarization.
  • Created a "show similar statement" end-to-end pipeline that encoded more than one million events. Users can query most similar events in less than ten seconds.
  • Created a clustering pipeline that uses hierarchical clustering combined with dimensionality reduction techniques like (UMAP and PCA) to improve the speed of creating the clusters.
Technologies: Python, Deep Learning, Natural Language Processing (NLP), Generative Pre-trained Transformers (GPT), Machine Learning, Recommendation Systems, Clustering, Artificial Intelligence (AI)

Senior Data Scientist

2019 - 2022
Vodafone Intelligent Solutions (VOIS)
  • Developed a deep learning framework for time series data where the input is a CSV file and the whole training, feature engineering, and preprocessing takes place. Designed multiple deep learning architectures.
  • Used the RoBERTa base model finetuned on Stanford NLI dataset for question mapping with FAQ questions in a database with a 74% hit-miss rate. Other trials used BERT, ALBERT, large, and xlarge.
  • Improved the ticket routing system by using the XLNet-based model to classify tickets and route them to the resolver group, resulting in an improvement from 60% to 92%.
  • Held training for junior and fresh grad data scientists.
Technologies: Python 3, Deep Learning, APIs, Algorithms, Big Data, Artificial Intelligence (AI)

Deep Learning Consultant

2018 - 2020
Arete Global
  • Created novel grammar classification architecture with accuracy 82% with ALBERT-base embedded in Conv1D local attention model.
  • Developed a factoid question answering system with neural dependency parser and Question recommendation for interviewing bots using the dot product of RoBERTa large embeddings.
  • Managed a group of fresh grads and upskilled them in deep learning and NLP.
Technologies: Python, Deep Learning, Computer Vision, Generative Pre-trained Transformers (GPT), Natural Language Processing (NLP)

Experience

Pain Point Detection

https://github.com/ahmedbahaaeldin/Pain-Point-Detection
This project revolved around trying to predict the pain point that the user is talking about in the given sentence, like the company image, the quality of the product, and customer service. It is generally a multi-class classification problem.

I mainly used pre-trained transformers and applied multiple experimentations to find the best solution. Moreover, I tried various data augmentation techniques to enrich the dataset and check which augmentation technique improved the overall performance.

Cross-lingual NLP Services

An end-to-end pipeline for text similarity and text classification with support for eight different languages. The pipeline contained different routes to translate and normalize any language to English or use as-is for cross-lingual work. The models were trained in a multilingual fashion and fine-tuned using only the English language, then used zero-shot learning for other languages.

Education

2019 - 2021

Master's Degree in Machine Learning

Cairo University - Cairo, Egypt

2012 - 2017

Bachelor's Degree in Computer Engineering

Cairo University - Cairo, Egypt

Certifications

JUNE 2019 - PRESENT

Udacity PyTorch Scholarship

Udacity

JULY 2018 - PRESENT

Deep Learning Specialization

Coursera

APRIL 2018 - PRESENT

DEV288x: Natural Language Processing (NLP)

Microsoft

Skills

Libraries/APIs

PyTorch, TensorFlow, Pandas, Natural Language Toolkit (NLTK), Torch AI, Scikit-learn, NumPy

Tools

Amazon SageMaker

Languages

Python 3, C++, Python, SQL

Platforms

Jupyter Notebook, Amazon Web Services (AWS), Visual Studio Code (VS Code), Azure

Other

Machine Learning, Deep Learning, Natural Language Processing (NLP), Natural Language Understanding (NLU), Graph Neural Networks, Transformers, Recurrent Neural Networks (RNNs), Convolutional Neural Networks (CNNs), Recommendation Systems, Generative Pre-trained Transformers (GPT), Artificial Intelligence (AI), Computer Vision, Machine Learning Operations (MLOps), Amazon Machine Learning, Data Structures, Algorithms, APIs, Big Data, Generative Adversarial Networks (GANs), Clustering, OpenAI, OpenAI GPT-4 API

Collaboration That Works

How to Work with Toptal

Toptal matches you directly with global industry experts from our network in hours—not weeks or months.

1

Share your needs

Discuss your requirements and refine your scope in a call with a Toptal domain expert.
2

Choose your talent

Get a short list of expertly matched talent within 24 hours to review, interview, and choose from.
3

Start your risk-free talent trial

Work with your chosen talent on a trial basis for up to two weeks. Pay only if you decide to hire them.

Top talent is in high demand.

Start hiring