Salman Ahmed

Verified Expert in Engineering

Artificial Intelligence Engineer and Developer

Houston, TX, United States

Toptal member since August 11, 2022

Expertise

Machine Learning, Artificial Intelligence, NLP, Computer Vision, Cloud Engineering, Deep Learning, Neural Network, Data Science, LLM, Algorithms, Startup Development, Prompt Engineering

Bio

Salman is a principal AI engineer with 13+ years of experience designing and implementing AI solutions in top Fortune 500 companies (Google, CVS Health, and Moss). He's skilled in designing and developing end-to-end solutions that align with business objectives. Salman is an effective leader who has led teams of 15+ and delivered estimated cost savings of $18 million per quarter.

Portfolio

Nasser Alsabah

Diffusion Models, PyTorch, Image Generation, Artificial Intelligence (AI), Flux...

Algomarketing Ltd

Machine Learning, Artificial Intelligence (AI)...

HeroikStrategies LLC

Python, Open-source LLMs, ChatGPT, Llama 3, Fine-tuning, Full-stack, Docker...

Experience

Machine Learning - 10 years
Python - 9 years
Computer Vision - 7 years
PyTorch - 6 years
Deep Learning - 6 years
Large Language Models (LLMs) - 5 years
Agentic AI - 3 years
JAX - 2 years

Preferred Environment

PyTorch, Large Language Models (LLMs), Hugging Face, Large Language Model Operations (LLMOps), Natural Language Processing (NLP), Agentic AI, AI Agents, Deep Learning, Computer Vision

The most amazing...

...project I've led is now live in production, delivering approximately $18 million in savings per quarter for a leading tech company.

Work Experience

Generative AI Engineer

2025 - 2026

Nasser Alsabah

Designed, implemented, and fine-tuned multiple diffusion models specialized for GCC culture.
Deployed multiple diffusion models at scale to serve hundreds of concurrent image generation requests.
Managed the team in building and improving image generation platforms and provided support for hundreds of different image generation models.

Technologies: Diffusion Models, PyTorch, Image Generation, Artificial Intelligence (AI), Flux, SDXL, Hugging Face, LoRA, DreamBooth, ControlNet, ComfyUI, Azure Blob Storage, Azure Blobs, Blob Storage, AI Assistants, Agentic RAG Systems, Amazon Bedrock, Pydantic

Principal AI Engineer

2023 - 2025

Algomarketing Ltd

Designed, built, and implemented a deep learning-based AI pipeline for Google. It has saved 800+ hours per month for Google employees worldwide.
Contributed to the deep learning algorithm that suggests the next best actions for the sales and marketing team to improve leads.
Contributed to designing and implementing an LLM-based POC for a personalized QA bot to do runtime data analysis and perform recommendations.

Technologies: Machine Learning, Artificial Intelligence (AI), Natural Language Processing (NLP), Python, Google Cloud Platform (GCP), Language Models, AI Agents, Retrieval-augmented Generation (RAG), Rapid Prototyping, LangChain, Azure AI Studio, Generative Artificial Intelligence (GenAI), Team Leadership, Gemini, Prompt Engineering, English, Outbound Marketing, Data Modeling, Modeling, LlamaIndex, TypeScript, Architecture, Chatbot Conversation Design, Minimum Viable Product (MVP), CI/CD Pipelines, Machine Learning Operations (MLOps), Discord, OpenAI API, Data Interpretation, Jupyter, Open-source LLMs, Llama 3, AWS Glue, AWS Lambda, Amazon Rekognition, Data Engineering, Docker, Unicorn, Vector Search, Adversarial Machine Learning, Scikit-learn, Django, CrewAI, K-means Clustering, Hierarchical Clustering, Clustering Algorithms, Clustering, NumPy, DBSCAN, Recommendation Systems, Diffusion Models, Diffusion-based AI Models, Text to Image, Multi-agent Systems, FastAPI, Vector Databases, Meta Llama, Hugging Face Transformers, Bittensor, Context Parallelism, FSDP, Fully Sharded Data Parallelism (FSDP), Sequence Parallelism, Tensor Parallelism, Distributed Training, Transformers, Transformer Models, Large Language Model Operations (LLMOps), JupyterLab, MySQL, XGBoost, Hugging Face, OpenAI o1, Gemini API, Visual Studio Code (VS Code), Amazon Machine Learning, Feasibility Studies, Strategic Planning, Jupyter Notebook, Deep Neural Networks (DNNs), NLU, Convolutional Neural Networks (CNNs), Developing AI Models locally, Neural Network Pruning, Recurrent Neural Networks (RNNs), PyTorch Lightning, model quantization, AI Prompts, Claude, Data Cleansing, Lead Generation, Optical Character Recognition (OCR), Graphs, Automation, Construction, eCommerce, Reinforcement Learning, Natural Language Generation (NLG), AI Chatbots, Knowledge Bases, Predictive Modeling, Google Maps, Mapbox, JavaScript, React Native, CTO, Agile Software Development, Software Architecture, Marketing, 
LangGraph, AI Integration, Discord Bots, Web Scraping, Drones, Startups, Robotics, Fractional CTO, Full-stack Development, ACH Payments, SQL, Speech Recognition, React, Next.js, Node.js, Audio Streaming, Whisper, Low Latency, Agentic AI, Multimodal GenAI, Java, Data Scraping, Flask, GIS, PostGIS, Audio Processing, Digital Signal Processing, Real-time Audio Processing, Sound, Object-oriented Programming (OOP), Pure Data, ARIMA, Demand Forecasting, Joblib, LightGBM, Time Series Forecasting, Streamlit, Product Forecasts, Retail & Wholesale, SARIMA, Sales Forecasting, Data Science, Forecasting, GraphRAG, Biology, Kubernetes, Cloud, PySpark, Solution Architecture, Technical Leadership, AI-generated Code, Databricks, DevOps, Graph Databases, Model Context Protocol (MCP), Front-end Development, User Interface (UI), Google Cloud, Vertex AI, Data Aggregation, Data Cleaning, Real-time Systems, Speech-to-Text (STT), Statistical Modeling, Text-to-Speech (TTS), Amazon Polly, DSP, Voice AI, Scalable Architecture, Point of Sale, PG Vector, Go, Qdrant, n8n, Education, REST APIs, AI Pipeline, Anthropic, Zapier, AI Adoption, Strategy, Amazon S3 (AWS S3), Image Classification, Residual Neural Networks (ResNets), Foundation Models, Supervised Learning, Supervised Machine Learning, Vision Transformer (ViT), Self-Distillation with No Labels, AI Model Training, Anomaly Detection, Model Validation, Predictive Maintenance, Quantization, Sensor Data, Time Series Data, Unsupervised Learning, AI Algorithms, Package Distribution, Statistics, Containerization, Spring, Automation Engineering, Customer Relationship Management (CRM), Pinecone, Snowflake, Retail Technology, ControlNet, DreamBooth, LoRa, SDXL, ElevenLabs Solutions, Azure Text to Speech, Replit, Data Annotation, Image Annotation, Sentiment Analysis, CVAT, Technical Strategy, Custom Automation, R&D, Tech Research & Evaluation, Product Innovation, Product Design, AI Tool Assessment, Agentic Frameworks, Firecrawl, Role-based Access 
Control (RBAC), Platform as a Service (PaaS), JSON, HIPAA Compliance, Data Protection, C#, Auto Encoder, Binary Classification Models, System Architecture, Finance, Back-end Development, RAG Pipelines, Azure Blob Storage, Azure Blobs, Blob Storage, AI Assistants, Agentic RAG Systems, Amazon Bedrock, Pydantic

Lead AI Engineer

2024 - 2024

HeroikStrategies LLC

Deployed a Discord bot for game developers to analyze Steam reviews.
Designed and deployed a CI/CD pipeline on Google Cloud Platform (GCP).
Designed and deployed an LLM in the Discord bot so developers can chat with it about the live analysis.

Technologies: Python, Open-source LLMs, ChatGPT, Llama 3, Fine-tuning, Full-stack, Docker, Unicorn, Vector Search, Adversarial Machine Learning, Scikit-learn, Django, K-means Clustering, Hierarchical Clustering, Clustering Algorithms, Clustering, NumPy, Actor Model, DBSCAN, Recommendation Systems, Diffusion Models, Diffusion-based AI Models, Multi-agent Systems, Backtesting Trading Strategies, Trading Strategy Development, FastAPI, Vector Databases, Meta Llama, Hugging Face Transformers, Bittensor, Context Parallelism, FSDP, Fully Sharded Data Parallelism (FSDP), Sequence Parallelism, Tensor Parallelism, Distributed Training, Transformers, Transformer Models, Large Language Model Operations (LLMOps), JupyterLab, MySQL, XGBoost, Hugging Face, Visual Studio Code (VS Code), Amazon Machine Learning, Feasibility Studies, Strategic Planning, Jupyter Notebook, Deep Neural Networks (DNNs), NLU, Convolutional Neural Networks (CNNs), Developing AI Models locally, Neural Network Pruning, Recurrent Neural Networks (RNNs), PyTorch Lightning, model quantization, AI Prompts, Claude, Data Cleansing, Lead Generation, Optical Character Recognition (OCR), Graphs, Automation, Construction, Reinforcement Learning, Natural Language Generation (NLG), AI Chatbots, Knowledge Bases, Google Maps, Mapbox, JavaScript, React Native, CTO, Agile Software Development, Software Architecture, Marketing, LangGraph, AI Integration, Discord Bots, Web Scraping, Drones, Startups, Robotics, Fractional CTO, Full-stack Development, ACH Payments, SQL, Speech Recognition, Audio Streaming, Whisper, Low Latency, Agentic AI, Multimodal GenAI, Java, Data Scraping, Flask, GIS, PostGIS, Audio, Audio Processing, Digital Signal Processing, Real-time Audio Processing, Sound, Object-oriented Programming (OOP), Pure Data, ARIMA, Demand Forecasting, Joblib, LightGBM, Time Series Forecasting, Streamlit, Product Forecasts, Retail & Wholesale, SARIMA, Sales Forecasting, Data Science, Forecasting, GraphRAG, Biology, 
Kubernetes, Cloud, PySpark, Solution Architecture, Technical Leadership, AI-generated Code, Databricks, DevOps, Graph Databases, Model Context Protocol (MCP), Front-end Development, User Interface (UI), Google Cloud, Vertex AI, Data Aggregation, Data Cleaning, Real-time Systems, Speech-to-Text (STT), Statistical Modeling, Text-to-Speech (TTS), DSP, Voice AI, Scalable Architecture, Point of Sale, PG Vector, Qdrant, n8n, Education, REST APIs, AI Pipeline, Anthropic, AI Adoption, Strategy, Amazon S3 (AWS S3), Image Classification, Residual Neural Networks (ResNets), Foundation Models, Supervised Learning, Supervised Machine Learning, Vision Transformer (ViT), Self-Distillation with No Labels, AI Model Training, Anomaly Detection, Model Validation, Predictive Maintenance, Quantization, Sensor Data, Time Series Data, Unsupervised Learning, AI Algorithms, Machine Learning Operations (MLOps), Package Distribution, Statistics, Go, Containerization, Spring, Automation Engineering, Customer Relationship Management (CRM), Pinecone, Snowflake, Retail Technology, ControlNet, DreamBooth, LoRa, SDXL, ElevenLabs Solutions, Azure Text to Speech, Replit, Data Annotation, Sentiment Analysis, Technical Strategy, Custom Automation, R&D, Tech Research & Evaluation, Product Innovation, Product Design, AI Tool Assessment, Agentic Frameworks, Firecrawl, Platform as a Service (PaaS), JSON, HIPAA Compliance, Data Protection, C#, Auto Encoder, Binary Classification Models, System Architecture, Finance, Back-end Development, RAG Pipelines, Azure Blob Storage, Azure Blobs, Blob Storage, AI Assistants, Agentic RAG Systems, Pydantic

LLM Expert (GenAI)

2024 - 2024

Moss & Associates - Main

Developed the complete prototype of an AI architecture, including custom AI assistants.
Improved the assistants with a highly optimized parallel doc processing method.
Improved the knowledge base retrievals in the assistants.

Technologies: Large Language Models (LLMs), Generative Pre-trained Transformers (GPT), Artificial Intelligence (AI), Generative Pre-trained Transformer 3 (GPT-3), OpenAI, OpenAI GPT-3 API, OpenAI GPT-4 API, Bedrock, Amazon Web Services (AWS), Azure ML Studio, Azure, AI Agents, Retrieval-augmented Generation (RAG), Rapid Prototyping, LangChain, Azure AI Studio, Generative Artificial Intelligence (GenAI), Prompt Engineering, English, Data Modeling, Modeling, Architecture, Chatbot Conversation Design, Minimum Viable Product (MVP), CI/CD Pipelines, Machine Learning Operations (MLOps), Discord, OpenAI API, Data Interpretation, Jupyter, Open-source LLMs, Llama 3, Data Engineering, Django, K-means Clustering, Hierarchical Clustering, Clustering Algorithms, Clustering, NumPy, Algorithmic Trading, DBSCAN, Recommendation Systems, Diffusion Models, Diffusion-based AI Models, Text to Image, FastAPI, Vector Databases, Meta Llama, Hugging Face Transformers, Bittensor, Context Parallelism, FSDP, Fully Sharded Data Parallelism (FSDP), Sequence Parallelism, Tensor Parallelism, Distributed Training, Transformers, Transformer Models, MySQL, XGBoost, Hugging Face, Visual Studio Code (VS Code), Amazon Machine Learning, Strategic Planning, Deep Neural Networks (DNNs), NLU, Convolutional Neural Networks (CNNs), Developing AI Models locally, Neural Network Pruning, Recurrent Neural Networks (RNNs), PyTorch Lightning, model quantization, AI Prompts, Claude, Data Cleansing, Lead Generation, Optical Character Recognition (OCR), Construction, Reinforcement Learning, Natural Language Generation (NLG), AI Chatbots, Knowledge Bases, Predictive Modeling, Google Maps, Mapbox, JavaScript, React Native, CTO, Agile Software Development, Software Architecture, Marketing, LangGraph, AI Integration, Discord Bots, Web Scraping, Drones, Startups, Robotics, Fractional CTO, Full-stack Development, ACH Payments, SQL, Speech Recognition, Whisper, Low Latency, Agentic AI, Multimodal GenAI, Java, Data Scraping, 
Flask, GIS, PostGIS, Audio, Audio Processing, Digital Signal Processing, Real-time Audio Processing, Sound, Object-oriented Programming (OOP), Pure Data, ARIMA, Demand Forecasting, Joblib, LightGBM, Time Series Forecasting, Streamlit, Product Forecasts, Retail & Wholesale, SARIMA, Sales Forecasting, Data Science, Forecasting, GraphRAG, Biology, Kubernetes, Cloud, Solution Architecture, Technical Leadership, AI-generated Code, Databricks, DevOps, Graph Databases, Front-end Development, User Interface (UI), Google Cloud, Vertex AI, Data Aggregation, Data Cleaning, Real-time Systems, Speech-to-Text (STT), Statistical Modeling, Text-to-Speech (TTS), DSP, Voice AI, Scalable Architecture, Point of Sale, PG Vector, Qdrant, n8n, Education, REST APIs, AI Pipeline, Anthropic, AI Adoption, Strategy, Amazon S3 (AWS S3), Image Classification, Residual Neural Networks (ResNets), Foundation Models, Supervised Learning, Supervised Machine Learning, Vision Transformer (ViT), Self-Distillation with No Labels, AI Model Training, Anomaly Detection, Model Validation, Predictive Maintenance, Quantization, Sensor Data, Time Series Data, Unsupervised Learning, AI Algorithms, Package Distribution, Statistics, Go, Containerization, Spring, Automation Engineering, Customer Relationship Management (CRM), Pinecone, Snowflake, Retail Technology, ControlNet, DreamBooth, SDXL, Azure Text to Speech, Data Annotation, Image Annotation, Sentiment Analysis, Technical Strategy, Custom Automation, R&D, Tech Research & Evaluation, Product Innovation, Product Design, AI Tool Assessment, Agentic Frameworks, Role-based Access Control (RBAC), JSON, HIPAA Compliance, Data Protection, C#, Auto Encoder, Binary Classification Models, System Architecture, Finance, Back-end Development, RAG Pipelines, Azure Blob Storage, Azure Blobs, Blob Storage, AI Assistants, Agentic RAG Systems, Pydantic

LLM Fine-tuning Expert

2023 - 2023

Designstripe

Developed pipelines to fine-tune open-source LLMs on custom data.
Built Stable Diffusion pipelines for fine-tuning on custom data.
Developed a LangChain-based agent to optimize the workflow.

Technologies: Machine Learning, Artificial Intelligence (AI), Natural Language Processing (NLP), Neural Networks, Web Design, OpenAI GPT-3 API, OpenAI GPT-4 API, Integration, Time Series, ChatGPT, OpenAI, Language Models, AI Agents, Retrieval-augmented Generation (RAG), Generative Artificial Intelligence (GenAI), Prompt Engineering, English, Data Modeling, Modeling, Architecture, Chatbot Conversation Design, Minimum Viable Product (MVP), CI/CD Pipelines, Discord, OpenAI API, Data Interpretation, Open-source LLMs, Data Engineering, K-means Clustering, Hierarchical Clustering, Clustering Algorithms, Clustering, NumPy, DBSCAN, Recommendation Systems, Diffusion-based AI Models, Text to Image, FastAPI, Vector Databases, Hugging Face Transformers, MySQL, Deep Neural Networks (DNNs), NLU, Convolutional Neural Networks (CNNs), Recurrent Neural Networks (RNNs), PyTorch Lightning, AI Prompts, Data Cleansing, Construction, AI Chatbots, Knowledge Bases, Predictive Modeling, JavaScript, React Native, Agile Software Development, Software Architecture, Discord Bots, Web Scraping, Drones, Startups, Robotics, Full-stack Development, SQL, Low Latency, Agentic AI, Java, Data Scraping, Flask, PostGIS, Audio, Digital Signal Processing, Real-time Audio Processing, Sound, Object-oriented Programming (OOP), Pure Data, ARIMA, Demand Forecasting, Joblib, LightGBM, Forecasting, Solution Architecture, Databricks, DevOps, Data Aggregation, Data Cleaning, Statistical Modeling, Text-to-Speech (TTS), Education, Anthropic, Strategy, Amazon S3 (AWS S3), Image Classification, Residual Neural Networks (ResNets), Foundation Models, Supervised Learning, Supervised Machine Learning, Vision Transformer (ViT), AI Model Training, Anomaly Detection, Model Validation, Predictive Maintenance, Sensor Data, Time Series Data, Unsupervised Learning, AI Algorithms, Machine Learning Operations (MLOps), Package Distribution, Statistics, Containerization, Pinecone, SDXL, Image Annotation, Sentiment Analysis, Custom 
Automation, R&D, Role-based Access Control (RBAC), Platform as a Service (PaaS), Data Protection, Auto Encoder, Binary Classification Models, System Architecture, Finance, Back-end Development

LLM Prompt Engineer

2023 - 2023

NIC MAP Vision LLC

Engineered LLM prompts to design the legal document Q/A in the chatbot.
Engineered LLM prompts to design the Q/A on document queries for the platform.
Engineered LLM prompts to design the summarization of legal documents.

Technologies: Artificial Intelligence (AI), Natural Language Processing (NLP), OpenAI GPT-3 API, OpenAI GPT-4 API, Generative Pre-trained Transformers (GPT), Generative Pre-trained Transformer 3 (GPT-3), Language Models, Research, API Integration, Integration, Time Series, ChatGPT, OpenAI, Generative Artificial Intelligence (GenAI), Prompt Engineering, English, Data Modeling, Modeling, Architecture, Chatbot Conversation Design, Minimum Viable Product (MVP), Clustering Algorithms, Hugging Face Transformers, XGBoost, Recurrent Neural Networks (RNNs), PyTorch Lightning, AI Chatbots, JavaScript, Software Architecture, Startups, Full-stack Development, SQL, Audio, Solution Architecture, Statistical Modeling, Text-to-Speech (TTS), Amazon S3 (AWS S3), Image Classification, Residual Neural Networks (ResNets), Anomaly Detection, Model Validation, Predictive Maintenance, Sensor Data, Time Series Data, Unsupervised Learning, Statistics, Sentiment Analysis, Data Protection, Binary Classification Models

Machine Learning Developer

2023 - 2023

Atmospheric Data Solutions

Designed data pipelines to manage vast amounts of data for an atmospheric-related project.
Designed machine learning algorithms to improve wind speed predictions.
Converted existing codebase from R to Python and optimized machine learning pipelines.

Technologies: Machine Learning, Python, Pandas, NumPy, R, JupyterLab, Weather, MySQL, Random Forests, XGBoost, Data Scientist, Algorithms, Keras, Data Analysis, Benchmarking, Research, API Integration, Integration, Time Series, ChatGPT, OpenAI, Language Models, Clustering Algorithms, Software Architecture

AI Expert

2022 - 2023

RunKicker

Developed deep learning pipelines for BMI detection on complex data for a personal healthcare assistant.
Developed deep learning algorithms that efficiently handle small datasets, enhancing their robustness by leveraging the distribution derived from limited data.
Optimized existing models, reducing model size from 250 MB to 50 MB.

Technologies: Artificial Intelligence (AI), Image Processing, Python, Signal Processing, Health, Computer Vision, C++, Models, PyTorch, TensorFlow, Mobile, AI Programming, Image Generation, APIs, AI Design, Data Pipelines, Data Visualization, Large Language Models (LLMs), Data Scientist, Algorithms, Keras, Data Analysis, Open Neural Network Exchange (ONNX), Research, API Integration, Integration, Time Series, Statistical Analysis, Data Analytics, OpenAI, Team Leadership, Medical Imaging

Machine Learning Developer | Models Build and Models Fine Tune

2022 - 2022

Psi.Wave LLC

Designed and implemented deep learning large language model (LLM) pipelines on huge data sets.
Optimized the existing training pipeline from both time and computation perspectives.
Implemented custom attention heads for multiple LLMs.

Technologies: Python, Machine Learning, Deep Learning, Artificial Intelligence (AI), AI Programming, APIs, AI Design, Data Pipelines, Data Visualization, Large Language Models (LLMs), Data Scientist, Algorithms, Keras, Research, API Integration, Integration, Statistical Analysis, OpenAI, Language Models, Algorithmic Trading, Actor Model, Backtesting Trading Strategies, Trading Strategy Development

Senior Data Scientist

2021 - 2022

HamzaAi

Implemented a machine learning pipeline for vessel delay prediction at Khalifa Port in the UAE, reducing the prediction error from more than 24 hours to two hours. This resulted in better use of resources, including data mining and ML, at Khalifa Port.
Executed the machine learning pipeline for job category detection through text mining.
Implemented the pipeline to detect Arabic content originality through text mining.
Implemented auto fault prediction in chips during manufacturing.

Technologies: Computer Vision, Generative Pre-trained Transformers (GPT), Natural Language Processing (NLP), PyTorch, TensorFlow, Deep Learning, Image Processing, Machine Learning, Python, Custom Models, Artificial Intelligence (AI), Neural Networks, Artificial Neural Networks (ANN), Generative Adversarial Networks (GANs), Facial Recognition, OpenCV, Computer Vision Algorithms, Azure Machine Learning, Pandas, Azure, Spark ML, Best Practices, Performance Optimization, Language Models, Text Generation, Fine-tuning, Data Inference, AI Programming, Image Generation, APIs, Chatbots, AI Design, PostgreSQL, Data Pipelines, Data Visualization, Financial Forecasting, Large Language Models (LLMs), OpenAI GPT-4 API, OpenAI GPT-3 API, Leadership, Data Scientist, Algorithms, Reinforcement Learning, Keras, Data Analysis, Dashboards, Business Intelligence (BI), Reports, Google Data Studio, BigQuery, Legal Documentation, Research, API Integration, Wearables, Biometrics, Time Series, Statistical Analysis, Data Analytics, Data Reporting, Amazon SageMaker, JSTransformers, Team Leadership, Medical Imaging

Graduate Research Assistant

2021 - 2021

Texas A&M University

Researched T-cell and receptor sequence contact prediction on human protein sequences using deep learning (NLP).
Investigated cancer region detection in whole slide images (WSI) in collaboration with the University of Chicago.
Addressed the challenge that each WSI takes gigabytes to store, which makes direct deep learning methods such as image classification and segmentation impractical.

Technologies: Computer Vision, Natural Language Processing (NLP), Generative Pre-trained Transformers (GPT), PyTorch, Deep Learning, Image Processing, Machine Learning, Python, Custom Models, Artificial Intelligence (AI), Neural Networks, Artificial Neural Networks (ANN), Generative Adversarial Networks (GANs), Facial Recognition, OpenCV, Computer Vision Algorithms, Pandas, Best Practices, Language Models, Text Generation, Data Inference, AI Programming, Data Visualization, Large Language Models (LLMs), Algorithms, Research, Biometrics, Statistical Analysis

Data Scientist

2020 - 2021

HamzaAi

Implemented a deep learning pipeline for event and accident detection on self-driving car synthetic data.
Executed an Arabic OCR detection pipeline based on EasyOCR adjustments.
Worked on a handwriting recognition tool for Arabic schools.

Technologies: Computer Vision, Generative Pre-trained Transformers (GPT), Natural Language Processing (NLP), PyTorch, Deep Learning, Image Processing, Machine Learning, Python, Custom Models, Artificial Intelligence (AI), Neural Networks, Point Clouds, Artificial Neural Networks (ANN), Generative Adversarial Networks (GANs), OpenCV, Computer Vision Algorithms, Azure Machine Learning, Pandas, Azure, Spark ML, Best Practices, Performance Optimization, Language Models, Text Generation, Fine-tuning, Data Inference, AI Programming, AI Design, Data Pipelines, Data Visualization, Large Language Models (LLMs), Algorithms, Data Analysis, Dashboards, Business Intelligence (BI), Reports, BigQuery, Research, Biometrics, Web Development, Statistical Analysis, Data Analytics, Data Reporting, Automated Biometric Identification Systems (ABIS), Medical Imaging

Data Scientist

2020 - 2021

National University of Computer and Emerging Sciences

Researched breast cancer detection using whole slide images, computerized medical imaging, and graphics.
Worked on a low-cost pathology project that received a $13.68 million grant for breast cancer detection.
Led Amal, which was both a project and an awareness campaign: a movement for low-cost pathology (breast cancer detection) in Pakistan using artificial intelligence.

Technologies: Computer Vision, Machine Learning, Deep Learning, PyTorch, Image Processing, Python, Custom Models, Artificial Intelligence (AI), Neural Networks, Point Clouds, Artificial Neural Networks (ANN), Generative Adversarial Networks (GANs), OpenCV, Computer Vision Algorithms, Pandas, Best Practices, Language Models, Data Inference, AI Programming, AI Design, Data Visualization, Algorithms, Data Analysis, Dashboards, Business Intelligence (BI), Reports, BigQuery, Research, Web Development, Data Analytics

Software Engineer

2017 - 2020

National University of Computer and Emerging Sciences

Developed a deep learning pipeline to detect breast cancer based on low-cost pathology by extracting whole slide images from a scanned microscopic mobile video.
Designed a Python library and package called OpTorch to optimize PyTorch training pipelines for whole slide images (WSI). Published the OpTorch research paper at a well-regarded conference.
Built a deep learning pipeline based on CAT scan images to detect brain tumors.

Technologies: Machine Learning, Deep Learning, PyTorch, TensorFlow, Computer Vision, Generative Adversarial Networks (GANs), OpenCV, Computer Vision Algorithms, Pandas, Language Models, Data Inference, AI Programming, AI Design, Data Visualization, Algorithms

Experience

PMNet | A Probability Map-based Scaled Network for Breast Cancer Diagnosis

https://pubmed.ncbi.nlm.nih.gov/33578222/

Our method employs scaled networks for detecting breast cancer in whole slide images. It classifies entire slide images on a patch level into normal, benign, in situ, and invasive tumors.

Our approach yielded an F1-score of 88.9 (±1.7)%, which outperformed the benchmark F1-score of 81.2 (±1.3)% on the patch level, and achieved an average Dice coefficient of 69.8% on 10 whole slide images compared to the benchmark average Dice coefficient of 61.5% on the BACH dataset.

Similarly, on the Dryad test dataset comprising 173 whole slide images, we achieved an average Dice coefficient of 82.7% compared to the previous state of the art of 76%, without fine-tuning on this dataset. We further proposed a method to generate patch-level annotations for the image-level TCGA breast cancer database that will be useful for future deep learning methods.
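
For readers unfamiliar with the Dice coefficient reported above, the following is a minimal sketch of the standard definition, 2|A∩B| / (|A| + |B|), on two binary masks. It is not taken from the PMNet code; the function name, the flat 0/1 list representation, and the empty-mask convention are assumptions made for illustration.

```python
def dice_coefficient(pred, target):
    """Dice overlap between two binary masks given as flat sequences of 0/1."""
    if len(pred) != len(target):
        raise ValueError("masks must have the same length")
    # Pixels marked positive in both masks.
    intersection = sum(p * t for p, t in zip(pred, target))
    total = sum(pred) + sum(target)
    # Assumed convention: two empty masks count as a perfect match.
    if total == 0:
        return 1.0
    return 2.0 * intersection / total


# Hypothetical 4-pixel masks: one shared positive pixel out of three marked.
predicted = [1, 1, 0, 0]
annotated = [1, 0, 0, 0]
print(round(dice_coefficient(predicted, annotated), 3))  # 0.667
```

An average Dice coefficient over a set of slides would then be the mean of this value computed per slide.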

Bias Adjustable Activation Network for Imbalanced Data | Diabetic Foot Ulcer Challenge 2021

Despite great success, deep learning models still face a critical obstacle in classifying highly imbalanced real-life data.

Detecting diabetic foot ulcers is fundamental for healthcare specialists to prevent amputations. In this work, we performed multiple experiments to benchmark results on the grand challenge. To adjust the bias of the convolutional neural networks, we also proposed a custom-designed activation layer based on softmax to handle the probability skew of the classes.

We achieved the second position in the validation set with a macro F1 score of 0.593 and the third position in the test set with a macro F1 score of 0.596 for the Diabetic Foot Ulcer Detection 2021 Grand Challenge.
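
The macro F1 score quoted above averages the per-class F1 scores with equal weight, so rare classes count as much as common ones, which is why it is a common choice for imbalanced data. The sketch below shows the standard computation in plain Python; it is not the challenge's evaluation code, and the function name and toy labels are hypothetical.

```python
def macro_f1(y_true, y_pred):
    """Unweighted mean of per-class F1 scores over all labels seen."""
    labels = sorted(set(y_true) | set(y_pred))
    scores = []
    for label in labels:
        pairs = list(zip(y_true, y_pred))
        tp = sum(t == label and p == label for t, p in pairs)
        fp = sum(t != label and p == label for t, p in pairs)
        fn = sum(t == label and p != label for t, p in pairs)
        denom = 2 * tp + fp + fn
        # F1 = 2TP / (2TP + FP + FN); defined as 0 when the class never appears.
        scores.append(2 * tp / denom if denom else 0.0)
    return sum(scores) / len(scores)


# Toy imbalanced example: four samples of class 0, two of class 1.
truth = [0, 0, 0, 0, 1, 1]
preds = [0, 0, 0, 1, 1, 0]
print(macro_f1(truth, preds))  # 0.625
```

Here class 0 scores 0.75 and class 1 scores 0.5, so the macro average is 0.625 even though four of the six predictions are correct.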

PRNet | A Progressive Resolution-based Network for Radiograph-based Disease Classification

https://ieeexplore.ieee.org/document/9708553

COVID-19 and pneumonia have impacted human life significantly. The number of infected people and deaths is increasing daily due to COVID-19. Rapid COVID-19 detection is vital to control and stop the spread of the disease.

Considering AI can play a significant role in accurately detecting such diseases, EE-RDS conducted a multi-class classification challenge by providing chest X-rays of pneumonia, COVID-19, and regular patients. We proposed PRNet, a novel deep learning pipeline, and achieved 96.3% accuracy, taking second position on the test set leaderboard.

OpTorch | Optimized Deep Learning Architectures for Resource Limited Environments

https://arxiv.org/abs/2105.00619

Deep learning algorithms have made many breakthroughs and have various real-life applications. Computational resources become a bottleneck as the data and complexity of the deep learning pipeline increase.

In this paper, we proposed optimized deep learning pipelines in multiple aspects of training, including time and memory. OpTorch is a machine learning library designed to overcome weaknesses in existing implementations of neural network training. It provides features to train complex neural networks with limited computational resources.

OpTorch achieved the same accuracy as existing libraries on CIFAR-10 and CIFAR-100 datasets while reducing memory usage to approximately 50%. We also explored the effect of weights on total memory usage in deep learning pipelines.

In our experiments, parallel encoding-decoding along with sequential checkpoints results in much-improved memory and time usage while keeping the accuracy similar to existing pipelines.

Education

2019 - 2023

PhD in Computer Science

Texas A&M University - College Station, TX, USA

2015 - 2019

Bachelor's Degree in Computer Science

National University of Computer and Emerging Sciences - Islamabad, Pakistan

Certifications

JULY 2022 - PRESENT

Winner of Object Detection for Dash CAM Images AI Challenge

Motive (formerly KeepTruckin)

SEPTEMBER 2021 - PRESENT

Winner of Chest-XRAY COVID-19 Grand Challenge

Amazon Web Services

AUGUST 2021 - PRESENT

Winner of Diabetic Foot Ulcer Detection Grand Challenge

MICCAI

AUGUST 2021 - PRESENT

Certificate of Achievement

The Manchester Metropolitan University

Skills

Libraries/APIs

PyTorch, TensorFlow, OpenCV, Pandas, NumPy, XGBoost, Keras, OpenAI API, Scikit-learn, Hugging Face Transformers, PyTorch Lightning, Joblib, PySpark, REST APIs, JAX, Pydantic, Spark ML, Amazon Rekognition, Google Maps, Node.js, React

Tools

Amazon SageMaker, ChatGPT, Azure ML Studio, Jupyter, OpenAI o1, AI Prompts, Claude, Whisper, ARIMA, SARIMA, GraphRAG, n8n, Azure Machine Learning, BigQuery, Open Neural Network Exchange (ONNX), AWS Glue, GIS, Amazon Polly, Zapier, CVAT, ComfyUI

Languages

Python, C++, Unicorn, JavaScript, SQL, Java, Snowflake, R, Go, C#, TypeScript

Frameworks

Bedrock, LlamaIndex, LangGraph, Flask, LightGBM, Streamlit, Agentic Frameworks, Flux, Django, React Native, Spring, Next.js

Paradigms

Best Practices, Business Intelligence (BI), Automation, Agile Software Development, Object-oriented Programming (OOP), DevOps, Model Context Protocol (MCP), Real-time Systems, Foundation Models, Anomaly Detection, Automation Engineering, Role-based Access Control (RBAC), HIPAA Compliance, Rapid Prototyping, Actor Model

Platforms

Jupyter Notebook, Amazon Web Services (AWS), Azure AI Studio, Docker, Visual Studio Code (VS Code), NVIDIA CUDA, Kubernetes, Vertex AI, Azure, Google Cloud Platform (GCP), AWS Lambda, CrewAI, Mapbox, Databricks, Replit, Mobile

Storage

Data Pipelines, MySQL, Graph Databases, Google Cloud, Amazon S3 (AWS S3), JSON, Azure Blobs, PostgreSQL, PostGIS

Industry Expertise

Marketing, Retail & Wholesale, Bioinformatics, Trading Strategy Development, Web Design

Other

Machine Learning, Computer Vision, Natural Language Processing (NLP), Deep Learning, Data Science, Image Processing, JSTransformers, Custom Models, Artificial Intelligence (AI), Cloud, Neural Networks, Artificial Neural Networks (ANN), Generative Adversarial Networks (GANs), Code Review, Source Code Review, Task Analysis, Technical Hiring, Interviewing, Facial Recognition, Computer Vision Algorithms, Language Models, Text Generation, Fine-tuning, Data Inference, Classification Algorithms, Classification, Text Classification, Signal Processing, Health, Models, AI Programming, APIs, Chatbots, AI Design, Data Visualization, Image Generation, Large Language Models (LLMs), Generative Pre-trained Transformers (GPT), JupyterLab, Random Forests, OpenAI GPT-4 API, Generative Pre-trained Transformer 3 (GPT-3), OpenAI GPT-3 API, Data Scientist, Algorithms, Reinforcement Learning, Data Analysis, Dashboards, Reports, Research, API Integration, Time Series, Statistical Analysis, Data Analytics, OpenAI, AI Agents, Retrieval-augmented Generation (RAG), LangChain, Generative Artificial Intelligence (GenAI), Gemini, Prompt Engineering, English, Data Modeling, Modeling, Chatbot Conversation Design, Minimum Viable Product (MVP), Medical Imaging, CI/CD Pipelines, Machine Learning Operations (MLOps), Data Interpretation, Open-source LLMs, Llama 3, Vector Search, Adversarial Machine Learning, K-means Clustering, Hierarchical Clustering, Clustering Algorithms, Clustering, DBSCAN, Recommendation Systems, Diffusion Models, Diffusion-based AI Models, Text to Image, Multi-agent Systems, FastAPI, Vector Databases, Meta Llama, Bittensor, Context Parallelism, FSDP, Fully Sharded Data Parallelism (FSDP), Sequence Parallelism, Tensor Parallelism, Distributed Training, Transformers, Transformer Models, Large Language Model Operations (LLMOps), Hugging Face, Gemini API, Amazon Machine Learning, Feasibility Studies, Strategic Planning, Deep Neural Networks (DNNs), NLU, Convolutional Neural Networks (CNNs), Developing AI Models locally, Neural Network Pruning, Recurrent Neural Networks (RNNs), model quantization, Data Cleansing, CUDA Kernel, Optical Character Recognition (OCR), Graphs, Natural Language Generation (NLG), AI Chatbots, Knowledge Bases, Predictive Modeling, CTO, Software Architecture, AI Integration, Discord Bots, Web Scraping, Startups, Full-stack Development, Speech Recognition, Low Latency, Agentic AI, Multimodal GenAI, Data Scraping, Audio, Audio Processing, Digital Signal Processing, Real-time Audio Processing, Sound, Demand Forecasting, Time Series Forecasting, Product Forecasts, Sales Forecasting, Forecasting, Biology, Solution Architecture, AI-generated Code, Data Aggregation, Data Cleaning, Speech-to-Text (STT), Statistical Modeling, Text-to-Speech (TTS), DSP, Voice AI, Scalable Architecture, Qdrant, Education, AI Pipeline, Anthropic, AI Adoption, Strategy, Image Classification, Residual Neural Networks (ResNets), Supervised Learning, Supervised Machine Learning, Vision Transformer (ViT), Self-Distillation with No Labels, AI Model Training, Model Validation, Predictive Maintenance, Quantization, Sensor Data, Time Series Data, Unsupervised Learning, AI Algorithms, Package Distribution, Statistics, Containerization, Customer Relationship Management (CRM), Pinecone, ControlNet, DreamBooth, LoRa, SDXL, Azure Text to Speech, Data Annotation, Image Annotation, Sentiment Analysis, Technical Strategy, Custom Automation, R&D, Tech Research & Evaluation, Firecrawl, Flax, TPU, Platform as a Service (PaaS), Data Protection, Auto Encoder, Binary Classification Models, System Architecture, Finance, Back-end Development, RAG Pipelines, Azure Blob Storage, Blob Storage, AI Assistants, Agentic RAG Systems, Object Detection, Performance Optimization, Financial Forecasting, Leadership, Legal Documentation, Benchmarking, Integration, Wearables, Biometrics, Data Reporting, Automated Biometric Identification Systems (ABIS), Team Leadership, Outbound Marketing, Architecture, Discord, Data Engineering, Algorithmic Trading, Backtesting Trading Strategies, Lead Generation, Construction, eCommerce, Drones, Robotics, Fractional CTO, ACH Payments, Audio Streaming, Technical Leadership, Point of Sale, Retail Technology, ElevenLabs Solutions, Product Innovation, Product Design, AI Tool Assessment, Amazon Bedrock, Point Clouds, Weather, Google Data Studio, Web Development, Multivariate Analysis (MVA), Genomics, Full-stack, Pure Data, Front-end Development, User Interface (UI), PG Vector
