
Hameed Hasan
Verified Expert in Engineering
Machine Learning Developer
Germantown, MD, United States
Toptal member since April 29, 2020
Hamid is a lead research scientist at UnitedHealth Group/Optum AI. He specializes in natural language processing (NLP) and natural language understanding (NLU) with a focus on healthcare applications. Hamid holds a Ph.D. in computer science from Georgia Tech and has extensive experience leveraging generative AI to advance healthcare solutions.
Experience
- Machine Learning - 13 years
- Python - 9 years
- Natural Language Processing (NLP) - 8 years
- Bioinformatics - 8 years
- Deep Learning - 5 years
- Computer Vision - 4 years
- TensorFlow - 4 years
- PyTorch - 3 years
Preferred Environment
TensorFlow, Python, PyTorch, Bioinformatics, Natural Language Processing (NLP), Generative Pre-trained Transformers (GPT), Healthcare IT, JavaScript
The most amazing...
...project I've worked on involves summarizing and understanding patient-doctor conversations and extracting information in that domain.
Work Experience
Machine Learning Scientist
UnitedHealth Group
- Worked as a researcher pushing the state of the art in NLP applied to healthcare.
- Worked routinely with PyTorch, Hugging Face, GPT-3, S4, Transformer-based models, and many other language models (LMs).
- Advanced the state of the art in healthcare, primarily through NLP research.
Data Scientist
Disney Streaming Services
- Designed and implemented advanced NLP pipelines for the Disney Streaming customer service chatbot.
- Managed text summarization and topic modeling on survey data.
- Deployed the chatbot using cloud services.
Machine Learning Developer
USC/ISI (Information Sciences Institute)
- Developed machine learning algorithms for event prediction in news corporations using a range of technologies and models (e.g., TensorFlow, BERT).
- Turned the developed code into deployable products using containers and Kubernetes.
- Assisted the integration team responsible for delivering a multi-faceted product composed of different analytics engines.
Machine Learning Researcher
3M/MModal
- Focused research on improving the NLP pipelines used to summarize patient-doctor conversations.
- Adapted recent advances in deep learning for NLP to the company's internal domain to improve the deployed pipeline.
- Worked on a variety of healthcare-related NLP tasks using deep learning, transformers, PyTorch, PyTorch Lightning, and Hugging Face.
Senior Machine Learning Engineer
Liberty Defense
- Designed and showcased a deep convolutional neural network for the prediction of threats.
- Applied state-of-the-art image segmentation and detection models such as Mask R-CNN to segment threats (e.g., cases of people concealing or carrying guns).
- Achieved 95% accuracy in detecting cases of people carrying guns. Implemented the model with TensorFlow and trained it on CUDA GPUs.
Senior Software Engineer
Home Depot
- Designed and implemented deep models for search and personalization. The task was to rank the items a search engine returned for different search phrases by relevance and user satisfaction.
- Trained NLP models implemented in TensorFlow on GPUs. Used recurrent neural networks along with Siamese networks and integrated multiple modalities, such as user behavior.
- Wrote preprocessing scripts in Spark to generate and preprocess large datasets.
Software Engineer Intern
Verizon Connect
- Developed an app that used advanced recommender systems to recommend the best-matching shopping places for drivers. Sorted through a large amount of data accumulated in data clusters.
- Utilized driver behaviors, personalities, and demographics to train an integrated deep recommender system. The data was collected from a large number of vehicles using the product (a dongle).
- Integrated two types of recommender systems: content-based filtering and collaborative filtering. The content-based component modeled individual personal information, while the collaborative component modeled driving behaviors and habits.
- Implemented both the collaborative and content-based filtering approaches in TensorFlow and Python and trained the system end to end.
- Achieved strong performance in predicting drivers' preferred shopping centers.
Data Analyst Intern
UCB Pharma
- Developed a deep learning pipeline based on autoencoders to predict Parkinson's disease from claims data. The goal was to predict whether a person has Parkinson's based on past visits to different doctors.
- Utilized the H2O library in R to implement a deep network on features describing the patient's past medications and diagnosis codes. Achieved a prediction performance of about 90%.
- Used the trained model to identify cases in the early stages of the disease, when treatment is more likely to succeed, and to find trial cases sooner.
Experience
Prediction of Threats from Radar Generated Images
Using NLP for Improving Alignments of High Throughput Reads
Using Large Language Models for Analysis of Adherence to Clinical Guidelines
Developing Advanced Question Answering Models for Information Retrieval from Medical Documents
Education
Ph.D. in Computer Science, Bioinformatics, Machine Learning
Georgia Institute of Technology - Atlanta, Georgia, USA
Skills
Libraries/APIs
PyTorch, TensorFlow, React, Java Natural Language Processing (JNLP), XGBoost, PySpark, Keras
Tools
ChatGPT, MATLAB
Languages
Python, Java, C++, JavaScript, Perl, R
Platforms
Firebase, Docker, Google Cloud Platform (GCP), Azure, Kubernetes, Web, Amazon Web Services (AWS)
Industry Expertise
Bioinformatics
Storage
Google Cloud
Frameworks
Next.js
Other
Data Science, Predictive Modeling, Deep Learning, Convolutional Neural Networks (CNNs), Recurrent Neural Networks (RNNs), Machine Learning, Natural Language Processing (NLP), Artificial Intelligence (AI), Vectorization, Generative Artificial Intelligence (GenAI), OpenAI GPT-4 API, Transformer Models, Web Scraping, CSV Export, Prompt Engineering, Large Language Models (LLMs), APIs, Gemini, Retrieval-augmented Generation (RAG), Llama 3, Sentiment Analysis, Web Development, OpenAI, Fine-tuning, Optical Character Recognition (OCR), Llama 2, Computer Vision, Generative Pre-trained Transformers (GPT), Document Parsing, FastAPI, Genomics, Data Scraping, Scraping, Vector Databases, Chatbot Conversation Design, SAP Sales and Distribution (SAP SD), Deep Neural Networks (DNNs), Biotechnology, Language Models, Healthcare IT, Time Series Analysis, Time Series, LangChain