
William Zhu
Verified Expert in Engineering
Data Scientist and AI Developer
Shenzhen, Guangdong Province, China
Toptal member since November 18, 2021
William has three years of professional experience in data science and artificial intelligence. Key projects include text classification to identify hate speech in social media and fraud detection applications. He specializes in data analysis, data visualization, and predictive modeling, and his strongest programming language is Python. William is diligent and obsessed with quality.
Portfolio
Experience
- Python 3 - 4 years
- NumPy - 3 years
- Pandas - 3 years
- Matplotlib - 3 years
- Scikit-learn - 2 years
- Predictive Modeling - 2 years
- Text Classification - 2 years
- Natural Language Processing (NLP) - 2 years
Availability
Preferred Environment
Linux, Vim Text Editor, Jupyter Notebook, Python 3
The most amazing...
...thing I've developed is the Python client of Myanmar Tools that is used by Google, Facebook, and others to detect Zawgyi encoding.
Work Experience
Lead AI Engineer
Koe Koe Tech
- Led the development of algorithms to detect hate speech on social media.
- Built and maintained a web API for the hate speech detector.
- Provided technical assistance in labeling hate speech.
Data Scientist
Koe Koe Tech
- Analyzed data from different data sources for regular and ad hoc reporting.
- Performed performance tuning and documentation of 10+ tables in a database.
- Built a predictive model of referral fraud based on user behavior.
Experience
Hate Speech Detector for Social Media Comments
Referral Fraud Detector for a Health App
Python Client for Myanmar Tools
https://github.com/google/myanmar-tools/tree/master/clients/pythonEducation
Bachelor's Degree in Medicine
University of Medicine, Mandalay - Mandalay, Myanmar
Certifications
Learning from Data (Introductory Machine Learning) (CS115x)
edX
Introduction to Probability - The Science of Uncertainty (6.041x)
edX
Introduction to Computer Science (CS50)
edX
Skills
Libraries/APIs
NumPy, Pandas, Matplotlib, Scikit-learn, Web API
Tools
Vim Text Editor, PyPI
Languages
Python 3, Python, SQL, C, JavaScript
Platforms
Linux, Jupyter Notebook
Frameworks
Flask
Storage
Databases
Other
Predictive Modeling, Data Science, Medicine, Algorithms, APIs, Performance Tuning, Data Reporting, Artificial Intelligence (AI), Text Classification, Tokenization, Natural Language Processing (NLP), Computer Science, Data Structures, Probability Theory, Statistics, Machine Learning, Generative Pre-trained Transformers (GPT)
How to Work with Toptal
Toptal matches you directly with global industry experts from our network in hours—not weeks or months.
Share your needs
Choose your talent
Start your risk-free talent trial
Top talent is in high demand.
Start hiring