
Khoa Nguyen
Verified Expert in Engineering
Data Scientist Developer
Melbourne, Victoria, Australia
Toptal member since July 1, 2021
Khoa is a data scientist specializing in providing businesses with high-quality machine-learning solutions. He successfully helped deploy AI modules that assisted many advertising campaigns in optimizing their marketing strategies. He also contributed to several PoC projects outlining their feasibility of solving practical business problems. Khoa is also a well-rounded individual who can collaborate with many people at work and work independently.
Portfolio
Experience
- Python - 6 years
- Data Mining - 6 years
- Machine Learning - 6 years
- PyTorch - 5 years
- Anaconda - 5 years
- NumPy - 5 years
- Pandas - 5 years
- TensorFlow - 4 years
Availability
Preferred Environment
TensorFlow, Pandas, NumPy, SQL, Jupyter Notebook, Machine Learning, Python, Artificial Intelligence (AI), Data Mining, PyTorch
The most amazing...
...research I have done was using Artificial Intelligence prediction models to understand the interaction between cross-reactive T-cells and various pathogens.
Work Experience
Python Engineer
Captario
- Optimized CPU and memory utilization for drug database infrastructure.
- Developed Python code to optimize model drug projects.
- Distributed computations using cloud infrastructure and Kubernetes.
Data Engineer
Yedda Co. Ltd.
- Developed a module that manages data collection and database management for customers.
- Provided UML diagrams and solutions for database architecture.
- Performed research to optimize SQL queries and database performance.
Data Scientist
Knorex Co., Ltd.
- Developed bid landscape model for predicting optimal bidding prices in real-time advertising display with a ROC AUC score of up to 80%.
- Implemented the first version of the audience segmentation module to identify similar user groups for the ads targeting scheme.
- Provided a comprehensive proof-of-concept (POC) of federated learning in the predicting CTR of online advertising (trade-off with a decrease of AUC score by 15-20% while ensuring data privacy).
- Provided a machine learning architecture handling training and serving sessions of bid landscape for multiple advertising campaigns.
- Provided a proof-of-concept of feature store using Feast to prepare a better data source for data scientists in extracting qualities features.
- Analyzed data to extract relevant information for customers to improve their marketing strategies.
AI Engineer
Viralint Co. Ltd.
- Analyzed metadata and lyrics of different songs to determine the user's configuration for optimal song generation.
- Built a data system for crawling and labeling data for music generation.
- Created a deep learning model for lyrics segmentation and semantic analysis.
- Designed a multiprocessing system for deep learning tasks.
- Provided a PoC of a module automatically detecting faults in the lens using OpenCV.
Data Engineer Intern
Younet Media Social Enterprise
- Supported building a database of user's social network information.
- Implemented artificial intelligence models detecting human faces and ages.
- Assisted in building modules to extract data from Facebook API.
Experience
Artificial Intelligence to Predict How T-cells Recognize Diverse Pathogens
Smart Bid Recommendation of Knorex's KAIROS Engine
POC of Federated Learning for CTR Prediction
Automatically Finding the Number of Clusters for Large Datasets Based on Coresets
Education
Master's Degree in Computer Science
University of Melbourne - Melbourne, VIC
Engineer's Degree in Computer Science
Ho Chi Minh University of Technology, Vietnam National University - Vietnam
Certifications
Advanced Data Science with IBM Specialization
Coursera
Global Project-based Learning
Shibaura Institute of Technology and Ho Chi Minh University of Technology
Skills
Libraries/APIs
TensorFlow, Pandas, NumPy, OpenCV, PyTorch, Matplotlib, TensorFlow Deep Learning Library (TFLearn), Keras, Scikit-learn, Dask
Tools
Plotly, Scikit-image
Languages
C++, Python, Python 3, SQL, R, Java, Scala
Platforms
Visual Studio Code (VS Code), Anaconda, Jupyter Notebook, Azure, Kubernetes
Frameworks
Ray
Storage
Google Cloud, Databases
Other
Machine Learning, Data Mining, Programming, Predictive Analytics, Predictive Modeling, Boost.Python, Data Science, Artificial Intelligence (AI), Deep Learning, Linear Algebra, Big Data, Data Analysis, Multiprocessing, Data Preparation, Exploratory Data Analysis, Data Visualization, Streaming, Neural Networks, Computer Vision, Statistics, Feast, Calculus, UML Diagrams, Signal Processing, Biology, Research, Algorithms
How to Work with Toptal
Toptal matches you directly with global industry experts from our network in hours—not weeks or months.
Share your needs
Choose your talent
Start your risk-free talent trial
Top talent is in high demand.
Start hiring