
David Dai
Verified Expert in Engineering
Machine Learning Developer
David has extensive experience in building machine learning and deep learning (DL) solutions at top companies, including Apple, Google, and Facebook, unicorn startups, and academia, as he has a PhD from Carnegie Mellon U. He holds multiple patents in DL-based medical imaging tech and large-scale AI systems. David has grown an AI team to 60+ as the director and tech lead.
Portfolio
Experience
Availability
Preferred Environment
C++, Python, Scikit-learn, PyTorch, TensorFlow, Django, ChatGPT, OpenAI GPT-3 API, Chatbots
The most amazing...
...project I've created was best-in-class DL methods for a chest X-ray to assist radiologists in diagnosis, resulting in the first contract for the company.
Work Experience
Staff Machine Learning Engineer
Apple
- Deployed high-performance deep-learning models to run on-device to improve inference speed and user privacy.
- Led the design of CoreML, a deep-learning framework for on device and cloud for first- and third-party developers at Apple.
- Served as a tech lead and helped grow the team from three to ten engineers and researchers.
Senior Director of Engineering
Petuum
- Worked as part of the founding team and helped to grow the engineering team from less than ten in 2016 to over 60 full-time engineers in the second quarter of 2018 (2Q18).
- Influenced, as a domain expert in machine learning, a trench worker, and a manager, the product design and delivered quarterly internal platform product milestones from 3Q17-1Q18.
- Led the medical imaging team of four and personally developed best-in-class deep learning methods for chest X-rays to assist radiologists in diagnosis. The project led to the first contract for the company.
- Served as the tech lead and managed the back-end design of Petuum's AI platform that combined all stages of machine learning, including data cleaning, feature engineering, data and model visualization, deployment, and monitoring.
Software Engineer Intern
- Contributed more than 10,000 lines of C++ code during my three-month internship and helped initiate a large-scale distributed training system for Facebook newsfeed and ad algorithms.
- Developed a distributed large-scale logistic regression using LBFGS solver and parameter server-based synchronization.
- Benchmarked the implementation against Facebook's internal system and Vowpal Wabbit. Showed that the implementation achieves high system throughputs and produces comparable to better models.
Software Engineering Intern
- Optimized one of the world's largest machine learning systems – Google's ads algorithms for click-through-rate prediction.
- Developed a large-scale hyperparameter tuning framework to enable Google's ads training system to discover the best hyperparameter settings for ML models. The system is used by dozens of teams and later published as Google Vizier.
- Built the frontend to the hyperparameter tuning engine.
Software Developer Intern
- Implemented a number of background tasks in the payment back end.
- Improved access control for internal payment handling.
- Developed a dynamic terminal folder navigator to replace the "CD" in Linux in a hackathon.
Experience
Chest X-ray Organ Segmentation Through Adversarial Deep Neural Networks
A medical imaging device that a client needed to differentiate their products with AI.
BRAINSTORMING
I collaborated closely with radiologists to identify tasks that can be automated by AI in the near term. While most AI apps in medical imaging aren't interpretable, e.g., black box classification of diseases, the solution needed to be interpretable to build trust with doctors. We chose chest X-ray segmentation, which can be visualized and directly leads to clinical indicators.
DEVELOPMENT
Existing methods use traditional computer vision pipelines, which are very brittle and inaccurate. I found only a minimal dataset with pixel labels (<300 samples), which was challenging for data-hungry neural networks. After intensive studies, I successfully reduced the ImageNet model by 500 times to adapt to a small training dataset and achieved human accuracy for the first time.
DEPLOYMENT AND IMPACT
I led a team of four engineers to produce the model for the client, including building REST API endpoints on public clouds designated by the client for security and latency considerations. I trained the team to fine-tune the model on client data. The developed solution landed the first contract for our company.
Wayfinder Vizier | Rich Question Answering via Neural Semantic Search
For example, Wayfinder Vizier answers the following questions about Chihuly Garden and Glass, a tourist destination, based on TripAdvisor out-of-box and without any tuning user reviews. The answers, i.e., excerpts from user reviews in this case, are both diverse and relevant.
Q: What's the best time to visit the museum?
• A sunny day is the best time to visit.
• I think the garden at night is the best time to view it.
• The best time to visit is just before the sunset.
• My family of three made our reservations for 7:00 PM. (They request not only the date of your visit, but you must select a time that you will visit.) It was the perfect time to visit.
• Summer is not the best time to visit.
Q: Is it a good idea to bring kids?
• Not a place to bring young kids.
• It may not be worth it to bring the little ones.
• Is a good attraction for families.
• What a great place to visit with kids.
• Not a place for kids.
Wayfinder Vizier was selected as one of 10 Emerging AI Pioneers to present at O'Reilly AI Conference 2019.
Skills
Languages
Python, C++, GraphQL, SQL, Java
Other
Machine Learning, Deep Learning, Distributed Systems, Deep Neural Networks, Artificial Intelligence (AI), Computer Vision, Natural Language Understanding (NLU), Natural Language Processing (NLP), GPT, ChatGPT, OpenAI GPT-3 API, Chatbots, Tools, Protocol Buffers, Vowpal Wabbit, Facebook, Google, Generative Pre-trained Transformers (GPT)
Libraries/APIs
TensorFlow, PyTorch, Scikit-learn
Platforms
Linux, Docker, Kubernetes
Frameworks
Core ML, Spark, Spring, Django
Tools
Git
Storage
MySQL, RocksDB, LevelDB
Education
PhD in Machine Learning
Carnegie Mellon University - Pittsburgh, PA, USA
Bachelor of Science Degree in Computer Science
Caltech | California Institute of Technology - Pasadena, CA, USA
Bachelor of Arts Degree in Physics and Mathematics
Wesleyan University - Middletown, CT, USA