Anthony Khong, Developer in Jakarta, Indonesia
Anthony is available for hire
Hire Anthony

Anthony Khong

Verified Expert  in Engineering

Mathematical Modeling Developer

Jakarta, Indonesia

Toptal member since December 15, 2020

Bio

Anthony is an experienced data scientist and developer with a demonstrated history of working in research, algorithmic trading, and technology consultancy. He is skilled in software development, statistical analysis, and machine learning. Anthony is a strong entrepreneurship professional with an MSc in applied statistics from the University of Oxford and an MPhil in economics research from the University of Cambridge.

Portfolio

Zero One Group
Entrepreneurship, Mathematical Modeling, Google Cloud Platform (GCP), DevOps...
Agoda Company, Pte. Ltd.
Python, Customer Lifetime Value (CLV), Digital Marketing, A/B Testing, Spark...
Seamless Global, Ltd.
C++, Trading, Docker, Python, NumPy, Pandas, TensorFlow, GPU Computing...

Experience

  • Mathematical Modeling - 7 years
  • Machine Learning - 7 years
  • Python - 7 years
  • Data Science - 6 years
  • Pandas - 6 years
  • Econometrics - 5 years
  • Bayesian Inference & Modeling - 4 years
  • Clojure - 2 years

Availability

Part-time

Preferred Environment

Functional Programming, Scala, Spark, Pandas, NumPy, Google Cloud Platform (GCP), Python, Clojure, SSH, Vim Text Editor

The most amazing...

...experimentation methodologies and guidelines I've designed for Agoda's A/B testing platform changed how most teams analyze and define their experiments.

Work Experience

Co-founder

2019 - PRESENT
Zero One Group
  • Spearheaded a large-scale cost optimization project for an industrial manufacturing client that resulted in multimillion-dollar estimated raw-material cost savings.
  • Led a data-driven customer segmentation and persona generation project for one of Indonesia's retail giants.
  • Managed a mathematical modeling and constrained optimization project for one of Indonesia's leading engine lubricant manufacturing companies.
Technologies: Entrepreneurship, Mathematical Modeling, Google Cloud Platform (GCP), DevOps, Pandas, Spark, Python, Clojure

Lead Data Scientist

2017 - 2018
Agoda Company, Pte. Ltd.
  • Led the research team for the A/B testing platform.
  • Carried out research for automated bidding and customer loyalty.
  • Contributed to the then-new Python-based machine learning library as the lead developer.
  • Contributed to the data-science efficiency team focusing on quick research and deployment times.
Technologies: Python, Customer Lifetime Value (CLV), Digital Marketing, A/B Testing, Spark, Scala

Data Scientist

2014 - 2016
Seamless Global, Ltd.
  • Served as a full-stack data scientist using state-of-the-art machine learning algorithms and tools.
  • Worked on scientific computing and high-performance computing projects using Python, C++, and CUDA.
  • Conducted research focusing on the areas of applied machine learning and algorithmic trading.
  • Collaborated using Agile software development principles and practices.
  • Practiced clean code and test-driven development with strict standards.
Technologies: C++, Trading, Docker, Python, NumPy, Pandas, TensorFlow, GPU Computing, Machine Learning

Supervisor

2012 - 2015
University of Cambridge
  • Supervised different courses in economics, including Economics Part I—Quantitative Methods for Economists, Economics Part IIA—Theory and Application of Econometrics, and Economics Part IIB—Applied Econometrics.
  • Oversaw the course called Management—The Economics of Markets and Firms.
  • Interviewed the candidates for undergraduate admissions.
Technologies: Management, Econometrics, Economics

Experience

Data-driven Customer Segmentation and Persona Generation

One of Indonesia's retail giants wanted to bridge the gap between executives in the HQ and their clients. They wanted to be reminded and consider their typical clients' needs to inform them about high-level strategic decisions.

I led a research team that used multiple years of transactions to come up with about a dozen customer archetypes that are easy to relate to while still backed by data. The final persona deliverables were later used to inform executive decisions on marketing campaigns, brand positioning, and product offering gaps.

Large-scale Constrained Optimization Application in Manufacturing

A large-scale constrained optimization application to optimize production planning for a manufacturing company in Indonesia. The app considers over a hundred thousand constraints related to production quality, stock availability, and stock replenishment possibilities. The project involved creating a mathematical model to represent the production processes, developing the application using mainly Python, OR-Tools, and Django, and deploying the application on AWS.

Optimal Stock Replenishment Algorithm

I worked with one of Indonesia's retail giants to develop a production planning application, which involves an optimal stock replenishment algorithm. The algorithm is comprised of a supervised learning model that uses several years of past transaction data to predict the sales schedule of a particular stock keeping unit (SKU). It takes into account features such as seasonality, trends, special dates, and item characteristics. The predictions are used to guide merchandise planners in making the right decisions on the timing and quantity for stock replenishments.

Geni—Open-source Clojure DataFrame Library

https://github.com/zero-one-group/geni
I am the author of Geni, a Clojure DataFrame library that runs on Apache Spark. The main idea of Geni is to provide an idiomatic DataFrame API in Clojure while leveraging on the already mature Spark ecosystem. As a result, Geni is stable, performant, and has all the important Spark features, including Spark ML.

I started the project, designed the API, wrote most of the documentation, and provided examples, notably one for deploying Geni to Google Cloud's Dataproc. The project has 100% test coverage. It is installable with a few Bash lines and includes a cookbook to optimize the getting started experience. It has been used as the main data analysis and machine learning library for a couple of past projects.

Education

2013 - 2014

Master of Science Degree in Applied Statistics

University of Oxford - Oxford, UK

2012 - 2013

Master of Philosophy Degree in Economic Research

University of Cambridge - Cambridge, UK

2009 - 2012

Bachelor of Arts Degree in Economics

University of Cambridge - Cambridge, UK

Certifications

AUGUST 2017 - PRESENT

Parallel Programming

Coursera, Inc.

AUGUST 2017 - PRESENT

Big Data Analysis with Scala and Spark

Coursera, Inc.

JUNE 2017 - PRESENT

Functional Program Design in Scala

Coursera, Inc.

JUNE 2017 - PRESENT

Functional Programming Principles in Scala

Coursera, Inc.

FEBRUARY 2017 - PRESENT

Algorithms on Graphs

Coursera, Inc.

DECEMBER 2016 - PRESENT

Algorithmic Toolbox

Coursera, Inc.

JULY 2014 - PRESENT

CUDA Programming

University of Oxford

Skills

Libraries/APIs

NumPy, Pandas, TensorFlow, Thrust, Dask, Spark ML, Scikit-learn

Tools

Vim Text Editor, Google OR-Tools, Jupyter, Spark SQL

Languages

Python, Clojure, SQL, R, Scala, C++, Haskell, Java

Paradigms

Functional Programming, Unit Testing, Test-driven Development (TDD), Agile, High-performance Computing (HPC), DevOps, REST, Management, Distributed Computing, Parallel Programming

Frameworks

Spark, Apache Spark, Django

Platforms

Docker, Google Cloud Platform (GCP), JVM, NVIDIA CUDA, Amazon Web Services (AWS)

Storage

PostgreSQL

Other

Bayesian Statistics, Bayesian Inference & Modeling, Mathematical Modeling, Machine Learning, Computational Statistics, Statistical Methods, A/B Testing, Big Data, Version Control, Data Science, Economics, CI/CD Pipelines, Data Engineering, Econometrics, Time Series Analysis, Macroeconomics, Microeconomics, Deep Learning, Markov Chain Monte Carlo (MCMC) Algorithms, SSH, GPU Computing, Trading, Digital Marketing, Customer Lifetime Value (CLV), Entrepreneurship, Algorithms, Data Structures, Data Analysis, Optimization, Supervised Learning

Collaboration That Works

How to Work with Toptal

Toptal matches you directly with global industry experts from our network in hours—not weeks or months.

1

Share your needs

Discuss your requirements and refine your scope in a call with a Toptal domain expert.
2

Choose your talent

Get a short list of expertly matched talent within 24 hours to review, interview, and choose from.
3

Start your risk-free talent trial

Work with your chosen talent on a trial basis for up to two weeks. Pay only if you decide to hire them.

Top talent is in high demand.

Start hiring