
Jesse Liu
Verified Expert in Engineering
Data Scientist and Quantitative Software Developer
Irvine, CA, United States
Toptal member since March 14, 2022
Jesse is a software engineer developing data science solutions and a machine learning engineer with 10+ years of versatile industry experience. He's worked in the areas of social networks, ad engineering, quantitative investments, and engineering applications. He is passionate about delivering high-quality code and solutions with real business impact and always meets expectations. Jesse enjoys collaborating with team members with a professional attitude and exploring new ideas and technologies.
Portfolio
Experience
- Software Development - 6 years
- Python - 5 years
- C++ - 5 years
- Machine Learning - 5 years
- Advertising Technology (Adtech) - 2 years
- Rankings - 2 years
- Apache Airflow - 2 years
- Recommendation Systems - 1 year
Preferred Environment
Visual Studio Code (VS Code), GitHub, Jupyter Notebook
The most amazing...
...projects I've developed are the machine learning systems for Reddit ads prediction and conversion, including feature engineering and model training pipelines.
Work Experience
Quantitative Developer
Jinhui Capital
- Developed the enhanced equity indexing strategies that aim to outperform the CSI 300 equity index, offering excessive returns with consistent alpha by leveraging machine learning models with factor tilts, security selection, and derivatives/futures.
- Developed the trading signal in production for portfolios with 30+ commodity pairs in the futures market, including ferrous metal, energy, chemical, and agricultural products, and stock index.
- Managed quantitative team recruiting and tech infrastructure development.
Machine Learning Engineer
Reddit, Inc.
- Developed a model training pipeline for ad prediction models based on gradient-boosting decision trees (GBDTs) and bid adjustment algorithms, implemented using TensorFlow Decision Forests (TF-DF).
- Built a feature engineering pipeline leveraging Google BigQuery, Apache Airflow, and dbt to handle daily roll-up of hundreds of TB-level big data, feature generation, ingestion, and training and test dataset creation for ads CPC and CPA cost models.
- Developed and maintained the prediction API between the ads inference server and upstream requesting server, ensuring seamless integration and responsiveness to model updates, requirements, and design changes.
- Developed ads ranking model performance monitoring and tracking dashboards, extracting business metrics such as conversion rate, model prediction rate and calibration error, conversion distribution, click rate, revenue/ARPU, CPM, and more.
- Developed dashboards to track and evaluate ad conversion modeling performance with segmentation analysis on various features, such as placement/platform types, advertiser industry, sales channel, etc., based on A/B tests and DDG experiments results.
Senior Quantitative Researcher
Outremont Technologies
- Handled machine learning modeling of the time series forecasting for cryptocurrency perpetual contracts, with XGBoost gradient boosting decision tree (GBDT) models, and temporal convolutional networks (TCN).
- Developed a break-out momentum predictive model and the trading signal generation framework, employing Bayesian filtering techniques and order/trade flow analysis to discover trading opportunities for crypto derivatives/perpetual contracts.
- Developed a complete backtesting framework for crypto perpetuals in Python, with core filtering algorithms in C++ to speed up, enabling rigorous evaluation of strategies and signal performance.
- Contributed to statistical analysis and evaluation of the off-the-shelf quantitative feature libraries for the cryptocurrency market provided by various vendors.
Quantitative Developer
iFDC Capital Management
- Developed machine learning models to generate directional signals for intraday momentum trading on U.S. stock index futures. Implemented automatic execution and backtesting for statistical arbitrage trading strategies on commodity derivatives.
- Led the development of the event-driven algorithm module in the firm’s proprietary trading platform, ensuring efficient low-latency event processing.
- Developed portfolio management dashboard, enabling real-time market data streaming, position monitoring, indicator and model monitoring, and performance reporting and visualization.
- Built the historical data and feature engineering pipeline, from data ingestion, transformation, to feature generation.
- Built a robust research Clickhouse database with exchange all-events market data, providing a centralized repository for data-driven analysis and strategy development.
Staff Systems Engineer
Broadcom
- Developed a GPS receiver host software and signal processing algorithms in C++, log analysis and processing with Python, and software and algorithm defect tracking via Jira.
- Analyzed a large amount of lab data for cellular radio chips and built statistical models to trade off between radio communication metrics. Developed numerical programs with Python to search circuit parameters and optimize chips' performance.
- Developed a comprehensive set of automation tools in Python for regression characterization. The tools manage testing cases, error recovery, lab instrument control, and data reporting.
- Developed the statistical models and algorithms for the highly accurate GPS receiver on-chip clock system.
Experience
Ad Prediction - 10x Features
Ad Prediction Batch Feature Engineering Pipeline
Ads Prediction Modeling Performance-tracking Dashboard
Evaluation Framework for Cryptocurrency Investment Strategies
Algorithmic Trading Module of the Proprietary Asset Management Platform
• Led the quantitative software development of the event-driven algorithm module for the microservice-structured proprietary asset management cloud application.
• Developed machine learning models to generate directional signals for intraday momentum trading on U.S. stock index futures. Implemented automatic execution and backtesting for statistical arbitrage trading strategies on commodity derivatives.
• Built the historical data and feature engineering pipeline, from data ingestion, transformation, to feature generation.
• Built a robust research Clickhouse database with exchange all-events market data, providing a centralized repository for data-driven analysis and strategy development.
• Developed portfolio management dashboard, enabling real-time market data streaming, position monitoring, indicator and model monitoring, and performance reporting and visualization.
Education
PhD in Electrical Engineering
University of California, Riverside - Riverside, California, USA
Skills
Libraries/APIs
Pandas, Scikit-learn, NumPy, PyTorch, Keras, SciPy, TensorFlow, Matplotlib
Tools
MATLAB, Apache Airflow, GitHub
Languages
Python, SQL, C++, Go, HTML
Paradigms
Quantitative Research, ETL, Microservices
Platforms
Jupyter Notebook, Docker, Visual Studio Code (VS Code), Google Cloud Platform (GCP), Linux, Amazon EC2, Amazon Web Services (AWS)
Storage
MySQL, Data Pipelines, ClickHouse, PostgreSQL
Other
Data Science, Machine Learning, Signal Processing, Quantitative Finance, Wireless Systems, Software Development, Finance, Financial Modeling, Visualization, Statistics, Statistical Analysis, Software Engineering, Deep Learning, Data Modeling, Statistical Data Analysis, Bokeh, Data Visualization, Predictive Modeling, Artificial Intelligence (AI), Back-end Development, Data Build Tool (dbt), Semiconductors, GPS, MLflow, Machine Learning Operations (MLOps), Data Engineering, Google BigQuery, Dashboards, Data Analytics, Data Manipulation, Quantitative Analysis, Forecasting, Regression, Classification, Data Analysis, Algorithms, Rankings, Recommendation Systems, Advertising Technology (Adtech), Convolutional Neural Networks (CNNs), Time Series Analysis, Back-end, Distributed Systems, Ads, Statistical Modeling, Signal Filtering
How to Work with Toptal
Toptal matches you directly with global industry experts from our network in hours—not weeks or months.
Share your needs
Choose your talent
Start your risk-free talent trial
Top talent is in high demand.
Start hiring