
Jesse Liu
Verified Expert in Engineering
Data Scientist and Quantitative Software Developer
Irvine, CA, United States
Toptal member since March 14, 2022
Jesse is a data scientist and machine learning engineer with 10+ years of versatile industry experience developing robust solutions for social networks, ads engineering, quantitative investments, and engineering applications. He is passionate about delivering high-quality code and solutions that meet the needs and expectations of his clients. Jesse enjoys collaborating with team members with a professional attitude and exploring new ideas and technologies.
Portfolio
Experience
- Software Development - 6 years
- Python - 5 years
- Machine Learning - 5 years
- Forecasting - 5 years
- Google BigQuery - 2 years
- Apache Airflow - 2 years
- Advertising Technology (Adtech) - 2 years
- Rankings - 2 years
Availability
Preferred Environment
Visual Studio Code (VS Code), GitHub, Jupyter Notebook
The most amazing...
...projects I worked on include machine learning for Reddit ads engineering, quantitative trading for hedge funds, and software apps for Samsung Galaxy GPS chips.
Work Experience
Machine Learning Engineer
Reddit, Inc.
- Conducted data analysis on the business metrics of Reddit online ads prediction models, e.g., conversion rate, prediction rate, calibration error, conversion distribution, revenue and ARPU, CPM, etc.
- Analyzed and evaluated ads conversion modeling performance with segmentation analysis on various features, including placement and platform types, advertiser industry, user privacy flags, and sales channel, based on A/B tests and DDG experiments results.
- Constructed ads prediction model performance monitoring and tracking dashboards on Mode.com, extracting metrics from TBs of ads measurement data stored in Google BigQuery.
- Developed a model training pipeline for ads prediction models based on gradient-boosting decision trees (GBDTs) and bid adjustment algorithms, implemented using TensorFlow Decision Forests (TF-DF).
- Built a feature engineering pipeline leveraging Google BigQuery, Apache Airflow, and dbt to handle daily roll-up of hundreds of TB-level big data, feature generation, ingestion, and training and test dataset creation for ads CPC and CPA cost models.
- Developed and maintained the prediction API between the ads inference server and upstream requesting server, ensuring seamless integration and responsiveness to model updates, requirements, and design changes.
Senior Quantitative Researcher
Outremont Technologies
- Conducted research on machine learning models for financial security time series forecasting, regression, and classification, including gradient boosting trees (Scikit-Learn, XGBoost, and LightGBM) and neural network models.
- Investigated the application of convolutional deep learning architectures to transformed signals, evaluating their effectiveness in momentum and volatility time series forecasting for crypto assets.
- Developed a robust break-out momentum trading signal generation framework, employing probabilistic filtering and transaction cost analysis techniques to identify opportunities in the cryptocurrency market.
- Performed a thorough statistical analysis and evaluation of all-events cryptocurrency market data, examining the efficacy of quantitative signals and features provided by various vendors.
Quantitative Researcher | Data Scientist
iFDC Capital Management, LLC
- Leveraged statistical data analysis, trading signal hypothesis testing and optimization, and quantitative strategies to enhance financial derivatives portfolio management.
- Developed time series regression models for parameter estimation of mean-reversion stochastic process models, contributing to the refined stock index investment strategies.
- Created a data pipeline and management dashboard, enabling real-time data streaming, historical data query, model monitoring, performance reporting, and visualization.
- Established robust research databases for all-events historical market data using Clickhouse, providing a centralized repository for data-driven analysis and strategy development.
Staff Systems Engineer
Broadcom
- Developed the statistical models and algorithms for the highly accurate GPS receiver on-chip clock system.
- Analyzed a large amount of lab data for cellular radio chips and built statistical models to trade off between radio communication metrics. Developed numerical programs with Python to search circuit parameters and optimize chips' performance.
- Developed a comprehensive set of automation tools in Python for regression characterization. The tools manage testing cases, error recovery, lab instruments control, and data report.
- Developed a GPS receiver host software and signal processing algorithms in C++, log analysis and processing with Python, and software and algorithm defect tracking via Jira.
Experience
Ads Prediction 10x Features
Ads Prediction Batch Feature Engineering Pipeline
Ads Prediction Modeling Performance-tracking Dashboard
Quantitative Cryptocurrecy Trading Model
Algorithmic Trading Module of an Asset Management Platform
• Led the quantitative software development of the event-driven algorithm module for the micro-service structured proprietary asset management cloud application.
• Researched the convolutional deep learning models (Conv1D, ResNet, EfficientNet, WaveNet, DenseNet, etc.) applied on the CWT transformed signal for time series forecasting.
• Innovated algorithmic order types to reduce transaction cost and risk and optimized the leg synchronous execution for spread trading to reduce timing risk.
• Developed data pipelines and a portfolio management dashboard for real-time market data streaming, historical database query, model optimization and monitoring, statistics and indicators generation and reporting, and user front-end visualization.
Education
Ph.D. in Electrical Engineering
University of California, Riverside - Riverside, California, USA
Skills
Libraries/APIs
Pandas, Scikit-learn, NumPy, PyTorch, Keras, SciPy, TensorFlow, Matplotlib
Tools
MATLAB, Apache Airflow, GitHub
Languages
Python, SQL, C++, Go, HTML
Paradigms
Quantitative Research, ETL, Microservices
Platforms
Jupyter Notebook, Docker, Visual Studio Code (VS Code), Google Cloud Platform (GCP), Linux, Amazon EC2, Amazon Web Services (AWS)
Storage
MySQL, Data Pipelines, ClickHouse, PostgreSQL
Other
Data Science, Machine Learning, Signal Processing, Quantitative Finance, Wireless Systems, Software Development, Finance, Financial Modeling, Visualization, Statistics, Statistical Analysis, Software Engineering, Deep Learning, Data Modeling, Statistical Data Analysis, Bokeh, Data Visualization, Predictive Modeling, Artificial Intelligence (AI), Back-end Development, Data Build Tool (dbt), Semiconductors, GPS, MLflow, Machine Learning Operations (MLOps), Data Engineering, Google BigQuery, Dashboards, Data Analytics, Data Manipulation, Quantitative Analysis, Forecasting, Regression, Classification, Data Analysis, Algorithms, Rankings, Recommendation Systems, Advertising Technology (Adtech), Convolutional Neural Networks (CNNs), Time Series Analysis, Back-end, Distributed Systems, Ads, Statistical Modeling, Signal Filtering
How to Work with Toptal
Toptal matches you directly with global industry experts from our network in hours—not weeks or months.
Share your needs
Choose your talent
Start your risk-free talent trial
Top talent is in high demand.
Start hiring