Senior Data Scientist
2020 - PRESENT
Level
- Joined Level as its second-most senior data scientist. Level is a fast-growing healthtech startup backed by First Round Capital and other elite VC firms.
- Built machine learning models to predict insurance costs, detect fraud, and determine the quality of service providers. These models are key to Level's competitive advantage.
- Performed exploratory data analysis on various data sets to deliver critical business insights and inform product development.
Technologies: Kubernetes, SQL, Python

Contributor
2020 - PRESENT
scikit-learn
- Read statistics papers related to data sampling to find evidence for a new stratified regression data sampling feature.
- Contributed to docs on machine learning and data science best practices.
- Advocated for new features such as p-value support in linear regression models.
Technologies: Scikit-learn

Senior Data Engineer
2018 - 2020
Temple Capital
- Hired as employee #1 at a hedge fund backed by Pantera Capital and Bain Capital that used machine learning to find profitable trading opportunities.
- Worked with a machine learning stack that used proprietary statistical features, XGBoost and Random Forest regression models, and Bayesian hyperparameter optimization to train profitable time series prediction models.
- Used pandas and SQL to create dashboards that analyzed trading strategy performance and identified changes to our trading system, increasing trading profits by up to 3%.
- Explored historical market data to find interesting patterns and trends that could support novel trading strategies.
- Built the fund's data platform and data lake on AWS using Postgres (RDS), S3, Presto (Athena), Batch, Step Functions, and ECS. Our machine learning system used the platform to ingest terabytes of data and discover new trading strategies.
- Served as the sole engineer on the DevOps and cloud automation side, and set up a robust and stable system that needed very little maintenance and had almost zero downtime over 1.5 years. Deployed hundreds of machines with Docker and Terraform.
Technologies: Amazon Web Services (AWS), SQL, Python

Senior Software Engineer
2016 - 2018
Fitbit
- Worked with the data science team to investigate and predict user behavior to increase Fitbit user retention.
- Used pandas and SQL to investigate and debug system outages to ensure a seamless Fitbit user experience.
- Served as a lead contributor to the Fitbit API Authorization Service, a distributed system that handles authentication and authorization using OAuth 2.0. The system receives 80,000 requests per second (RPS), serving all third-party apps as well as the Fitbit mobile and web apps.
- Built the migration framework used to move the Auth Service database from MySQL to Cassandra, improving service scalability with no API downtime. Designed the Cassandra schema.
Technologies: Java, SQL, Python