Kyle Heuton, Machine Learning Developer in Boston, MA, United States

Member since April 21, 2020
Kyle is a scientific software developer with eight years of experience building data engineering applications for healthcare. In global health, he built tools to forecast global mortality for a public health research institute. In US healthcare, he built a data ingestion platform to receive and normalize data on hundreds of millions of patients.

Location

Boston, MA, United States

Availability

Part-time

Preferred Environment

Amazon Web Services (AWS), NumPy, Pandas, Scala, Python

The most amazing...

...project I've completed was building tools to forecast deaths from 205 causes in 195 countries.

Employment

  • Software Engineer

    2018 - 2020
    OM1
    • Designed a Python platform on the AWS cloud to receive, de-identify, and normalize electronic medical records and insurance claims data from diverse sources on hundreds of millions of patients.
    • Deployed the ingestion platform to 10 data partners within one year, ingesting GBs of data daily.
    • Constructed a data processing service in Scala to define cohorts of patients based on clinical disease criteria and enrich those cohorts with predictive metrics on disease outcomes and medical expenditures from machine learning models.
    • Served as interim lead of the data team: developed data transmission procedures with customers, planned the team’s roadmap, and mentored junior engineers.
    • Created SQL queries and workflows to manage complex ETL tasks on medical data, including de-duplication, patient linking, and derivation of clinically relevant metrics such as insurance histories and drug eras.
    Technologies: Amazon Web Services (AWS), Claims, Amazon S3 (AWS S3), SQL, Python, Scala
  • Software Engineer and Forecasting Researcher

    2012 - 2017
    Institute for Health Metrics and Evaluation
    • As the team’s lead Spark engineer, built a predictive modeling platform to forecast health scenarios worldwide and the potential impacts of specific policies on global health. Forecasted mortality from 205 causes in 195 countries.
    • Developed a scientific software pipeline in Python used by dozens of modelers to run more than 20,000 models annually. Data and results were stored in our SQL database, and models ran on a Univa Grid Engine cluster.
    • Created Python tools to support data analysts and researchers in modeling disease prevalence and economic drivers of health.
    Technologies: STATA, SQL, Data Science, Data Analysis, R, Python
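
The de-duplication and drug era derivation mentioned under OM1 were done in SQL; the following is only a minimal Python sketch of the general idea, not the queries used there. It assumes exposure records arrive as (patient_id, drug, start_date, end_date) tuples and that exposures separated by at most 30 days belong to the same era. The function name build_drug_eras, the record layout, and the 30-day default are all illustrative assumptions.

```python
from datetime import date, timedelta
from itertools import groupby


def build_drug_eras(exposures, max_gap_days=30):
    """Merge exposure records into eras per (patient_id, drug).

    exposures: iterable of (patient_id, drug, start_date, end_date) tuples
    (a hypothetical layout). Exact duplicates are dropped first; exposures
    for the same patient and drug are merged into one era when the gap
    between them is at most max_gap_days (an assumed threshold).
    """
    # set() removes exact duplicates; sorting orders rows by patient, drug, start.
    unique = sorted(set(exposures))
    eras = []
    for (patient_id, drug), group in groupby(unique, key=lambda r: (r[0], r[1])):
        era_start = era_end = None
        for _, _, start, end in group:
            if era_start is None:
                era_start, era_end = start, end
            elif start - era_end <= timedelta(days=max_gap_days):
                # Close enough to the current era: extend it.
                era_end = max(era_end, end)
            else:
                # Gap too large: close the current era and start a new one.
                eras.append((patient_id, drug, era_start, era_end))
                era_start, era_end = start, end
        eras.append((patient_id, drug, era_start, era_end))
    return eras
```

Sorting before grouping keeps the merge to a single pass per patient and drug, which mirrors how the same logic is often written with window functions in SQL.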

Experience

  • Data Ingestion Platform for TBs of Medical Data

    I worked for a health data company that needed to ingest TBs of data from dozens of different partners. These data sets were diverse: they differed in size, arrived at different frequencies, and came from organizations with varying levels of technological capacity.

    I built a system on AWS that used S3 buckets for storage. A serverless Lambda function watched the buckets for incoming files; when a file arrived, it logged the receipt and processed the file accordingly. Within its first year, the platform was deployed to 10 partners and was ingesting GBs of data every day.

  • Health Profiles for Every Country in the World

    For the release of the massive 2013 Global Burden of Disease Study, my research group needed to create a two-page health profile for every country. I quickly wrote the scripts to generate these reports, pulling the relevant data from a MySQL database and creating the graphics and text unique to each country.
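
For the data ingestion platform described above, the pattern of a Lambda function reacting to files arriving in S3 might look roughly like the sketch below. This is not the platform's actual code. It relies only on the standard S3 event notification layout (Records, s3.bucket.name, s3.object.key); the route_for helper and the convention that the first path segment of the key names the partner are hypothetical.

```python
import logging
from urllib.parse import unquote_plus

logger = logging.getLogger(__name__)
logger.setLevel(logging.INFO)


def route_for(key):
    """Hypothetical routing rule: the first path segment is the partner name."""
    partner, _, filename = key.partition("/")
    return {"partner": partner, "filename": filename}


def handler(event, context):
    """Log each newly arrived S3 object and return a summary of what was received."""
    received = []
    for record in event.get("Records", []):
        bucket = record["s3"]["bucket"]["name"]
        # Object keys in S3 event notifications are URL-encoded.
        key = unquote_plus(record["s3"]["object"]["key"])
        size = record["s3"]["object"].get("size")
        logger.info("received s3://%s/%s (%s bytes)", bucket, key, size)
        # A real pipeline would hand the file to partner-specific processing here.
        received.append({"bucket": bucket, "size": size, **route_for(key)})
    return received
```

Keeping the handler limited to logging and routing lets partner-specific processing live elsewhere, which suits data sets that differ in size and arrival frequency.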

Skills

  • Languages

    Python, SQL, Scala, R
  • Libraries/APIs

    Pandas, Scikit-learn, NumPy, D3.js
  • Tools

    STATA, MATLAB
  • Platforms

    Amazon Web Services (AWS), Amazon EC2
  • Storage

    Amazon S3 (AWS S3)
  • Other

    Data Analysis, Machine Learning, Regression, Claims
  • Paradigms

    Data Science

Education

  • Master of Public Health Degree in Quantitative Health Metrics
    2012 - 2015
    University of Washington - Seattle, WA
  • Bachelor of Science Degree in Chemical Engineering
    2003 - 2012
    University of Minnesota - Minneapolis, MN
  • Bachelor of Science Degree in Mathematics
    2003 - 2012
    University of Minnesota - Minneapolis, MN
