Zachary A. Goodman, Data Science Developer in San Diego, United States
Zachary A. Goodman

Data Science Developer in San Diego, United States

Member since August 25, 2022
Zachary is a silicon-valley experienced and PhD educated data scientist with expertise in experiments, causal effect estimation, and cloud data tools. He holds a PhD in economics with a research focus on estimating causal effects using big data. Zachary has over six years of experience using Python, SQL, R, and Stata to answer questions using data. He is a tech lead that enjoys building data pipelines in Google Cloud and BigQuery and automated dashboards and reports using Looker.
Zachary is now available for hire

Portfolio

  • Recidiviz
    Google BigQuery, BigQuery, Google Cloud, Google Cloud Platform (GCP), SQL...
  • University of California - San Diego
    Causal Inference, Big Data, Python, Jupyter, STATA, Experimental Design...
  • Quora
    Python, Redshift, Spark, Jupyter, Causal Inference, A/B Testing, Clustering...

Experience

  • Python 10 years
  • Causal Inference 6 years
  • Experimental Design 6 years
  • A/B Testing 4 years
  • SQL 4 years
  • Looker 2 years
  • Google Cloud 2 years
  • BigQuery 2 years

Location

San Diego, United States

Availability

Part-time

Preferred Environment

Python, Jupyter, RStudio, Google BigQuery, Google Cloud Platform (GCP), SQL, STATA, Looker, Pandas, Matplotlib

The most amazing...

...project I've done is leading the team of analysts and data scientists to build automated data pipelines, saving the organization around $5 million annually.

Employment

  • Senior Data Scientist

    2021 - PRESENT
    Recidiviz
    • Led a team of data scientists and analysts to conduct analysis and develop new data pipelines, saving the partner organization around $5 million per year.
    • Served as the primary point person for Looker infrastructure development, allowing analysts to build fast and reliable business intelligence dashboards.
    • Analyzed multiple natural experiments to determine the causal effects of policies and programs by partner organizations and the company.
    Technologies: Google BigQuery, BigQuery, Google Cloud, Google Cloud Platform (GCP), SQL, Python, Causal Inference, Jupyter, Cloud Infrastructure, Looker, Data Science
  • Researcher

    2016 - 2021
    University of California - San Diego
    • Served as the research lead on multiple projects, where I designed, implemented, and analyzed field experiments to provide evidence of program effectiveness for several education technologies.
    • Worked with the world's largest grocery store purchase dataset to determine the relationship between price changes and consumer behavior.
    • Analyzed a leading consumer panel dataset to examine how large income shocks affect consumer decision-making.
    Technologies: Causal Inference, Big Data, Python, Jupyter, STATA, Experimental Design, Experimental Research, Economics, Economic Analysis, Surveys, Data Science
  • Data Science Intern

    2019 - 2019
    Quora
    • Demonstrated changes to the A/B testing suite to conduct experiments using much smaller samples and to require less time to detect effects.
    • Improved the design of the advertisement auctions by applying economic theory and testing the design using live data.
    • Built features and adjusted the existing machine learning architecture to improve advertisement click through rate predictions.
    Technologies: Python, Redshift, Spark, Jupyter, Causal Inference, A/B Testing, Clustering, Machine Learning, Predictive Modeling, Marketplace Design, Data Science

Experience

  • Looker Infrastructure

    Developed core Looker infrastructure, including integration with the Google Cloud Platform (GCP) and BigQuery, allowing analysts to build business intelligence dashboards and leadership to see KPIs in real-time.

  • Multiple Causal Analyses

    Used applied econometric methods to analyze field and natural experiments. I incorporated differences-in-differences, synthetic controls, regression discontinuity, instrumental variables, and matching methods.

Skills

  • Languages

    Python, SQL
  • Libraries/APIs

    Pandas, Matplotlib
  • Tools

    Jupyter, Looker, BigQuery, STATA, MATLAB
  • Paradigms

    Data Science, Business Intelligence (BI)
  • Platforms

    Google Cloud Platform (GCP), RStudio
  • Other

    Google BigQuery, Causal Inference, Experimental Design, Experimental Research, A/B Testing, Regression, Econometrics, Applied Research, Linear Regression, Logistic Regression, Economics, Economic Analysis, Big Data, Machine Learning, Cloud Infrastructure, Predictive Modeling, Marketplace Design, Linear Algebra, Surveys, Clustering, Dashboards, Dashboard Design, Technical Writing
  • Storage

    Google Cloud, Redshift, Google Cloud SQL
  • Frameworks

    Spark

Education

  • PhD in Economics
    2016 - 2021
    University of California - San Diego, CA, USA
  • Bachelor's Degree in Economics and Mechanical Engineering (Dual Degree)
    2012 - 2016
    North Carolina State University - Raleigh, NC, USA

To view more profiles

Join Toptal
Share it with others