Josh Chapman, Data Warehousing Developer in Los Angeles, CA, United States
Josh Chapman

Data Warehousing Developer in Los Angeles, CA, United States

Member since October 14, 2019
Josh is an engineering leader and architect with a special passion for data. He's been building data warehouses and reporting platforms for top retailers and eCommerce startups for over a decade. He loves learning and meeting new teams, and he's well-versed in the latest trends and tools in data.
Josh is now available for hire


  • The Honest Company
    Amazon Web Services (AWS), Dimensional Modeling, Data Warehouse Design...
  • Nordstrom
    Amazon Web Services (AWS), Data Warehouse Design, Data Warehousing, SQL, Bash...
  • ideel
    SQL, Data Warehouse Design, Data Warehousing, Bash, NetSuite, Epicor, Talend...



Los Angeles, CA, United States



Preferred Environment

Git, Atom, JetBrains, Linux, MacOS

The most amazing...

...projects are those that have the biggest impact, not the fanciest code. I've built data warehouses using cheap tools that saved hundreds of person-hours a week.


  • Senior Manager | Data Engineering

    2019 - 2021
    The Honest Company
    • Built and scaled the company's data warehouse using Amazon Redshift.
    • Integrated with new SaaS vendors including Salesforce Commerce Cloud.
    • Implemented Airflow for job orchestration, using the Astronomer platform.
    • Brought modern data SDLC practices to the data engineering team.
    • Rebuilt users' trust in data by fixing reports, implementing clear support procedures, and migrating to a new BI tool.
    Technologies: Amazon Web Services (AWS), Dimensional Modeling, Data Warehouse Design, Data Warehousing, AWS, Domo, Tableau, SQL, Bash, PostgreSQL, MySQL, Relational Database Services (RDS), Talend, Apache Airflow, Python, Redshift, Data Engineering
  • Engineering Manager, Data Services

    2014 - 2019
    • Scaled up the data warehouse while improving performance and availability.
    • Implemented a data lake on AWS using S3, Spectrum, and Spark.
    • Scripted all infrastructure provisioning and configuration using Terraform and Ansible.
    • Redesigned and rewrote the ETL loading our Amazon Redshift Data Warehouse.
    • Built the Data Services team from zero to seven engineers, and served as interim manager for the Mobile Apps Engineering team.
    Technologies: Amazon Web Services (AWS), Data Warehouse Design, Data Warehousing, SQL, Bash, AWS, MySQL, Terraform, Ansible, Python, Pentaho Data Integration (Kettle), Google BigQuery, Redshift, Data Engineering
  • Manager of Data Services

    2011 - 2014
    • Led the effort to design and build ideeli's enterprise data warehouse.
    • Served as the team's architect and senior developer - designing, building, reviewing, deploying, and supporting ETL jobs, SQL scripts, and reports.
    • Led combined onshore and offshore teams in the implementation of MicroStrategy for end-user reporting and analysis.
    • Designed the company's ETL architecture from scratch, using Pentaho Data Integration (Kettle) and custom MySQL routines. Led development and production support.
    • Designed a key data integration system between the ERP and Warehouse Management systems, using PDI and MySQL.
    • Debugged reports in a wide range of technologies: MicroStrategy, Pentaho, MySQL, MS SQL Server, SSAS, MS Access, Excel, Epicor, NetSuite, Talend ETL, Hive, and shell scripts.
    Technologies: SQL, Data Warehouse Design, Data Warehousing, Bash, NetSuite, Epicor, Talend, Microsoft SQL Server, MicroStrategy, Pentaho, MySQL, Data Engineering
  • Software Developer

    2009 - 2011
    Bridgewater Associates
    • Designed and developed the data model, ETL process, and reports for the system used to monitor counterparty exposure. Ran status meetings with sponsors, and managed the project plan (SSIS, T-SQL, SSRS, and AgilePoint).
    • Got up to speed on previously unfamiliar business concepts – for instance, developed the database portion of a major new risk-control monitoring tool within a month of joining.
    • Played key role in reorg to separate production support from development: designed the training and certification plan for new support associates; improved the procedures for incident handling and escalation; served as lead trainer.
    Technologies: SQL, Data Warehousing, Data Warehouse Design, Stored Procedure, SQL Server Integration Services (SSIS), SSRS, Microsoft SQL Server, Data Engineering
  • Senior Consultant

    2005 - 2008
    Palladium Group
    • Designed and led the development of a budget analysis application for a leading office supply retailer (MS SQL Server, Hyperion Essbase, Windows Batch Scripting).
    • Led redesign of the sales-reporting app for a leading yellow-pages publisher. (Informatica and Oracle).
    • Designed and developed database components of a sales reporting and forecasting tool for a major quick-service restaurant company (PL/SQL, Bash, Hyperion Planning, and Hyperion Reports).
    Technologies: SQL, Data Warehouse Design, Data Warehousing, Batch Scripting, Bash, Reports, Essbase, Hyperion, PL/SQL, Oracle Database, DB, Oracle, Microsoft SQL Server, Informatica, Data Engineering
  • Consultant

    2003 - 2005
    • Served as a data warehouse developer on a long-term engagement for a large telecom.
    • Reviewed requirements and data models, wrote designs, wrote and tested code, and supported production.
    • Served as tech lead for the major redesign of an existing marketing data warehouse (Informatica PowerCenter, Oracle PL/SQL, Bash).
    Technologies: SQL, Data Warehouse Design, Data Warehousing, Bash, PL/SQL, Oracle Database, Informatica, Data Engineering


  • Flask App for Interactive Commands in Slack

    This is a fun hobby project of mine. It's a simple Flask app that runs on Heroku with a Postgres database. It handles interactive commands for a Slack workspace that I created and maintain.

  • Reporting Data Pipeline Using DBT, Redshift, and Periscope

    Redesigned the reporting environment for an insurance startup. They were struggling with performance issues in their reports, due to the use of nested, overly complex views in Periscope and resource contention in Redshift. I chose and implemented DBT to create tables in the database, and used Python to automate the migration of reports to use the new tables.

  • Migration from Redshift to Snowflake

    A financial-services company with very large data volumes wanted to migrate their data warehouse from Redshift to Snowflake while continuing to support many Tableau dashboards that used it. I recreated the key database objects in Snowflake and updated the dashboards to use it.


  • Languages

    SQL, Bash, Stored Procedure, Python, Snowflake
  • Tools

    Pentaho Data Integration (Kettle), Domo, Salesforce Marketing Cloud, JetBrains, Atom, Git, Apache Airflow, Ansible, Terraform, SSRS, Hyperion, Talend ETL, Tableau, AWS EBS, Stitch Data, Periscope Data, Matillion ETL for Redshift
  • Paradigms

    ETL, Business Intelligence (BI), Database Design, Agile, Dimensional Modeling
  • Other

    Data Warehouse Design, Data Warehousing, Data Modeling, Software Development, Data Engineering, Data Building Tool (DBT), Google BigQuery, MicroStrategy, eCommerce, SSRS Reports, Fivetran, Relational Database Services (RDS), AWS, Epicor, NetSuite, Reports, Batch Scripting, Informatica, Analytics
  • Platforms

    Amazon EC2, MacOS, Linux, Talend, Pentaho, Oracle, Oracle Database, Amazon Web Services (AWS), Google Analytics 360, AWS Kinesis
  • Storage

    Redshift, MySQL, PostgreSQL, SQL Server 2012, Data Lakes, Data Integration, AWS S3, Microsoft SQL Server, DB, PL/SQL, Essbase, SQL Server Integration Services (SSIS), Salesforce Commerce Cloud, Databases


  • Bachelor's Degree in Computer Science, German Studies
    1998 - 2002
    Amherst College - Amherst, MA, USA

To view more profiles

Join Toptal
Share it with others