
William Leslie Theisinger
Verified Expert in Engineering
Data Architect and Developer
Los Angeles, CA, United States
Toptal member since October 23, 2024
William has deep experience in all aspects of data-related initiatives, from data design, architecture, and development to data management, strategy, and product and project management. He has designed, architected, and implemented data platforms and re-engineered and refactored existing data systems. William is fully autonomous and can work within teams or as an individual contributor. He also has experience with GCP, AWS, and on-premises and remote data centers.
Experience
- Data Warehousing - 20 years
- Data Engineering - 20 years
- Data Warehouse Design - 20 years
- ETL - 20 years
- Data Architecture - 20 years
- SQL - 20 years
- Data Modeling - 15 years
- CI/CD Pipelines - 7 years
Preferred Environment
Visual Studio Code (VS Code), Slack, GitHub, GitHub Actions, macOS, Google Cloud Platform (GCP), Amazon Web Services (AWS), Kubernetes, Python
The most amazing thing I've developed is a full event-based modern data stack that maintained 99.99% uptime.
Work Experience
Data Architect | Data Engineer
Paramount Commerce
- Migrated the core data processing stack from AWS to GCP.
- Completed a full CI/CD implementation using Airflow (Composer), dbt, Docker, Terraform, and GitHub Actions.
- Created documentation for the run book, handoff, and training.
Data Management Consultant
FloSports
- Evaluated the company's data management maturity and recommended a data governance program, including the sequence, timing, and milestones for progressing to full data maturity.
- Evaluated numerous data governance vendors, both those placed in various Gartner quadrants and newcomers to the space, and recommended implementing different governance capabilities using AWS services.
- Provided detailed recommendations for implementing any chosen vendor and tradeoffs versus homebuilt alternatives.
VP, Data Engineering
System1
- Collaborated with the team to create new APIs that supported external partner reporting and data.
- Initiated the migration to an event-based system to leverage in-stream processing on Kafka.
- Created plans to ensure GDPR, CCPA, CPRA, and PII/non-PII compliance across all systems and data platforms.
- Increased productivity by 30% by establishing a project planning process that focused the team on throughput.
- Managed and maintained the data platform through numerous acquisitions.
Data Architect Consultant
Procore
- Oversaw the design, architecture, and scaling of a new enterprise data platform, including development and engineering tasks, while owning the product manager and product owner roles in the Agile development and requirements process.
- Collaborated with the internal team to develop a platform on AWS, leveraging EKS, EMR, Spark, and Airflow for workflow management, combined with various serverless components to manage orchestration.
- Evaluated internal talent at the request of management and provided input to senior leadership, resulting in a team staffed for the impending IPO.
CTO
LERETA
- Developed the architecture and project plan to move from an on-premises data center to a hybrid model combining cloud (GCP) and a colocation facility.
- Tasked with digital transformation, platform modernization, and consolidation that enabled adaptability to changing market conditions at a reduced cost. Supported over 30% YoY growth.
- Designed and architected a complete plan to migrate from the legacy platform of IBM Mainframe VSE, VFP, Delphi, MSSQL13, COBOL, Natural, DYL280, and JCL to Google Cloud services, Go, JavaScript, Vue, Node.js, and Python.
- Served on the executive team reporting to the CEO. Worked closely with the board of directors (Tarsadia Investments), regularly presenting the technology roadmap, product management vision, project plans, and budget.
- Managed a complete technology P&L: 15 million in operating expenses (OPEX) and varying capital expenditures (CAPEX).
VP, Data Platform Engineering & Data Science
YP
- Designed, architected, and implemented a system re-platforming effort to move from a Microsoft data platform to a stack of lightweight event-based data collection, Hadoop processing, and Vertica.
- Handled the design, implementation, and maintenance of 1,100 servers across 12 clusters that housed seven petabytes, maintained 99.99% uptime, and never missed an SLA, ensuring 24/7 data availability.
- Delivered a multichannel data collection framework in Kafka, supporting streams of up to 20,000 messages per second.
- Processed 3+ billion daily transactions on the platform, which generated 3+ terabytes of data per day.
Manager, Strategic Data Services
Yahoo!
- Designed, developed, tested, and deployed several new table structures and ETL processes using Unix shell scripting, Perl, and SQL to support the business's data needs.
- Integrated a SOAP-based XML API with the business operations group to provide that team with a scalable reporting solution.
- Managed the reporting needs for Yahoo Search Marketing, which required communication company-wide. Worked very closely with product marketing, product management, sales, business operations, and technology teams in the US and abroad.
Experience
Prediction Platform for Real Estate Transactions
ETL Platform Using Spark and EMR
Education
Bachelor's Degree in Economics
California State University, Los Angeles - Los Angeles, CA, USA
Skills
Libraries/APIs
PySpark, Scikit-learn, Pandas
Tools
GitHub, Slack, Google Cloud Composer, Apache Airflow, Amazon Elastic MapReduce (EMR), Informatica ETL, Talend ETL, Looker, Amazon EKS, Cloud Dataflow, Terraform
Languages
SQL, Python, Snowflake, Java
Paradigms
Business Intelligence (BI), ETL, Agile Project Management
Platforms
Visual Studio Code (VS Code), Google Cloud Platform (GCP), macOS, Amazon Web Services (AWS), Kubernetes, Apache Kafka, Oracle, Unix, Docker
Storage
Data Integration, Data Pipelines, Column-oriented DBMS, Distributed Databases, Vertica, PostgreSQL, Dell Boomi, Elasticsearch
Frameworks
Apache Spark, Hadoop
Other
Data Architecture, Data Management, Big Data, Data Engineering, Software Development Lifecycle (SDLC), Data Warehouse Design, Data Product Manager, Business Analysis, Data Modeling, Data Warehousing, Analytical Thinking, Technical Leadership, Data, Data Analysis, GitHub Actions, Google BigQuery, Data Build Tool (dbt), Analytics, CI/CD Pipelines, Data Governance, Data Quality Governance, Business Requirements, Consumer Data Platform, Machine Learning Operations (MLOps), Statistics, Economics, Data Collection, BI Reporting, Data Processing