
Alex Clark
Verified Expert in Engineering
Data Engineer and Developer
Spokane, WA, United States
Toptal member since November 14, 2022
Alex is a senior data engineer with 10+ years of experience designing and building scalable data pipelines and analytics platforms. He specializes in data modeling, distributed systems, and cloud technologies such as AWS, delivering reliable, high-quality datasets that enable data-driven product and business decisions.
Experience
- Data Analytics - 10 years
- Python - 10 years
- SQL - 10 years
- Data Pipelines - 10 years
- Data Architecture - 10 years
- Hadoop - 5 years
- Amazon Elastic MapReduce (EMR) - 5 years
- PySpark - 4 years
Preferred Environment
Python 3, Amazon Elastic MapReduce (EMR), Apache Hive, Amazon DynamoDB, SQL, Amazon Web Services (AWS), Amazon Redshift, Apache Airflow, MongoDB, Apache Spark
The most amazing...
...system I built enabled platform-wide cost attribution, bringing transparency to infrastructure usage across teams.
Work Experience
Freelance Data Engineer
ProjectPro
- Designed and implemented real-time and batch data pipelines using Python and AWS services, integrating data from APIs, application databases, and third-party sources.
- Built analytics-ready datasets and data models to support reporting, dashboards, and metric-driven decision-making.
- Developed interactive dashboards and visualization layers to surface actionable insights for business stakeholders.
- Designed and maintained ETL workflows and automation to ensure data reliability, consistency, and scalability.
Senior Data Engineer
Meta
- Designed and implemented scalable data pipelines and analytics datasets to support product decision-making during a large-scale platform migration, enabling real-time visibility into key user and system metrics.
- Led deep-dive analyses into product and user behavior trends, identifying root causes and enabling data-driven decisions across product and engineering teams.
- Partnered with cross-functional stakeholders to define key metrics, improve data quality, and establish best practices for analytics workflows.
- Influenced data standards and instrumentation practices to improve consistency, reliability, and trust in product analytics.
- Led a compute optimization initiative, reducing infrastructure usage by 10% and deprecating 50% of stale pipelines.
Data Engineer
D2 Nova
- Optimized SQL queries and data storage strategies (partitioning, indexing), improving performance by up to 99% and significantly reducing compute costs.
- Designed and implemented a MongoDB database with a lightweight Flask-based front end. Developed a custom, low-latency search solution supporting multilingual partial text matching, enhancing usability and performance.
- Delivered robust ETL solutions and improved analytical workflows for clients across multiple domains.
Data Engineer
Amazon.com
- Built and maintained large-scale data pipelines and analytics datasets supporting product insights, customer behavior analysis, and business reporting.
- Processed large-scale clickstream data to generate behavioral insights, enabling product and marketing teams to better understand user engagement and conversion.
- Developed internal API-based billing systems, analytics datasets, and dashboards to track product performance and support financial and operational reporting.
- Provided subject matter expertise on platform data and collaborated with cross-functional teams to deliver business-critical solutions.
- Partnered with product, analytics, and business teams to define metrics, design data models, and deliver actionable insights.
- Built attribution models linking customer behavior to revenue and retention, establishing foundational metrics used for business and product decision-making.
- Managed relational and large-scale data systems for analytics and reporting.
- Automated log parsing and reporting pipelines to provide timely, reliable business insights.
Business Systems Analyst
Liberty Mutual Insurance
- Created an automated process to build and maintain 24 data sets in a centralized location.
- Delivered presentations to educate SAS users about data sets and their analytical potential.
- Facilitated biweekly meetings with stakeholders to improve the usability and integrity of data sets.
- Leveraged SAS and Teradata to efficiently execute numerous ad hoc requests.
- Developed SQL queries and VBA macros to streamline monthly reporting.
- Built a Microsoft Access database and VBA scripts to automate the production of a weekly status report.
Data Analyst
Efinancial
- Presented complex analyses to upper management, driving high-level decision-making.
- Collaborated with the analytics team to develop a calling strategy that led to a 50% increase in sales.
- Automated the production of weekly scorecards and reports using SQL and VBA.
- Wrote SQL queries and performed data analysis to support the development of weekly and monthly goals.
Experience
Page-level Metrics & Multi-touch Attribution
Internal Billing System
Content Platform ETL & Analytics Pipeline
Education
Master's Degree in Business Analytics & Data Science
Bentley University - Waltham, MA, USA
Bachelor's Degree in Accounting
Central Washington University - Ellensburg, WA, USA
Skills
Libraries/APIs
PySpark, Pandas, NumPy
Tools
Amazon Athena, Amazon CloudWatch, Amazon Elastic MapReduce (EMR), AWS Glue, Cron, Boto, AWS Step Functions, Amazon Simple Queue Service (SQS), PyCharm, Amazon QuickSight, GitHub, Microsoft Power BI, AWS CloudFormation, Apache Airflow, AWS Cloud Development Kit (CDK)
Languages
SQL, Python, Stored Procedure, SAS, Scala
Paradigms
ETL, Business Intelligence (BI)
Storage
Apache Hive, Databases, PostgreSQL, Redshift, RDBMS, JSON, Database Administration (DBA), Relational Databases, MySQL, Data Pipelines, Teradata, Amazon S3 (AWS S3), Amazon DynamoDB, Microsoft SQL Server, MongoDB, NoSQL, Amazon Aurora
Frameworks
Hadoop, Spark, Flask, Apache Spark
Platforms
AWS Lambda, Linux, Oracle, Amazon Web Services (AWS), Amazon EC2, Apache Flink
Other
Information Systems, Data Architecture, EMR, Data Analytics, Datasets, Data Engineering, Data Analysis, Data Cleansing, Data Profiling, CSV File Processing, Data Modeling, Data, Metrics, Big Data, Pipelines, Conda, PIP, APIs, Data Warehousing, Big Data Architecture, BI Reporting, Analytics, Business Requirements, Data Visualization, Scripting, Amazon RDS, BI Reports, Dashboards, Database Optimization, Amazon Redshift, Machine Learning, Statistics, Time Series Analysis, Optimization, Attribution Modeling, Marketing Attribution, Web Analytics, IT Project Management, User-defined Functions (UDF), Dashboard Design, Predictive Modeling, Orchestration, Key Performance Indicators (KPIs), Data Quality, Data Orchestration