Zhihao (Alex) Zhong
Verified Expert in Engineering
Data Engineer and Developer
Alex is a senior technical data engineer whose areas of expertise include ETL pipeline design, performance tuning, metadata management, data modeling, data warehousing, business intelligence design, data profiling, and data visualization. He has helped an eCommerce company, Loblaw Digital, architect a data pipeline from a microservice back end to a data warehouse in GCP as part of the data science platform. Alex manages more than 2,000 tables and moves more than 2TB of data daily in real time.
Portfolio
Experience
Availability
Preferred Environment
PyCharm
The most amazing...
...system I've architected and implemented from end to end is a real-time ELT pipeline in Google Cloud Platform (GCP) that reduced latency by 80%.
Work Experience
Senior Data Engineer
Loblaw Digital
- Worked as a senior member of the data engineering team to build a configurable real-time replication pipeline capable of handling large-scale and muti-source data in GCP.
- Developed and enhanced features in a custom Python BI framework for ETL/ELT batch jobs.
- Configured CI/CD pipelines and Docker images for code deployment in a project repository and GitLab.
- Managed two junior data engineers and performed as a solution architect. Conducted peer coding and code review as a senior engineer.
Cloud Data Engineer
Microsoft
- Migrated data to Azure Synapse from an on-premise Microsoft SQL Server system regarding data from the Azure marketing team.
- Collaborated in designing ETL pipelines with Azure stacks, including Azure Data Factory, a lift-and-shift SQL Server Integration Services (SSIS) package, Databricks, and Synapse.
- Implemented designs with two other senior engineers and migrated ETL and all downstream reports in Azure.
Database Administrator
Microsoft
- Constructed database architecture for partner investments and KPIs.
- Created and performed ETL with the SSIS package for data updates.
- Exported data to Microsoft Power BI for reporting using the direct query mode.
Data Engineer
CAMH
- Implemented an ETL and data warehousing solution with the Hadoop environment and helped the company migrate the data warehouse from IBM Db2 to Apache Hive for the neuroinformatics platform.
- Constructed data pipelines in Apache NiFi to perform real-time ETL and ELT.
- Created dashboards for research study in Spotfire.
Database Administrator
The Bargains Group
- Implemented and managed the CRM system and Microsoft SQL Server to produce reports for marketing and sales.
- Updated the opt-out and hard-bounce email list in Microsoft SQL Server and generated a targeting email list for marketing email pieces.
- Set up and tested automatic email campaigns in CRM.
- Reviewed and tested the connection between the website back end and the CRM system.
Experience
Dataflow Architecture in GCP
PySpark Jobs in GCP
Migration of a Data Pipeline and Data Warehouse to Azure Cloud
Education
Bachelor of Mathematics in Actuarial Science and Statistics
University of Waterloo - Waterloo, Ontario
Certifications
Professional Machine Learning Engineer
Google Cloud
GCP Professional Data Engineer
Google Cloud
Microsoft Certified: Azure Data Engineer Associate
Microsoft
Skills
Libraries/APIs
PySpark
Tools
PyCharm, Cloud Dataflow, BigQuery, Apache Beam, Google Cloud Composer, Apache NiFi, SQL Server BI, Microsoft Power BI, Tableau, Google Cloud Dataproc, GitLab, Spotfire
Languages
SQL, Python 3, Java
Storage
Relational Databases, HDFS, Apache Hive, IBM Db2, SQL Server Integration Services (SSIS), Database Modeling, Microsoft SQL Server
Paradigms
ETL
Platforms
Google Cloud Platform (GCP), Azure, Azure Synapse, Databricks, Docker
Frameworks
Hadoop
Other
Google BigQuery, Pub/Sub, Dataproc, Azure Data Factory, Azure Databricks, Microsoft Data Transformation Services (now SSIS), Azure Stream Analytics, Azure Data Lake, ELT, Performance Tuning, Data Modeling, Data Warehousing, Data Profiling, Data Visualization, Data Engineering
How to Work with Toptal
Toptal matches you directly with global industry experts from our network in hours—not weeks or months.
Share your needs
Choose your talent
Start your risk-free talent trial
Top talent is in high demand.
Start hiring