Senior Data Engineer
2021 - PRESENTcarsales- Set up dbt projects for Redshift and Snowflake to enable both local executions using Docker and execution on dbt Cloud.
- Set up an Infrastructure as Code project for Snowflake using Terraform and CI/CD pipelines using Github Actions to enable automated and repeatable resource deployment.
- Proposed and built role-based access control in Snowflake.
- Designed and built various data pipelines to support data transfer and transformation in AWS and GCP.
- Built an extensible solution to monitor common failures and alert team members. This greatly improves system observability and increases team ownership.
Technologies: DBT, Snowflake, Terraform, GitHub Action, Docker, Python, Apache Airflow, Redshift, AWS ECS, AWS DynamoDB, Google BigQuery, Google Cloud StorageSenior Data Engineer
2021 - 2021AusNet Services- Designed and built reusable Azure Data Factory pipeline patterns, from Sharepoint to storage account and transformation on Databricks.
- Designed and built spatial data processing framework and practice on Databricks.
- Mapped out patterns of integrating Azure Machine Learning with data platform, including storage accounts, Azure Databricks, and Synapse dedicated SQL pool.
- Drafted a Synapse data warehouse design to integrate Azure Machine Learning and a Python application on Azure Kubernetes Services.
Technologies: Azure, Azure Data Factory, Databricks, Azure Synapse, Docker, Python, Azure Machine LearningData Platform Delivery Lead
2020 - 2021Officeworks- Led a team of five data and cloud engineers to deliver a data platform from scratch.
- Designed and implemented key components of a data platform.
- Reviewed all solutions to ensure architectural standards were met.
- Conducted design workshops with implementation and technology partners.
- Worked with internal teams to standardize and establish usage patterns of the platform.
- Ramped up data analytics team capabilities by building DevOps standards and cross-team knowledge sharing.
Technologies: AWS Athena, AWS DynamoDB, AWS RDS, Apache Airflow, Databricks, Snowflake, Jenkins, PythonPrincipal (Junior) Data Engineer
2018 - 2020McKinsey & Company- Delivered a large-scale machine learning project to automate the decision-making of plant operations at a mining client.
- Designed ETL pipeline architecture, integration strategy, and end-to-end monitoring solution for a multi-tier machine learning application.
- Led data management and ETL activities in multiple machine learning projects.
- Contributed to building firm-wide reusable assets, including application frameworks for data engineers and scientists.
Technologies: Python, Pandas, Docker, Spark ML, Scala, AWSData Analytics Manager
2017 - 2018EY- Single-handedly migrated 15 on-premise reports to data pipelines in Azure.
- Liaised with multiple finance subsidiaries to define a unified strategy for data consolidation and reporting based on SAP S/4HANA.
- Designed and led the development of an end-to-end data warehouse and reporting solution to consolidate financial statements of all four major subsidiaries for the first time at a client.
- Engaged in presales and won the bid proposal on a reporting transformation project.
Technologies: Azure Data Factory, Azure SQL Data Warehouse (SQL DW), C#, Python, SAP BW on HANA, AWS BatchSenior Data Warehouse Developer
2013 - 2017Australia Post- Led a team of five developers to design and build NIM, the largest data warehouse on SAP HANA in Australia.
- Built a custom data management framework in SAP HANA purely based on SQL. This provided a robust and simplified interface for developers and support.
- Continuously improved the performance of NIM to support 10 million data points per day and more than 50 reports.
Technologies: SAP BW, SAP HANA, Data Warehousing, ETL, SQLSenior BI Consultant
2010 - 2014Innogence Limited- Built a data warehousing and reporting solution for an SAP HR system, including employee, leave, and payroll.
- Developed a data warehousing and reporting solution for Australia's largest SAP logistics user.
- Created a data warehousing and reporting solution for an SAP sales and distribution system, including purchasing, sales, and delivery.
Technologies: SAP BW, SAP HANA, Data WarehousingBI Consultant
2007 - 2010ECENTA- Single-handedly built a data warehousing and reporting solution for an SAP CRM system, including customer interactions, service incidents, and customer data.
- Built heavily custom data extractors in ABAP for an SAP logistics system.
- Led two consultants to remotely support the ETL and reporting for an SAP finance system.
Technologies: SAP BW, SAP, ABAPSoftware Engineer
2003 - 2007IBM Singapore- Designed and built an IBM order status online site using Spring.
- Built the terms and conditions section of the IBM Expressed Management Services site.
- Supported a partner software lab on internal web projects.
Technologies: Java, JavaScript, IBM Db2, IBM WebSphere, Apache Tomcat