Wenlong Dong, Database Developer in Sydney, Australia
Wenlong Dong

Database Developer in Sydney, Australia

Member since January 21, 2022
Wenlong is a data engineer with five years of experience building data solutions and ETL solutions primarily in SQL and Python. He also has experience building data transformation and data analysis processes with R and Stata. Additionally, Wenlong is familiar with other tools, including PowerShell and Excel VBA development, and he has rich experience working with AWS cloud environments.
Wenlong is now available for hire

Portfolio

  • AstraZeneca
    Microsoft Power BI, Snowflake, Apache Airflow, DBT, Python 3, SQL, DBeaver...
  • IBM
    Python, Salesforce, SQL, IBM Cloud, GitHub, Data Analysis, Data Engineering...
  • University of New South Wales
    STATA, R, Excel VBA, Data Analysis, Dashboards, SQL, Data Engineering...

Experience

Location

Sydney, Australia

Availability

Part-time

Preferred Environment

PyCharm, Windows, SQL Server 2016, VS Code, SQL Server Integration Services (SSIS)

The most amazing...

...project I've independently designed and completed is a complex medical data validation platform with built-in validation rules using Excel VBA.

Employment

  • Data Engineer

    2022 - PRESENT
    AstraZeneca
    • Supported the analytics team for Microsoft Power BI reporting. Created a Power BI data flow and built report templates.
    • Developed and maintained a Snowflake-based data warehouse via DBT.
    • Administrated the Snowflake data warehouse and supported data users with troubleshooting issues.
    • Built and maintained Apache Airflow schedules. Completed BAU and troubleshooting tasks.
    Technologies: Microsoft Power BI, Snowflake, Apache Airflow, DBT, Python 3, SQL, DBeaver, Dataiku, Data Visualization, Data Building Tool (DBT)
  • Data Engineer

    2021 - 2022
    IBM
    • Participated as the primary data engineer in a Salesforce data migration project using Python, SQL, and Salesforce APEX.
    • Completed training and learning activities in Hadoop and MongoDB.
    • Worked in an Agile team with a CI/CD development method implemented.
    • Contributed as the primary data engineer for a data migration project with Python-based development.
    Technologies: Python, Salesforce, SQL, IBM Cloud, GitHub, Data Analysis, Data Engineering, SQL Server DBA, SQL Stored Procedures, ETL, Microsoft SQL Server, MongoDB, Database Administration (DBA), T-SQL, Docker, ETL Development, Data Warehousing, Data Architecture, Pandas, Data Modeling, ETL Testing, Database Modeling, Schemas, Microsoft Excel
  • Data Management Officer

    2020 - 2021
    University of New South Wales
    • Designed and developed a complete data solution with STATA, including data cleansing modules, data validation, and generating statistical reports.
    • Independently designed and developed a medical data collection and validation platform with Excel VBA.
    • Built an R-based model for data cleansing and producing academic reports.
    • Designed and developed SQL Server-based databases and relevant stored procedures.
    Technologies: STATA, R, Excel VBA, Data Analysis, Dashboards, SQL, Data Engineering, SQL Server DBA, SQL Stored Procedures, Microsoft SQL Server, Database Administration (DBA), T-SQL, ETL Development, Data Science, Business Intelligence (BI), Data Architecture, Pandas, Data Modeling, Database Modeling, Schemas, Microsoft Power BI, Reports, Reporting, Microsoft Excel
  • PowerShell Developer

    2019 - 2020
    Macquarie Bank
    • Designed and built SSIS solutions to create an ETL pipeline between the central data warehouse and a financial analysis platform.
    • Developed a file loading system and data processing jobs with Control-M job flows and PowerShell-based functions.
    • Contributed to the data lake project with a Hive data warehouse.
    Technologies: AWS, Windows PowerShell, SQL Server 2016, Control-M, SourceTree, Jira, SQL Server Integration Services (SSIS), JSON, YAML, SQL, Data Engineering, SQL Server DBA, SQL Stored Procedures, ETL, Microsoft SQL Server, T-SQL, ETL Development, Data Warehousing, Data Modeling, ETL Testing, Database Modeling, Schemas, Microsoft Excel
  • Data Developer

    2018 - 2019
    CoreLogic AU
    • Completed a massive data warehouse and data loading pipeline upgrade based on the business rules boost for Australian property data.
    • Supported all BAU processes for the entire data team and the property data platform, including troubleshooting SQL agent jobs, AWS environments, and SSIS packages.
    • Performed detailed analysis on geographic data items. Built a data loading and validation process for geographic data types in SQL Server.
    • Created dynamic SQL processes to optimize the SQL Server performance on giant data tables with more than one million records.
    Technologies: SQL Server 2016, BIML, XML, AWS, Jira, Confluence, Agile, Python, Unit Testing, SQL Server Integration Services (SSIS), Data Analysis, Dashboards, SQL, Data Engineering, SQL Server DBA, SQL Stored Procedures, ETL, Tableau, Microsoft SQL Server, T-SQL, ETL Development, Data Warehousing, Business Intelligence (BI), Pandas, Data Modeling, ETL Testing, Database Modeling, Schemas, Reports, Reporting, Microsoft Excel
  • SyteLine and System Support Officer

    2017 - 2018
    Le Mac Australia Group
    • Designed and maintained the Infor SyteLine ERP system.
    • Designed Crystal Reports and written relevant SQL Server stored procedures.
    • Analyzed production cost data and manipulated data calculation via SQL Server and Excel Pivot Table.
    Technologies: SQL Server 2016, Crystal Reports, Syteline ERP, C#, Pivot Tables, SQL Server DBA, SQL Stored Procedures, Microsoft SQL Server, Database Administration (DBA), T-SQL, Database Modeling, Schemas, Microsoft Excel

Experience

  • SalesForce Data Migration Project

    Oversaw, as part of a team, the migration of Salesforce data from the source environment to the target environment. The client wished to separate part of its business into an independent Salesforce environment.

    I set up the primary Python framework and built the initial version of the data extraction process—from Salesforce to Python DataFrame. I created the complete solution for duplicate records identification and merging dup records. I designed and developed the parallel computing process for comparing huge amounts of data as well as the grouping logic based on Graph theory. I also designed and built many SQL Server objects, including views, stored procedures, and functions.

  • Excel VBA-based Medical Data Validation Platform

    I designed and completed a medical data validation platform with Excel VBA independently. I implemented complex validation rules within the Excel modules so that users could have data automatically and entirely validated in Excel.

    This platform has been accepted and used for the data collection process worldwide.

  • ETL Solution to Update Existing Real Estate Data

    A property data ETL solution project aimed at manipulating existing ETL data flow to fit new government requirements. I was one of the primary SQL Server and SSIS solution developers and completed approximately 50% of the development tasks.

Skills

  • Languages

    Python 3, Python, SQL, Excel VBA, T-SQL, R, Snowflake, SAS, Java, C, YAML, BIML, XML, C#
  • Libraries/APIs

    Pandas, NetworkX
  • Tools

    VS Code, STATA, Jira, Confluence, Spreadsheets, Microsoft Excel, PyCharm, MATLAB, GitHub, Microsoft Power BI, Control-M, SourceTree, Crystal Reports, Tableau, Apache Airflow
  • Paradigms

    ETL, Data Science, Business Intelligence (BI), Agile, Unit Testing
  • Platforms

    Windows, Amazon Web Services (AWS), Salesforce, Docker, Azure, Azure PaaS, Azure IaaS, Salesforce SOQL/SOSL, Linux, Windows Server 2016, Amazon EC2, Dataiku
  • Storage

    SQL Server 2016, SQL Server Integration Services (SSIS), Databases, SQL Stored Procedures, SQL Server DBA, Microsoft SQL Server, Database Administration (DBA), Database Modeling, MySQL, Database Performance, JSON, Azure SQL, Azure Blobs, MongoDB, DBeaver
  • Other

    Data Engineering, Data Warehousing, Data Analysis, Data Cleaning, ETL Development, Data Modeling, ETL Testing, Schemas, Statistics, AWS, Dashboards, Data Architecture, Reports, Reporting, Data Building Tool (DBT), SWOT Analysis, MRP, Knowledge Management, Minitab, Calculus, Linear Algebra, IBM Cloud, IT Service Management (ITSM), Web Scraping, Syteline ERP, Pivot Tables, Multiprocessing, DBT, Data Visualization
  • Frameworks

    Windows PowerShell

Education

  • Graduate Certificate in Health Data Science
    2020 - 2021
    University of New South Wales - Sydney, NSW, Australia
  • Master's Degree in Information Systems
    2013 - 2014
    The University of Melbourne - Melbourne, Victoria, Australia
  • Bachelor's Degree in Logistics and Supply Chain Management
    2007 - 2011
    Huazhong University of Science and Technology - Wuhan, Hubei, China

Certifications

  • Microsoft Certified: Azure Fundamentals
    MARCH 2022 - PRESENT
    Microsoft
  • ITIL Foundation Certificate in IT Service Management
    MARCH 2017 - PRESENT
    AXELOS

To view more profiles

Join Toptal
Share it with others