Amr Saleh, Data Engineer and Developer in Auckland, New Zealand
Amr Saleh

Data Engineer and Developer in Auckland, New Zealand

Member since October 7, 2021
Amr is a data engineer with 8+ years of international experience. He has been a BI analyst at Vodafone, a consultant at Teradata, and worked with telecom operators, banks, and government organizations in Western Europe and the Middle East. Amr's skills include AWS Glue, Athena, CloudFormation, S3, and Snowflake; SQL, HQL, Python, and PySpark; Power BI, Tableau, and QuickSight; and Hadoop, NiFi, and Splunk. Amr has an MSc in data science and teaches enterprise and other data engineering classes.
Amr is now available for hire

Portfolio

  • Sprints
    AWS, Hadoop, SQL, Snowflake, Data Warehousing...
  • 2degrees Mobile Limited
    AWS, AWS Athena, AWS Glue, Snowflake, SQL, Netezza, Oracle, Data Warehousing...
  • Teradata
    Data Engineering, Data Analysis, Big Data, SQL, Data Warehousing, Data Pipelines

Experience

Location

Auckland, New Zealand

Availability

Part-time

Preferred Environment

SQL, AWS, Snowflake, AWS Glue, AWS Athena, Microsoft Power BI, Data Analysis, Big Data, Databases, Python, BigQuery, Redshift

The most amazing...

...experience was leading the data architecture, design, and implementation of Lyticshub—from the initial startup until Vodafone became the first customer.

Employment

  • Lead Trainer

    2020 - PRESENT
    Sprints
    • Trained five cohorts of professionals to enter the data engineering market.
    • Helped the Telecom Egypt technical team increase their data-related capabilities.
    • Led the team to design and deliver the curriculum for data engineering.
    Technologies: AWS, Hadoop, SQL, Snowflake, Data Warehousing, Hortonworks Data Platform (HDP), Redshift
  • Data Engineer

    2018 - PRESENT
    2degrees Mobile Limited
    • Built a data lake in AWS Cloud and Snowflake to substitute an on-premise Hadoop cluster and integrated it with Tableau and a Netezza data warehouse.
    • Designed and rolled out new data pipelines for big data and an enterprise data warehouse and maintained the existing Hadoop and Hortonworks big data environment and ETL pipelines.
    • Supported enterprise data warehouse processes and operations and delivered ad hoc SQL reports.
    • Integrated with different sources, including AWS S3, Oracle, IBM Netezza, SharePoint, and Active Directory.
    • Explored opportunities for new data avenues, such as Snowflake and Anaplan.
    Technologies: AWS, AWS Athena, AWS Glue, Snowflake, SQL, Netezza, Oracle, Data Warehousing, Data Lakes, Redshift, Data Pipelines
  • Data Consultant

    2017 - 2018
    Teradata
    • Designed and implemented ETL jobs and data management processes across different platforms.
    • Extracted insights from data and delivered reports to high-level decision-makers.
    • Automated data warehouse processes using Unified Data Integrator (a DevOps product) as part of a bank's digital transformation.
    Technologies: Data Engineering, Data Analysis, Big Data, SQL, Data Warehousing, Data Pipelines
  • Business Intelligence Analyst

    2014 - 2017
    Vodafone Group
    • Introduced IBM Infosphere Streams to perform real-time analytics on big data streams.
    • Designed, built, and tested ETL/ELT solutions using dimensional modeling and sound design, performance tuning, and optimization.
    • Implement and manage small to large-scale projects involving multiple systems with focus on performance tuning, optimization and availability to ensure efficiency in the environment.
    Technologies: SQL, AWS, ETL, Big Data, Data Pipelines

Experience

  • Data Lake in AWS and Snowflake

    Built a data lake in AWS Cloud and Snowflake to substitute an on-premise Hadoop cluster and integrate with Tableau and a Netezza Data Warehouse. I started the project from scratch, assessed providers (AWS, GCP, and Azure), and led a POC to compare processing and pricing. In the end, I implemented the pipelines in AWS Glue and Snowflake while using SAP data services to inject data from Netezza.

  • National Data Warehouse

    Served on a huge team of consultants from IBM, Teradata, and Microsoft to build Egypt's first data warehouse. I was actively involved in the following activities:
    • Designing and implementing a huge number of ETL jobs and data management processes across different platforms.
    • Sourcing and integrating 50+ different data sources from across the country to build a unified data warehouse.
    • Extracting insights from data and delivering reports to high-level decision-makers.

  • Intesa Sanpaolo Bank Data Platform Revamp

    Automated data warehouse processes using Unified Data Integrator as part of the bank's digital transformation. I also developed and upgraded several ETL solutions for the bank. This work was part of a Teradata consulting engagement.

  • Djezzy Postpaid Stream

    Built a new postpaid stream from scratch. This involved modeling and mapping existing data into models and tables and ETL development and implementation, which was done in parallel with another big data stream using the Hortonworks platform.

Skills

  • Languages

    SQL, Snowflake, Python, R
  • Tools

    Google Sheets, Microsoft Excel, AWS Glue, AWS Athena, Microsoft Power BI, Excel 2016, Tableau, Spark SQL, Apache Airflow, BigQuery
  • Paradigms

    ETL, Database Design, Business Intelligence (BI), Data Science, DevOps, Object-oriented Programming (OOP)
  • Platforms

    Amazon Web Services (AWS), Oracle, Hortonworks Data Platform (HDP), Google Cloud Platform (GCP), Apache Kafka, AWS IoT, Azure
  • Storage

    Data Pipelines, PostgreSQL, Netezza, Teradata, AWS S3, Oracle DBA, Database Architecture, SQL Server DBA, Azure SQL, Redshift, Data Lakes, Databases, Google Cloud Storage, NoSQL
  • Other

    Data Engineering, Data Analysis, Data Warehousing, Data Modeling, Pipelines, Data Analytics, Data Cleansing, Data Warehouse Design, Complex Data Analysis, BI Reporting, AWS, Big Data, MySQL DBA, Teradata DBA, Data Architecture, Big Data Architecture, Cloud Infrastructure, Forecasting, Financial Modeling, Data Visualization, Data Governance, Data Reporting, Financial Data Analytics, Data Quality Analysis, Cloud Storage, Infrastructure, Analytics, Data Analyst, Reporting, Predictive Analytics, Machine Learning, Deep Learning, Informatica, Entity-relationships Model (ERM), Software, Computer Science, Revenue & Expense Projections, GAAP, Directed Acrylic Graphs (DAG), Machine Learning Operations (MLOps)
  • Frameworks

    Apache Spark, Spark, Hadoop, .NET
  • Libraries/APIs

    Pandas, PySpark

Education

  • Master of Science Degree in Computer Engineering
    2018 - 2021
    Cairo University - Cairo, Egypt
  • Bachelor's Degree in Computer Engineering
    2009 - 2013
    Cairo University - Cairo, Egypt

Certifications

  • Data Analysis Professional Nanodegree
    MAY 2021 - PRESENT
    Udacity

To view more profiles

Join Toptal
Share it with others