Marc Matt
Verified Expert in Engineering
Data Engineer and Developer
Hamburg, Germany
Toptal member since January 5, 2021
Marc is a data engineer with a passion for data and 15+ years of experience in leading teams and building data platforms focusing on the information technology, real estate, and services industries. He created a Python-based AVRO schema generator that makes parts of a scheme reusable. Marc excels with automation, integrations, analysis, the building of models, statistics, big data, CI/CD pipelines, and data modeling.
Portfolio
Experience
Availability
Preferred Environment
Apache Airflow, Tableau Server, Tableau, SQL, Pandas, Python, Apache Beam, Git, Linux
The most amazing...
...app I've developed provides pose estimation data in real-time to help optimize customer fitness goals.
Work Experience
Data Engineer
RTL Deutschland GmbH
- Set up AlloyDB to serve media consumption for recommendations in real time.
- Optimized the recommender ranking for the streaming platform.
- Built microservices for serving customer recommendations in real time.
Senior Data Analyst
Bold Metrics Inc.
- Created a template for ad hoc reporting for all clients.
- Designed and implemented streaming data entry into the data warehouse using Amazon Kinesis, Lambda, and Python.
- Optimized and standardized transformations in the Redshift data warehouse.
Data Engineer
MediaMarktSaturn Retail Group
- Established a supply chain monitoring system for the national distribution centers.
- Implemented APIs to all logistic services providers and transformed them for use in companywide reporting.
- Set up a real-time order tracking system using Apache NiFi on GKE.
Cloud Data Engineer and Architect
Spin (Tier Mobility) - Main
- Designed and established an MLOps workflow with Google Vertex AI.
- Operationalized ML models for real-time use cases.
- Prepared the migration of DWH from BigQuery to Snowflake.
- Built a tool for operational support of traffic violation incidents.
ETL Engineer
Food Marketing Company
- Parsed JSON data in Talend and loaded it into Redshift.
- Integrated data from web APIs with Talend into Redshift.
- Transformed customer data using Talend and loaded it into Salesforce.
Data Engineer
Janus
- Translated legacy ETL pipelines to scalable AWS Glue jobs.
- Automated resource deployment using AWS CloudFormation.
- Designed and built the framework in PySpark to make adding future pipelines easier.
Senior Data Engineer
Emma
- Designed a new data entry API for the data platform to enable streaming analytics.
- Set up the binlog streaming process and parsing of events in real time using Kinesis, Lambda, and Kinesis Data Firehose.
- Optimized the data load in Redshift by analyzing queries and tables to add optimized sort and distkeys.
Data Specialist
Ear-Reality GmbH
- Developed a data lake based on Kinesis and Athena, including embedded reporting in Metabase.
- Shifted a production system to a serverless scalable architecture.
- Automated load testing of an application using Python and Locust.io.
Senior Data Engineer
Engel & Völkers
- Designed and built a data platform, including tool selection and data modeling.
- Built a TensorFlow model to predict property values in a real-time environment.
- Implemented CI/CD pipelines to automatically deploy all features of the data platform.
Head of Data Engineering | Machine Learning
Surf Media
- Led a team of six and was responsible for their personal development.
- Designed big data systems and data lakes including tool selection and data modeling.
- Designed data pipelines and model selection for the development of recommendation engines and fraud. The recognition systems work in a real-time environment.
- Created the technology roadmap. Oversaw the advancement of all affected data systems.
Business Intelligence Analyst
Surf Media
- Designed, developed, and operated a DWH for the company group consisting of five companies.
- Developed a statistical model for predicting orders.
- Analyzed customers to understand how best to optimize revenue in a social network.
Database Consultant
EOS Information Services, GmbH.
- Designed, developed, and operated a DWH for a Decision Engine used in risk management.
- Designed processes for risk management.
- Completed conception and development of a process for managing addresses using Perl and Uniserv.
Datawarehousing Consultant
Key-Work Consulting, GmbH.
- Migrated the sales reporting for a mailorder company.
- Developed a statistical model to optimize sales planning of a mail order company.
- Built a statistical model for a dynamic shipping schedule.
Database Management
Coxulto Marketing Solutions, GmbH.
- Defined and selected target groups for marketing campaigns.
- Completed affinity analysis for the complete customer base.
- Administered and operated the address database including duplicate termination.
Lead of Business Intelligence Consumer Products
1&1 Internet A
- Coordinated and prioritized all tasks of the Business Intelligence team.
- Designed and developed KPI reports for the board of directors.
- Analyzed customer structures and built a model for churn prediction.
Business Intelligence Analyst
1&1 Internet AG
- Designed and developed an automated reporting system for customer and contract inventory, as well as internet usage and customer behavior.
- Integrated the customer usage data of the company websites into the DWH.
- Coordinated all tasks between management and development departments.
- Analyzed all new and existing customer campaigns for effectiveness.
Experience
AVRO Schema Generator
https://gitlab.com/datascientists.info/avro-generatorIf certain data structures are used in several schemas, this tool provides the ability only to define these structures once and then reuse them over several schemas.
Evalution of Property Value
Design and Set-up of Data Platform
Certifications
Google Cloud Certified - Professional Data Engineer
Skills
Libraries/APIs
Pandas, PySpark, TensorFlow, REST APIs, Node.js
Tools
BigQuery, Apache HAWQ, Apache Avro, Git, Apache Beam, Tableau, Apache Airflow, Jenkins, Apache NiFi, RabbitMQ, Microsoft Excel, Terraform, Amazon Elastic MapReduce (EMR), Amazon EKS, AWS IAM, Google Kubernetes Engine (GKE), Talend ETL, Amazon Athena, AWS CloudFormation, Amazon Redshift Spectrum, Matillion ETL for Redshift, AWS Fargate, AWS Glue, GitLab CI/CD
Languages
Python, SQL, Perl, Java, XML, Snowflake, Python 3, TypeScript
Paradigms
ETL, Business Intelligence (BI), DevOps, Microservices
Platforms
Amazon Web Services (AWS), Cloud Run, Linux, Docker, Talend, Hortonworks Data Platform (HDP), Oracle, AWS Lambda, Google Cloud Platform (GCP), Kubernetes, AWS Elastic Beanstalk, Kubeflow
Storage
MySQL, Google Cloud, Database Modeling, Redshift, Databases, Database Architecture, SQL Server 2010, Data Pipelines, Amazon S3 (AWS S3), PostgreSQL, Google Cloud SQL, Data Lakes, Apache Hive, HDFS, NoSQL, Amazon Aurora, JSON
Frameworks
Spark, Apache Spark, Flask, Django, Hadoop, Serverless Framework
Other
Data Visualization, Data Analysis, Data Architecture, Data Engineering, Data Warehousing, Data Modeling, Data Warehouse Design, Data Reporting, Database Schema Design, Data Management, Google Cloud Functions, APIs, Data Wrangling, ETL Tools, Tableau Server, Google BigQuery, Data Profiling, Data Science, Google Data Studio, Fivetran, Serverless, Scaling, Dashboards, Amazon Kinesis, Parquet, Cloud Architecture, Big Data, Architecture, Big Data Architecture, Machine Learning Operations (MLOps), CI/CD Pipelines, Cloud Security, Data Build Tool (dbt), Cloud Tasks, Azure Databricks, Argo CD, FastAPI
How to Work with Toptal
Toptal matches you directly with global industry experts from our network in hours—not weeks or months.
Share your needs
Choose your talent
Start your risk-free talent trial
Top talent is in high demand.
Start hiring