Attila Tozser, Developer in Munich, Bavaria, Germany
Attila is available for hire
Hire Attila

Attila Tozser

Verified Expert  in Engineering

Data Engineer and Developer

Location
Munich, Bavaria, Germany
Toptal Member Since
September 29, 2021

Attila is a senior data engineer with 14 years of experience, initially specializing in data warehousing and then big data and machine learning. He has worked on large-scale use cases with over 100TB of data and excelled in challenging environments. Attila's industry experience is backed by a master's degree in technical informatics and certifications in Azure data science, Databricks, and DevOps, and AI engineering.

Portfolio

Freelance
MapR, Azure, Azure Data Lake, Big Data, Big Data Architecture, Hadoop, Ansible...
E.ON
Azure Machine Learning, Azure Databricks, Azure Data Factory...
Amadeus
MapR, MongoDB, Puppet, DevOps, NoSQL, Big Data

Experience

Availability

Part-time

Preferred Environment

Azure, Azure DevOps, Azure Data Factory, Azure Databricks

The most amazing...

...things I've achieved were becoming a MongoDB Master in 2016 and receiving a MongoDB Innovation Award in 2015 for building a real-time analytics engine.

Work Experience

Senior Data Engineer

2019 - PRESENT
Freelance
  • Co-developed a large-scale data lake infrastructure in Azure cloud while working as an Azure big data and DevOps consultant.
  • Supported several implementation and architecture projects (DevOps for big data) while serving as a technical consultant.
  • Further developed a several-hundred terabyte MapR Hadoop environment while working as a big data architect.
  • Optimized several processes from dozens of hours of runtime to minutes. This included DevOps automation of maintenance tasks in Ansible and Puppet.
  • Served as a senior data scientist for a computer vision object recognition and identification application in the pharmaceutical industry. The technology stack included Google Cloud AI Platform services, GKE, Exoscale, TensorFlow, and OpenCV.
Technologies: MapR, Azure, Azure Data Lake, Big Data, Big Data Architecture, Hadoop, Ansible, Puppet, Data Engineering, Azure Databricks, Azure DevOps

Manager, Data Engineering

2017 - 2019
E.ON
  • Managed a multi-cloud, big data environment on Azure Data Lake Storage Gen 1 and Azure Databricks, including the industrialization of POC applications for serverless and service-based infrastructure.
  • Maintained 20+ machine learning applications in production with a team of five data and machine learning engineers, providing predictive services for global holdings, predictive maintenance, trading forecasts, and many more.
  • Built the team of engineers from zero to ten. Managed the hiring process, including the selection of candidates and technical interviews, and played a key role in people development.
Technologies: Azure Machine Learning, Azure Databricks, Azure Data Factory, Google Cloud Platform (GCP), Azure DevOps, Big Data, Azure Data Lake, Serverless Architecture, Team Management, Technical Hiring, Data Engineering, Azure

Senior Technical Manager

2016 - 2017
Amadeus
  • Served as a consultant for problem analysis and incident recovery for the in-scope infrastructure and as the main technical contact for internal and vendor support organizations.
  • Designed DevOps concepts and enterprise integration of NoSQL big data solutions.
  • Trained staff and supported the creation of documentation.
Technologies: MapR, MongoDB, Puppet, DevOps, NoSQL, Big Data

System Engineer, Big Data

2013 - 2016
Amadeus
  • Designed and deployed large-scale MongoDB, Couchbase, and Hadoop clusters, involving 100TB+ scale with tens of clusters and hundreds of shards.
  • Managed big data on hundreds of servers with end-to-end automation using Puppet, Ansible, and MongoDB Ops Manager.
  • Participated in communities and conferences and organized a local meetup group for MongoDB.
Technologies: MapR, MongoDB, Puppet, Couchbase, Hadoop, Big Data, Ansible

Developer

2011 - 2013
Phonedeck GmbH
  • Participated in the development in one of the first mobile push notification services.
  • Built data analytics capabilities business process enhancement, AWS cost optimization, user acquisition optimization, and funnel analytics.
  • Managed a fully scalable automated infrastructure on AWS for 10,000+ real-time connected mobile devices and several hundred thousand active users.
Technologies: Amazon Web Services (AWS), AWS Elastic Beanstalk, Amazon Elastic MapReduce (EMR), Data Analytics

System Administrator

2009 - 2011
Budapest Corvinus University ISZK
  • Managed the server infrastructure of the central IT services of the university.
  • Acted as the technical specialist in the public procurement process of new hardware and software infrastructure items.
  • Managed the tape backup infrastructure based on IBM technology.
Technologies: Solaris, Bash, DHCP, IBM Tivoli Storage Manager

Data Warehouse Developer

2009 - 2011
MI Software kft.
  • Developed a POC project to migrate a TB-scale data warehouse to open-source technologies.
  • Implemented several reports in Microsoft BI technologies for a large telecommunication company.
  • Optimized SSIS ETL processes to run faster and cheaper for various clients.
Technologies: Microsoft SQL Server, SQL Server Reporting Services (SSRS), SSAS, Microsoft Data Transformation Services (now SSIS), Pentaho, Data Warehousing, Database Migration, Proof of Concept (POC), Open-source Software (OSS), ETL

Developer

2007 - 2009
IQSYS Zrt.
  • Developed SAS-based ETL pipelines for several Hungarian banks.
  • Participated in the development of compliance reports for Hungarian authorities at several Hungarian Banks.
  • Developed automation to copy SAS-based data warehouse environments to support different development scenarios and testing.
Technologies: SAS, ETL, Data Warehousing

Data Lake Migration (Azure Data Lake Storage Gen 1 to Gen 2)

Co-developed a large-scale Azure Data Lake infrastructure while supporting several projects as a technical consultant for implementation and architecture (DevOps for big data). Also served as an Azure big data and DevOps consultant.

Key Activities
• Designed and implemented Azure DevOps tooling for Azure Data Factory and Azure Databricks.
• Conceptualized and implemented Azure Data Lake Storage Gen 2 security. - Developed meta and master data management tooling.
• Developed Spark on Azure Databricks.
• Migrated Azure Data Lake Storage Gen 1 to Gen 2.

Tech Stack
Azure DevOps, Azure Data Lake Storage Gen 1 and Gen2, Azure Data Factory, Azure Databricks, Microsoft SQL Server, PowerBI
2003 - 2010

Master's Degree in Technical Informatics

Budapest University of Technology and Economics - Budapest, Hungary

MAY 2021 - PRESENT

Azure Data Scientist Associate

Microsoft

MAY 2021 - PRESENT

Azure AI Engineer Associate

Microsoft

APRIL 2020 - APRIL 2022

Azure DevOps Engineer Expert

Microsoft

MARCH 2020 - MARCH 2022

Microsoft Azure Administrator Associate

Microsoft

MARCH 2019 - PRESENT

AWS Solutions Architect – Associate

Amazon Web Services

JANUARY 2019 - PRESENT

Databricks Certified Developer for Apache Spark 2.x for Scala

Databricks

DECEMBER 2017 - PRESENT

Microsoft Certified Solutions Expert: (Azure) Cloud Platform and Infrastructure

Microsoft

MAY 2014 - PRESENT

MongoDB Certified Developer, Associate Level

MongoDB University

JANUARY 2014 - PRESENT

Cloudera Certified Administrator for Apache Hadoop (CDH4)

Cloudera

DECEMBER 2013 - PRESENT

MongoDB Certified Database Administrator, Associate Level

MongoDB University

Languages

Python, Bash, SAS

Frameworks

Spark, Hadoop

Paradigms

ETL, Azure DevOps, DevOps, Serverless Architecture

Platforms

MapR, Azure, Google Cloud Platform (GCP), Amazon Web Services (AWS), AWS Elastic Beanstalk, Pentaho, Solaris

Storage

NoSQL, Azure Blobs, MongoDB, Azure SQL, Couchbase, Microsoft SQL Server, SQL Server Reporting Services (SSRS), Database Migration

Other

Azure Data Factory, Azure Data Lake, Big Data, Data Engineering, Azure Databricks, Data Warehousing, Big Data Architecture, Team Management, Technical Hiring, Microsoft Data Transformation Services (now SSIS), DHCP, IBM Tivoli Storage Manager, Data Analytics, Proof of Concept (POC), Open-source Software (OSS)

Tools

Azure Machine Learning, Ansible, Terraform, Puppet, Amazon Elastic MapReduce (EMR), SSAS, MongoDB Shell

Libraries/APIs

Azure Cognitive Services

Collaboration That Works

How to Work with Toptal

Toptal matches you directly with global industry experts from our network in hours—not weeks or months.

1

Share your needs

Discuss your requirements and refine your scope in a call with a Toptal domain expert.
2

Choose your talent

Get a short list of expertly matched talent within 24 hours to review, interview, and choose from.
3

Start your risk-free talent trial

Work with your chosen talent on a trial basis for up to two weeks. Pay only if you decide to hire them.

Top talent is in high demand.

Start hiring