Attila Tozser
Verified Expert in Engineering
Data Engineer and Developer
Attila is a senior data engineer with 14 years of experience, initially specializing in data warehousing and then big data and machine learning. He has worked on large-scale use cases with over 100TB of data and excelled in challenging environments. Attila's industry experience is backed by a master's degree in technical informatics and certifications in Azure data science, Databricks, and DevOps, and AI engineering.
Portfolio
Experience
Availability
Preferred Environment
Azure, Azure DevOps, Azure Data Factory, Azure Databricks
The most amazing...
...things I've achieved were becoming a MongoDB Master in 2016 and receiving a MongoDB Innovation Award in 2015 for building a real-time analytics engine.
Work Experience
Senior Data Engineer
Freelance
- Co-developed a large-scale data lake infrastructure in Azure cloud while working as an Azure big data and DevOps consultant.
- Supported several implementation and architecture projects (DevOps for big data) while serving as a technical consultant.
- Further developed a several-hundred terabyte MapR Hadoop environment while working as a big data architect.
- Optimized several processes from dozens of hours of runtime to minutes. This included DevOps automation of maintenance tasks in Ansible and Puppet.
- Served as a senior data scientist for a computer vision object recognition and identification application in the pharmaceutical industry. The technology stack included Google Cloud AI Platform services, GKE, Exoscale, TensorFlow, and OpenCV.
Manager, Data Engineering
E.ON
- Managed a multi-cloud, big data environment on Azure Data Lake Storage Gen 1 and Azure Databricks, including the industrialization of POC applications for serverless and service-based infrastructure.
- Maintained 20+ machine learning applications in production with a team of five data and machine learning engineers, providing predictive services for global holdings, predictive maintenance, trading forecasts, and many more.
- Built the team of engineers from zero to ten. Managed the hiring process, including the selection of candidates and technical interviews, and played a key role in people development.
Senior Technical Manager
Amadeus
- Served as a consultant for problem analysis and incident recovery for the in-scope infrastructure and as the main technical contact for internal and vendor support organizations.
- Designed DevOps concepts and enterprise integration of NoSQL big data solutions.
- Trained staff and supported the creation of documentation.
System Engineer, Big Data
Amadeus
- Designed and deployed large-scale MongoDB, Couchbase, and Hadoop clusters, involving 100TB+ scale with tens of clusters and hundreds of shards.
- Managed big data on hundreds of servers with end-to-end automation using Puppet, Ansible, and MongoDB Ops Manager.
- Participated in communities and conferences and organized a local meetup group for MongoDB.
Developer
Phonedeck GmbH
- Participated in the development in one of the first mobile push notification services.
- Built data analytics capabilities business process enhancement, AWS cost optimization, user acquisition optimization, and funnel analytics.
- Managed a fully scalable automated infrastructure on AWS for 10,000+ real-time connected mobile devices and several hundred thousand active users.
System Administrator
Budapest Corvinus University ISZK
- Managed the server infrastructure of the central IT services of the university.
- Acted as the technical specialist in the public procurement process of new hardware and software infrastructure items.
- Managed the tape backup infrastructure based on IBM technology.
Data Warehouse Developer
MI Software kft.
- Developed a POC project to migrate a TB-scale data warehouse to open-source technologies.
- Implemented several reports in Microsoft BI technologies for a large telecommunication company.
- Optimized SSIS ETL processes to run faster and cheaper for various clients.
Developer
IQSYS Zrt.
- Developed SAS-based ETL pipelines for several Hungarian banks.
- Participated in the development of compliance reports for Hungarian authorities at several Hungarian Banks.
- Developed automation to copy SAS-based data warehouse environments to support different development scenarios and testing.
Experience
Data Lake Migration (Azure Data Lake Storage Gen 1 to Gen 2)
Key Activities
• Designed and implemented Azure DevOps tooling for Azure Data Factory and Azure Databricks.
• Conceptualized and implemented Azure Data Lake Storage Gen 2 security. - Developed meta and master data management tooling.
• Developed Spark on Azure Databricks.
• Migrated Azure Data Lake Storage Gen 1 to Gen 2.
Tech Stack
Azure DevOps, Azure Data Lake Storage Gen 1 and Gen2, Azure Data Factory, Azure Databricks, Microsoft SQL Server, PowerBI
Education
Master's Degree in Technical Informatics
Budapest University of Technology and Economics - Budapest, Hungary
Certifications
Azure Data Scientist Associate
Microsoft
Azure AI Engineer Associate
Microsoft
Azure DevOps Engineer Expert
Microsoft
Microsoft Azure Administrator Associate
Microsoft
AWS Solutions Architect – Associate
Amazon Web Services
Databricks Certified Developer for Apache Spark 2.x for Scala
Databricks
Microsoft Certified Solutions Expert: (Azure) Cloud Platform and Infrastructure
Microsoft
MongoDB Certified Developer, Associate Level
MongoDB University
Cloudera Certified Administrator for Apache Hadoop (CDH4)
Cloudera
MongoDB Certified Database Administrator, Associate Level
MongoDB University
Skills
Languages
Python, Bash, SAS
Frameworks
Spark, Hadoop
Paradigms
ETL, Azure DevOps, DevOps, Serverless Architecture
Platforms
MapR, Azure, Google Cloud Platform (GCP), Amazon Web Services (AWS), AWS Elastic Beanstalk, Pentaho, Solaris
Storage
NoSQL, Azure Blobs, MongoDB, Azure SQL, Couchbase, Microsoft SQL Server, SQL Server Reporting Services (SSRS), Database Migration
Other
Azure Data Factory, Azure Data Lake, Big Data, Data Engineering, Azure Databricks, Data Warehousing, Big Data Architecture, Team Management, Technical Hiring, Microsoft Data Transformation Services (now SSIS), DHCP, IBM Tivoli Storage Manager, Data Analytics, Proof of Concept (POC), Open-source Software (OSS)
Tools
Azure Machine Learning, Ansible, Terraform, Puppet, Amazon Elastic MapReduce (EMR), SSAS, MongoDB Shell
Libraries/APIs
Azure Cognitive Services
How to Work with Toptal
Toptal matches you directly with global industry experts from our network in hours—not weeks or months.
Share your needs
Choose your talent
Start your risk-free talent trial
Top talent is in high demand.
Start hiring