Gabriel Petracca, Developer in Madrid, Spain

Gabriel Petracca

Verified Expert in Engineering

Data Engineer and Developer

Location
Madrid, Spain
Toptal Member Since
November 23, 2023

Gabriel is a senior data engineer with 10+ years of experience delivering scalable solutions and solving complex problems for large, medium, and small companies. With a background in telecommunications engineering and business intelligence, he has continued to learn and earn industry certifications in emerging technologies. Gabriel combines his expertise in big data, cloud architecture, DevOps, and software development with excellent leadership, strategic planning, and communication skills.

Portfolio

Roche
Git, Scala, Python 3, Amazon Web Services (AWS), Apache Spark, Apache Hive...
Plan A
Python 3, PostgreSQL, MySQL, Docker, Amazon Web Services (AWS), REST APIs...
Telefónica
Hadoop, Apache Hive, Python 3, PySpark, Scala, Docker...

Availability

Part-time

Preferred Environment

MacOS, Linux, Git, Python 3, Scala, Amazon Web Services (AWS), Docker

The most amazing...

...project I’ve ever worked on is a telecommunications satellite.

Work Experience

Senior Data Engineer

2023 - 2023
Roche
  • Developed an in-house big data solution to replace a commercial application, saving over €100,000 per year in license and infrastructure costs.
  • Promoted data engineering best practices within the team, encompassing testing, data quality, code reviews, and refactoring activities.
  • Optimized data pipelines running in Apache Spark with Scala and Amazon EMR, reducing processing times and costs by up to 50%.
Technologies: Git, Scala, Python 3, Amazon Web Services (AWS), Apache Spark, Apache Hive, Amazon Elastic MapReduce (EMR), Amazon S3 (AWS S3), Big Data, Data Lakes, PostgreSQL, SQL, Docker, Apache Maven, GitHub, Object-oriented Programming (OOP), Functional Programming

Senior Data Engineer

2022 - 2022
Plan A
  • Designed and implemented a microservices architecture for deploying new features as API-based microservices, increasing scalability and performance.
  • Developed and optimized CI/CD pipelines to automate the deployment of containerized applications, resulting in improved operational efficiency and reduced deployment time.
  • Built an application to calculate carbon emissions, creating an API that reduced the time-to-market for new carbon emission calculation methods.
Technologies: Python 3, PostgreSQL, MySQL, Docker, Amazon Web Services (AWS), REST APIs, DevOps, Continuous Integration (CI), Continuous Delivery (CD), Pandas, Data Analytics, Infrastructure as Code (IaC), Terraform

Big Data Engineer

2019 - 2021
Telefónica
  • Migrated on-premises Apache Hadoop and Apache Spark workflows to Amazon EMR, Amazon EKS, and AWS Glue, reducing data pipeline processing times by more than 50%.
  • Optimized Apache Spark jobs to further reduce processing times and tuned Amazon EMR instance fleets for greater cost savings.
  • Designed and built a secure and scalable data-lake-centric architecture on AWS to process more than 1 TB per day using Amazon S3, AWS Lake Formation, AWS Glue, Amazon EMR, Amazon EKS, and AWS Lambda.
  • Developed Apache Spark data processing pipelines with Python and Scala to process large amounts of data.
  • Migrated data pipeline workflows from a legacy tool to Apache Airflow, improving processing times and the reliability of data pipelines.
  • Implemented a Docker-based development and testing environment for Apache Spark, Scala, and Python applications, reducing development time and increasing the reliability and quality of the codebase.
  • Administered an on-premises Hortonworks Data Platform (HDP) cluster consisting of bare-metal servers. This involved fine-tuning the cluster, installing new tools and software, and implementing security access policies.
  • Migrated legacy on-premises applications and web services to Kubernetes running on Amazon EKS.
Technologies: Hadoop, Apache Hive, Python 3, PySpark, Scala, Docker, Amazon Web Services (AWS), Linux, Networking, Infrastructure as Code (IaC), Terraform, Ansible, SQL, Kubernetes, Amazon EKS, GitHub, Jenkins, HDFS, Hortonworks Data Platform (HDP), Apache Airflow, Apache Spark

Big Data Architect

2018 - 2019
Telefónica
  • Designed and automated the deployment of big-data-cloud architectures for data analysis in AWS using Terraform and AWS CloudFormation.
  • Supported the consulting team as a tech expert in designing and defining big data architectures for several customers.
  • Outlined and developed a template to deploy big data infrastructures for several customers, reducing deployment times from days to hours.
  • Optimized on-premises big data platforms for our customers.
Technologies: Python 3, Hadoop, Amazon Web Services (AWS), Cloud Infrastructure, Infrastructure as Code (IaC), Terraform, Ansible, Linux, Apache Hive, PySpark, Docker, Jenkins, Apache Spark

IoT Engineer

2016 - 2017
INVAP
  • Designed, tested, and deployed a large-scale LoRaWAN network.
  • Automated field tests of IoT devices using custom Python applications.
  • Published an IEEE research paper on our work with LoRaWAN IoT technology.
Technologies: Python 3, Internet of Things (IoT), LoRaWAN, Zigbee, Linux, Hardware, Amazon Web Services (AWS)

Communication Systems Engineer

2014 - 2016
INVAP
  • Designed, tested, and validated an airborne wireless communication system that supports high-availability links for telemetry (TM) and telecommand (TC) and high data rate links for video broadcasting.
  • Used model-based systems engineering (MBSE) and architecture frameworks like DoDAF and MODAF to model systems, employing SysML and Sparx Enterprise Architect software.
  • Worked on requirements and conceptual engineering for complex communication systems, including the definition, management, and traceability of system requirements.
Technologies: Internet of Things (IoT), Python 3, Networking, Network Protocols, SysML, System Requirements

Network Engineer

2013 - 2014
INVAP
  • Designed, configured, and implemented a high-availability network between the satellite and a ground control station for the ARSAT-1 and ARSAT-2 satellites.
  • Deployed a secure, encrypted site-to-site VPN connection using IPsec.
  • Developed LabVIEW software to automate and orchestrate the radiated susceptibility electromagnetic interference (EMI) and electromagnetic compatibility (EMC) tests.
Technologies: Networking, Cisco, VPN, TCP/IP, Layer 2 Networking, Routing

Student Researcher

2011 - 2013
Instituto Universitario Aeronáutico
  • Designed wavelet and empirical mode decomposition (EMD) filters in MATLAB for digital signal processing of radar signals.
  • Implemented digital filters on a field-programmable gate array (FPGA) using VHDL, Simulink, and System Generator.
  • Published two IEEE research papers showcasing the results of my research.
Technologies: MATLAB, Digital Filters, Signal Filtering, Digital Signal Processing, Data Processing, Radar

Mitigation of WLAN/RLAN Interference in Meteorological Radar

https://ieeexplore.ieee.org/document/8211629
This project comprised the development of a digital filter for meteorological radars. It is based on discrete wavelet transform and multi-resolution analysis techniques to mitigate interference caused by C-Band RLAN/WLAN communications systems compliant with IEEE 802.11 standards.

Education

2017 - 2018

Master's Degree in Business Intelligence

EAE Business School - Madrid, Spain

2012 - 2017

Master's Degree in Radar Systems

National University of Córdoba - Córdoba, Argentina

2006 - 2012

Engineer's Degree in Telecommunications

University Institute of Aeronautics (IUA) - Córdoba, Argentina

Certifications

DECEMBER 2022 - PRESENT

AWS Certified Data Analytics - Specialty

Amazon Web Services

MAY 2022 - PRESENT

Databricks Certified Associate Developer for Apache Spark 3.0

Databricks

JANUARY 2022 - PRESENT

AWS Certified Security - Specialty

Amazon Web Services

JUNE 2019 - PRESENT

AWS Certified Solutions Architect - Professional

Amazon Web Services

JULY 2018 - PRESENT

AWS Certified Solutions Architect - Associate

Amazon Web Services

DECEMBER 2015 - PRESENT

Certificate in Advanced English

University of Cambridge

Languages

Python 3, Scala, Python, R, SQL, SysML

Frameworks

Apache Spark, Spark, Hadoop

Libraries/APIs

PySpark, REST APIs, Pandas

Platforms

Amazon Web Services (AWS), Linux, Docker, MacOS, Kubernetes, Hortonworks Data Platform (HDP)

Storage

Amazon S3 (AWS S3), Redshift, Database Security, PostgreSQL, MySQL, Databases, Apache Hive, HDFS, Data Lakes

Other

Big Data, Networking, Data Engineering, Analytics, Security Architecture, Cloud Security, Security Management, AWS Cloud Architecture, AWS Certified Solution Architect, Cloud Infrastructure, Cloud Migration, Cloud Architecture, Data Analytics, Infrastructure as Code (IaC), Digital Filters, Signal Filtering, TCP/IP, Software, Machine Learning, English, Internet of Things (IoT), LoRaWAN, Zigbee, Hardware, Network Protocols, System Requirements, Cisco, Layer 2 Networking, Routing, Digital Signal Processing, Data Processing

Tools

Terraform, MATLAB, Git, AWS Glue, Spark SQL, AWS IAM, Radar, Ansible, Amazon EKS, GitHub, Jenkins, Apache Airflow, Amazon Elastic MapReduce (EMR), Apache Maven, VPN

Paradigms

DevOps, Continuous Integration (CI), Continuous Delivery (CD), Radio Frequency (RF) Protocol, Business Intelligence (BI), Data Science, Object-oriented Programming (OOP), Functional Programming
