Alvaro Parra
Verified Expert in Engineering
Data Engineer and Developer
La Paz, La Paz Department, Bolivia
Toptal member since July 16, 2021
Alvaro is a data engineer with over seven years of experience working on ETL pipelines and databases. He is a certified MCSA SQL database developer with expertise in Azure, Databricks, Python, Airflow, and Snowflake. Alvaro has engineered data solutions for banking, credit card, pharmaceutical marketing, and oil companies. His work has significantly increased revenue, cut expenses, and reduced processing times.
Portfolio
Experience
- Data Engineering - 6 years
- Python - 5 years
- Snowflake - 5 years
- Data Migration - 5 years
- Data Modeling - 4 years
- Apache Airflow - 4 years
- Data Build Tool (dbt) - 3 years
- Database Migration - 3 years
Availability
Preferred Environment
Azure Data Factory, Azure, Snowflake, Apache Airflow, Data Build Tool (dbt), Python, Azure SQL
The most amazing...
...project I've worked on processes credit and debit card use at a bank's ATMs. It has enabled the company to gain a competitive advantage.
Work Experience
Data Engineer
Precision Drilling
- Implemented data quality pipelines for detecting, fixing, and alerting possible issues, reducing data anomalies by 20% using Azure, Snowflake and Apache Airflow.
- Automated backfill pipelines on Snowflake, Python, and dbt for solving data streaming-related issues. This pipeline reduced manual workload by 30% per month.
- Designed a pipeline infrastructure on Snowflake, Python, and Apache Airflow for extracting and ingesting SAP API information. This improved upload time by 20%.
- Deployed an ETL pipeline on Python, Snowflake, and Azure for cleaning duplicate table records. This process runs monthly, and it usually affects 2-5% of the totality of records.
- Optimized five source data pipeline architectures on Python, Snowflake, and Apache Airflow to yield a 30% improvement in data processing speed.
Data Engineer
PROS
- Worked with 30+ customers on designing and implementing pipelines for transforming customer information into XML that could be digested by the PROS system by using Azure Data Factory, Databricks, Pyspark, and Python.
- Ingested data from multiple data sources using a combination of AWS, SQL, Salesforce API, and Python to create data views to be used in Power BI and SSRS.
- Automated data extraction using Azure, Apache Airflow, Snowflake, and Python solution for a PROS customer that required integration of his system with PROS data. This reduced manual workload by 20% monthly.
- Created a Python/Pyspark library to reformat and export data from external customers to fit the PROS system format. This saved five hours of coding time per new pipeline.
- Improved and maintained 20% of data infrastructure for different PROS customers in Azure.
Data Management Analyst
Banco FIE
- Automated marketing ETL pipelines by using Python and Microsoft SQL Server and saved 30% of the manual effort.
- Deployed an ETL pipeline to scrape data from a weather API using Python and join it with data from bank branches to help discover branch transactions relative to weather. The bank saved 12% in operating costs by changing the number of cashiers.
- Collaborated with consultants to Rabobank (a Dutch bank) in segmentation and high-value customer and mobile app usability projects by creating Python and Microsoft SQL Server ETL pipelines to extract and process customer data.
- Deployed an ETL pipeline on Python and Microsoft SQL Server that processed debit card information to help discover the bank’s ATM usage and saved 10% on ATM maintenance costs by relocating unused ATMs.
- Automated data pipelines in Python and SQL to process customers' financial information for selling credit cards, resulting in a 35% increase in credit card sales in the first month.
Database Administrator
Administradora de Tarjetas de Credito ATC
- Developed and designed ETL pipelines on SSIS to help reduce debit and credit card fraud, increasing fraudulent debit card detection by 7%.
- Deployed an ETL using Python for processing monthly debit and credit card information sent to affiliated banks and reduced processing time by 30%.
- Migrated a 250GB data warehouse, including stored procedures and ETLs, from Microsoft SQL Server to Azure SQL.
Business Intelligence Developer
Grupo Alcos
- Deployed marketing ETL on SSIS that processes information from OLTP databases to analyze product consumption, resulting in a 15% increase in top product sales.
- Developed and migrated multiple dashboards from Excel to QlikView and reduced manual effort by 50%.
- Migrated a 100GB data warehouse including stored procedures and ETLs from Informix to Microsoft SQL Server.
Experience
Dashboard for Pharmaceutical Marketing Department
PROBLEM
Employees were trying to process and analyze millions of sales on Excel spreadsheets, which was time-consuming and error-prone. The marketing manager requested a solution that could optimize and improve the process.
SOLUTION
We started by designing a data warehouse to extract information from the company's selling system. To store that information, we bought a server and installed a Microsoft SQL Server database. Next, we installed Pentaho Kitchen on another server and developed ETL pipelines to process information.
Once the design was stable, we began work on the visualization tool. We had QlikView at that time, so that's how we started developing the visualization portion. The new dashboard had a significant impact, as processes that had taken hours were done in a matter of minutes. It allowed us to measure each agency's sales and provide information to improve overall company sales.
Airgas | Multiple Reports for PROS
Union Pacific | Complex SSRS Reports for PROS
Education
Bachelor's Degree in Computer Science
Universidad Mayor de San Andrés - La Paz, Bolivia
Certifications
Azure Data Engineer Associate
Microsoft
DP-900: Microsoft Azure Data Fundamentals
Microsoft
MCSA: SQL 2016 Database Development
Microsoft
Skills
Libraries/APIs
PySpark, Pandas
Tools
Apache Airflow, Microsoft Excel, Microsoft Power BI, Azure Kubernetes Service (AKS), AWS Glue
Languages
SQL, Python, Snowflake, Stored Procedure, T-SQL (Transact-SQL)
Paradigms
ETL, Business Intelligence (BI), Database Design
Storage
Databases, SQL Stored Procedures, Data Validation, SQL Server Reporting Services (SSRS), Microsoft SQL Server, Data Pipelines, Database Migration, PostgreSQL, SQL Triggers, SQL Performance, MySQL Server, Data Integration, Dynamic SQL, Oracle PL/SQL, Data Lakes, Amazon S3 (AWS S3), Azure SQL, MySQL, PL/SQL, Database Security, Redshift
Platforms
Databricks, Azure, Azure Synapse, Amazon Web Services (AWS), AWS Lambda
Frameworks
Apache Spark
Other
Data Engineering, Data Analysis, Data Warehousing, Business Intelligence Consultant, Data Management, Star Schema, CSV, CSV Export, Data Extraction, Automated Data Flows, Reports, Data Modeling, Data Analytics, Pipelines, APIs, Azure Data Lake, Data Analysis Consultant, ELT, Scripting, Data Build Tool (dbt), Data Migration, Performance Tuning, ETL Tools, Entity Relationships, Data Governance, Data Structures, Data Entry, Architecture, Cloud Migration, Solution Architecture, Azure Data Factory, Data Marts, Web Scraping, Big Data, Dashboards, Data Visualization, Financial Reporting, Data Queries, Azure Databricks, Data Architecture
How to Work with Toptal
Toptal matches you directly with global industry experts from our network in hours—not weeks or months.
Share your needs
Choose your talent
Start your risk-free talent trial
Top talent is in high demand.
Start hiring