Pablo Agustín Nava Vieyra, Developer in Zapopan, Mexico
Pablo is available for hire
Hire Pablo

Pablo Agustín Nava Vieyra

Verified Expert  in Engineering

Bio

Pablo is a skilled data analytics engineer with over four years of experience. He excels in data engineering and analysis and has a keen interest in mathematics and continuous learning, complemented by strong interpersonal and communication abilities. Pablo specializes in using Bash scripting to develop data pipelines and routinely employs tools such as Airflow, Snowflake, and dbt while remaining enthusiastic about mastering new technologies.

Portfolio

Tata Consultancy Services
Bash Script, Snowflake, Data Build Tool (dbt), Cloudera, Jira, GitLab CI/CD...
Freelance
Python, MATLAB, Whiteboarding, Zoom, Data Interpretation
Agrícola Villagómez
Python, Matplotlib, NumPy, Portfolio Analysis, Statistics, Mathematical Finance...

Experience

  • Mathematical Modeling - 12 years
  • Python - 8 years
  • Software Documentation - 6 years
  • SQL - 6 years
  • Git - 4 years
  • Snowflake - 3 years
  • Bash Script - 3 years
  • Data Build Tool (dbt) - 2 years

Availability

Part-time

Preferred Environment

Bash Script, Apache Airflow, Control-M, Git, Snowflake, Data Build Tool (dbt), C, Python, Pandas, Matplotlib, Amazon Web Services (AWS), Docker, Spark, BigQuery, Google Cloud Platform (GCP)

The most amazing...

...project I've worked on involved improving the advertising budget efficiency of a local international pharmaceutical company by 15%.

Work Experience

Big Data Cloud Engineer

2022 - PRESENT
Tata Consultancy Services
  • Collaborated with a global team to design, build, and optimize ELT data pipelines using the Agile methodology, adhering to continuous improvement and continuous delivery (CI/CD) best practices using GitLab.
  • Specialized in coding Bash scripts for data processing across eleven different 3rd-party pipelines, solving 29% of ServiceNow incident tickets with a single script.
  • Created documentation for software processes and procedures and developed training programs on fundamental technologies for new members, improving training fulfillment speed by 70%.
  • Reduced costs and increased scalability and mobility by migrating from an on-premise Netezza warehouse to a Snowflake cloud service solution within three months.
Technologies: Bash Script, Snowflake, Data Build Tool (dbt), Cloudera, Jira, GitLab CI/CD, Control-M, Informatica ETL, ServiceNow, Big Data, Data Warehousing, ETL, Data Engineering, Data Orchestration, Amazon Web Services (AWS), Amazon S3 (AWS S3), Data Pipelines, Data Management, Informatica, Data Architecture, Data Quality, Data Modeling, Identity, Databases, Orchestration, Hadoop, Scala, Business Intelligence (BI), Databricks, Terraform

Advanced Mathematics Tutor

2011 - 2022
Freelance
  • Taught STEM subjects ranging from foundational concepts for middle and high school students to advanced topics for undergraduates, ensuring the academic success of over 100 students.
  • Identified and addressed key knowledge gaps in students' subjects or career paths while enhancing their attention to detail and intuitively developing mathematical rigor.
  • Worked as a mentor, developing tailored hands-on training content and materials for each student's unique needs and goals, both in person and remotely.
  • Engaged with theory and practical applications to deliver comprehensive course curricula, including advanced statistics, linear algebra, and quantum mechanics over a few days.
Technologies: Python, MATLAB, Whiteboarding, Zoom, Data Interpretation

Financial Analyst Consultant

2020 - 2021
Agrícola Villagómez
  • Performed data analysis and visualization using Python for a long-term investment portfolio.
  • Conducted extensive exploratory data analysis across multiple industries, prioritizing key trends and challenges.
  • Demonstrated that the company's solution outperformed the second-best alternative by 30% over a decade, proving its significant impact on potential investors.
Technologies: Python, Matplotlib, NumPy, Portfolio Analysis, Statistics, Mathematical Finance, Data Management, Data Quality, Databases, Data Visualization, Data Analysis, Data Interpretation, Forecasting

Business Data Analyst

2020 - 2020
Pisa Pharmaceuticals
  • Identified the most effective advertising channels for various pharmaceuticals, boosting the digital media department's operational efficiency by 15%.
  • Built growth rate dashboards and presented the findings to stakeholders to facilitate strategic decision-making.
  • Performed clustering tests to segment high and low sales seasons in three different products, improving their logistics.
Technologies: Python, SQL, NumPy, Pandas, Matplotlib, Tableau, API Databases, Data Management, Databases, Data Visualization, Data Analysis, Data Interpretation, Jupyter, Business Intelligence (BI), Microsoft Power BI

Experience

NLP Analysis During the COVID-19 Pandemic

https://github.com/AgustinVieyra/NLP-about-CoVid-19
Conducted an in-depth sentiment analysis to gauge COVID-19's impact on Guadalajara's metropolitan area in June 2021 in partnership with my university and local government. Utilizing R and the Twitter API, our 6-person team applied sentiment analysis, complemented by insights from our custom survey.

The findings provided valuable insights for authorities during the critical vaccination distribution phase, enabling them to measure and understand the societal effects of the pandemic at that time.

DBT Data Engineering Pipeline

https://github.com/AgustinVieyra/First-dbt-project
Spearheaded a data build tool (dbt) pipeline project that revamped the entire ETL process, transforming raw data into business intelligence (BI) ready data marts. Starting with a basic dbt template in a BigQuery database, I implemented an array of dbt macros, tests, materializations, and models.

This initiative equipped the company with a robust data infrastructure, empowering data-driven decision-making and improving our value proposition.

Advertising Channels Optimization

Conducted a comprehensive analysis of the advertising channels utilized by a pharmaceutical company for two of their products. Drawing on metrics from the advertising agency, I suggested a nonlinear analysis method, providing deeper insight into the effectiveness of various platforms for product promotion. This led to a 15% optimization in the digital media budget.

Deep Learning Cloud Classifier

https://github.com/AgustinVieyra/cloud-classifier
Developed a deep learning image classifier using Google Colab as a proof of concept (POC) to automate cloud data entry for a meteorological database. This tool streamlined the classification and collection of climate data across different cloud types, contributing to richer, more accurate data for climate modeling efforts.

Education

2016 - 2022

Bachelor of Science Degree in Nanotechnology Engineering

Instituto de Estudios Superiores de Occidente - Guadalajara, Jalisco, Mexico

Certifications

NOVEMBER 2024 - NOVEMBER 2025

Microsoft Certified: Azure Data Engineer Associate

Microsoft

Skills

Libraries/APIs

PySpark, Pandas, Matplotlib, NumPy, SciPy, X (formerly Twitter) API

Tools

Apache Airflow, Control-M, MATLAB, Jira, GitLab CI/CD, GitHub, Jupyter, Microsoft Power BI, Git, Cloudera, Informatica ETL, Tableau, Zoom, BigQuery, Terraform

Languages

Bash Script, Python, SQL, Snowflake, C, R, Scala

Paradigms

ETL, Business Intelligence (BI), Design Thinking

Platforms

Databricks, Amazon Web Services (AWS), Azure, Docker, Google Cloud Platform (GCP), Azure Synapse Analytics, Azure Event Hubs, Azure Data Lake Storage

Storage

Data Pipelines, Databases, API Databases, Amazon S3 (AWS S3), Microsoft SQL Server

Frameworks

Hadoop, Delta Live Tables (DLT), Jinja, Spark

Other

Research & Investigation, Mathematical Modeling, Data Engineering, Data Orchestration, Data Visualization, Data Analysis, Data Interpretation, Data Build Tool (dbt), GitOps, Software Documentation, Economics, Statistics, Big Data, Data Management, Data Architecture, Data Quality, Data Modeling, Orchestration, Forecasting, Azure Databricks, DLT Pipelines, Scientific Data Analysis, ServiceNow, Portfolio Analysis, Mathematical Finance, Systems Thinking, Data Warehouse Design, Data Warehousing, Google Colaboratory (Colab), Whiteboarding, Informatica, Encryption, Identity, Azure Data Factory (ADF), Azure Stream Analytics, Design and implement data storage, Data Processing, Secure, monitor, and optimize data storage and data processing, Azure Synapse Link

Collaboration That Works

How to Work with Toptal

Toptal matches you directly with global industry experts from our network in hours—not weeks or months.

1

Share your needs

Discuss your requirements and refine your scope in a call with a Toptal domain expert.
2

Choose your talent

Get a short list of expertly matched talent within 24 hours to review, interview, and choose from.
3

Start your risk-free talent trial

Work with your chosen talent on a trial basis for up to two weeks. Pay only if you decide to hire them.

Top talent is in high demand.

Start hiring