Rodrigo Sampaio de Sá, Developer in São Paulo - State of São Paulo, Brazil
Rodrigo is available for hire
Hire Rodrigo

Rodrigo Sampaio de Sá

Verified Expert  in Engineering

Data Engineer & Analytics Developer

São Paulo - State of São Paulo, Brazil

Toptal member since November 25, 2021

Bio

Rodrigo is an experienced BI and data consultant as well as a data architect with over 11 years of experience. He has excellent problem-solving skills and the ability to work in a multicultural and agile environment with short deadlines. Rodrigo specializes in business intelligence, data warehouse, data science, analytics, and data visualization. An experienced team leader and project manager, he once efficiently oversaw projects for 10 different clients from multiple sectors.

Portfolio

NextRoll
Python 3, Machine Learning, Pandas, PyTorch, Amazon Web Services (AWS), Presto...
Unstoppable Domains
Google Cloud Platform (GCP), Google BigQuery, Data Build Tool (dbt)...
Nordica
Alteryx, QlikView, Qlik Sense, Business Intelligence (BI)...

Experience

  • Data Warehouse Design - 11 years
  • SQL - 11 years
  • ETL - 11 years
  • Data Engineering - 11 years
  • Data Modeling - 11 years
  • Data Analytics - 11 years
  • Python - 5 years
  • Machine Learning - 4 years

Availability

Full-time

Preferred Environment

SQL, Python, Data Modeling, Data Warehousing, Data Engineering, Data Analytics, Machine Learning

The most amazing...

...idea I've implemented was an evaluation and optimization model for the company's marketing programs.

Work Experience

Staff Data Science Tech Lead/Engineer

2021 - PRESENT
NextRoll
  • Expanded the contextual targeting product taxonomy and improved the natural language processing (NLP) model.
  • Improved NLP models by fine-tuning neural network parameters and layers.
  • Enhanced the contextual targeting product by structuring and refactoring it so that including, excluding, or expanding taxonomy is easier.
  • Defined a new way to gather data for NLP projects using zero-shot models.
  • Developed APIs to make NLP model predictions available to the whole company.
Technologies: Python 3, Machine Learning, Pandas, PyTorch, Amazon Web Services (AWS), Presto, Amazon EC2, Docker, Terraform, Natural Language Processing (NLP), Generative Pre-trained Transformers (GPT), Data Pipelines, APIs, ETL Tools, Data Science, Complex Problem Solving, Kanban, Scrum, Python, PL/SQL, Data Engineering, IT Project Management, Programming, SQL, Databases, Software Engineering, Technical Architecture, Amazon Elastic Container Registry (ECR), Amazon S3 (AWS S3), Spark, Amazon Elastic Container Service (ECS), PySpark, Spark SQL, Spark ML, Technical Leadership

Senior Data Engineer

2022 - 2023
Unstoppable Domains
  • Developed the ELT process on DBT to model the dimensional model for the data warehouse (BigQuery).
  • Developed ingestion pipelines on Airflow and Python to load data from APIs into BigQuery, to later be used as sources on the DBT data warehouse model.
  • Created Python scripts to transform data and automate manual processes.
  • Created reports on Mode for the financial area and user tracking.
Technologies: Google Cloud Platform (GCP), Google BigQuery, Data Build Tool (dbt), Apache Airflow, ETL Tools, Data Pipelines, Data Migration, Dashboard Development, Data Visualization, Relational Databases, Data Architecture, ETL Implementation & Design, Architecture, ETL, Business Intelligence (BI), Complex Problem Solving, Analytics, Data Analytics, Dimensional Modeling, Kanban, Scrum, Python, PL/SQL, Data Engineering, Data Warehouse Design, Data Warehousing, IT Project Management, Programming, SQL, Databases, Software Engineering, Technical Architecture

Senior Data Analytics Consultant | Architect

2016 - 2021
Nordica
  • Led and architected the data warehouse of three big pharmaceutical companies. Oversaw the whole process, including gathering requirements, data modeling, ETL processes, dashboard development, and architecture.
  • Applied machine learning to predict a company's sales, replacing their current estimates with a considerably better accuracy model.
  • Administered optimization techniques to solve the quota distribution problems of a customer.
  • Created an evaluation and optimization model for a pharmaceutical company's direct marketing programs.
  • Made the creation and maintenance of the dashboard straightforward by creating an architecture that relies on code reusability.
Technologies: Alteryx, QlikView, Qlik Sense, Business Intelligence (BI), Data Warehouse Design, Data Analytics, Machine Learning, SQL, ETL, Python 3, XGBoost, TensorFlow, Data Warehousing, Architecture, Scrum, Kanban, Team Leadership, Consulting, PL/SQL, Agile Project Management, Complex Problem Solving, Dimensional Modeling, Databases, Data Analysis, ETL Implementation & Design, Excel 365, Microsoft Access, Data Architecture, Analytics, SQL Server Integration Services (SSIS), Data Visualization, Relational Databases, Dashboard Development, Data Pipelines, ETL Tools, Data Migration, Data Science, Pandas, Python, Data Engineering, IT Project Management, Programming, Software Engineering, Technical Architecture, Technical Leadership

Business Intelligence Consultant

2010 - 2016
Talent Group
  • Architected and developed business intelligence (BI) solutions for seven companies from different segments. Oversaw gathering requirements, KPI definition, data modeling, ETL process, and dashboard development.
  • Led the BI area creation of a prominent Brazilian bank's accounting area, overseeing the information that 100 users consumed.
  • Created a new architecture to optimize ETL processing and dashboard building with reusable code.
  • Got recognized by customers as the go-to problem solver.
Technologies: SQL, QlikView, Qlik Sense, Business Intelligence (BI), Data Warehouse Design, Consulting, Complex Problem Solving, Data Warehousing, Architecture, Dimensional Modeling, Databases, Data Analysis, ETL Implementation & Design, Data Architecture, Data Visualization, Relational Databases, Dashboard Development, Data Pipelines, ETL Tools, Analytics, Data Analytics, Data Engineering, Programming, Software Engineering, Technical Architecture, Technical Leadership

Experience

Direct Marketing Evaluation

A pharmaceutical company was interested in optimizing its doctors' direct marketing programs, including visitations, telephony, samples, and digital.

The first approach I tried to conduct research was creating a dashboard with charts to help them answer questions. That did not work correctly since a doctor could be included and excluded from different programs at different times, making it impossible to have a significant enough sample to analyze. Also, the number of combinations was just too high.

To deal with this problem, I used machine learning to predict the doctors' prescriptions based on whether and how long they participated in the marketing programs. I built this model using XGBoost, so it was somewhat of a black box. To extract the insights from it, I used some machine learning explainability techniques, such as partial plots and SHAP values.

Data Warehouse Architecture

This is a data warehouse for a big pharmaceutical company. As an architect and team leader, I oversaw the whole architecture for the company's data and analytics area, including data warehouse and Qlik Sense dashboards. The data warehouse was developed using SQL Server and SSIS.

Education

2006 - 2010

Bachelor's Degree in Computer Science

The University of São Paulo - Sao Paulo, Brazil

Certifications

JUNE 2010 - PRESENT

MCTS SQL Server 2008 – Database Development

Microsoft

Skills

Libraries/APIs

Pandas, XGBoost, PySpark, Spark ML, TensorFlow, PyTorch

Tools

Qlik Sense, Microsoft Access, Apache Airflow, Amazon Elastic Container Registry (ECR), Amazon Elastic Container Service (ECS), Spark SQL, Terraform

Languages

SQL, Python, Python 3

Paradigms

Agile Project Management, Scrum, Kanban, Dimensional Modeling, Business Intelligence (BI), ETL, ETL Implementation & Design

Platforms

QlikView, Alteryx, Amazon Web Services (AWS), Amazon EC2, Docker, Google Cloud Platform (GCP)

Storage

Databases, PL/SQL, Microsoft SQL Server, Relational Databases, Data Pipelines, SQL Stored Procedures, SQL Server Integration Services (SSIS), Amazon S3 (AWS S3)

Frameworks

Spark, Presto

Other

Software Engineering, Programming, IT Project Management, Data Warehousing, Data Warehouse Design, Data Engineering, SQL Server 2015, Data Modeling, Data Analytics, Analytics, Complex Problem Solving, Consulting, Data Science, Architecture, Team Leadership, Data Analysis, Excel 365, Data Architecture, Data Visualization, Dashboard Development, Data Build Tool (dbt), ETL Tools, Data Migration, Technical Architecture, Technical Leadership, Machine Learning, Natural Language Processing (NLP), Google BigQuery, APIs, Generative Pre-trained Transformers (GPT)

Collaboration That Works

How to Work with Toptal

Toptal matches you directly with global industry experts from our network in hours—not weeks or months.

1

Share your needs

Discuss your requirements and refine your scope in a call with a Toptal domain expert.
2

Choose your talent

Get a short list of expertly matched talent within 24 hours to review, interview, and choose from.
3

Start your risk-free talent trial

Work with your chosen talent on a trial basis for up to two weeks. Pay only if you decide to hire them.

Top talent is in high demand.

Start hiring