Kleber Alves, Developer in Santo André - State of São Paulo, Brazil
Kleber is available for hire
Hire Kleber

Kleber Alves

Verified Expert  in Engineering

Data Engineer and Developer

Santo André - State of São Paulo, Brazil

Toptal member since January 12, 2024

Bio

Kleber has eight years of experience as a Python programmer, working on data projects in different industries such as banking, finance, eCommerce, marketing, contact centers, and logistics. He has extensive experience in Spark, SQL, Git, ETL pipelines, dashboards, data analysis, analytics, ML models, and cloud environments. In addition, Kleber is well-versed in Snowflake, Databricks, data lakes, relational and non-relational databases, and data modeling.

Portfolio

BlueShift
Python, PySpark, Databricks, Amazon Web Services (AWS), Azure...
Banco Original
Python, PySpark, Cloudera, Google BigQuery, Apache Hive, Data Lakes...
Atento Brasil
Python, Azure, Scikit-learn, Pandas, Selenium, Beautiful Soup, Seaborn...

Experience

  • Apache Spark - 8 years
  • Python - 8 years
  • SQL - 7 years
  • Git - 7 years
  • Big Data - 7 years
  • Azure - 4 years
  • Databricks - 4 years
  • Snowflake - 2 years

Availability

Full-time

Preferred Environment

Python, SQL, Databricks

The most amazing...

...project I've been involved in is the construction of an efficient data lake for a digital bank using Python.

Work Experience

Data Engineer

2020 - PRESENT
BlueShift
  • Executed impactful projects across diverse client sectors, including eCommerce, finance, agribusiness, and fuel distribution, and delivered tailored solutions for varied industry challenges.
  • Optimized eCommerce marketing by integrating Facebook APIs to automate 20+ diverse marketing audiences and consolidate customer, sales, and product data from physical stores and the website.
  • Developed an ETL workflow for fuel distribution, predicting sales and revenue. I compared four forecast models with actual revenue, feeding a dynamic forecasting dashboard.
Technologies: Python, PySpark, Databricks, Amazon Web Services (AWS), Azure, Azure Data Factory (ADF), Snowflake, Git, Data Lakes, Data Warehousing, Data Engineering, APIs, Excel 365, Databases, Microsoft Excel, Data Analytics, Unstructured Data Analysis, Delta Live Tables (DLT)

Data Engineer

2019 - 2020
Banco Original
  • Designed and constructed a comprehensive data lake catering to the entire company's data needs.
  • Focused on banking marketing strategies, analyzing and exploring customer and account data, and generating targeted audiences for email, SMS, and push campaigns.
  • Built Python and Spark codes and scripts to orchestrate ETL flows, seamlessly managing development processes and ensuring smooth delivery to transactional systems, APIs, and dashboards.
Technologies: Python, PySpark, Cloudera, Google BigQuery, Apache Hive, Data Lakes, Data Warehousing, Git, Spark, Data Engineering, APIs, Excel 365, Databases, Microsoft Excel, Data Analytics, Unstructured Data Analysis

Data Scientist

2019 - 2019
Atento Brasil
  • Led projects involving data structuring, statistical analyses, and ML models for retail debtors. I also implemented web scraping techniques to extract consumer complaints data from web pages.
  • Conducted Python programming and performed exploratory data analyses, identifying operational patterns and preparing data for statistical analysis and ML models.
  • Leveraged Seaborn and Matplotlib to present data and results.
  • Developed robust data pipelines and ETL processes in Python, working within the Azure environment and the Agile methodology.
Technologies: Python, Azure, Scikit-learn, Pandas, Selenium, Beautiful Soup, Seaborn, Matplotlib, Excel 365, Databases, Microsoft Excel, Data Analytics, Unstructured Data Analysis

Data Scientist

2018 - 2019
ML Servios Financeiros
  • Delivered an end-to-end data science project, performed data engineering from relational databases, conducted statistical analyses, and developed machine learning models for debtors in the financial and banking sectors.
  • Created APIs for dynamic model result queries and a web portal to control and showcase machine learning outcomes.
  • Performed Python programming for data exploration, ETL flows, data preparation, statistical and data analyses, and ML model development.
Technologies: Python, SQL Server 2016, Pandas, Scikit-learn, Django, ETL, Git, SQL, Excel 365, Databases, Microsoft Excel, Data Analytics, Unstructured Data Analysis, REST APIs

Data Analyst

2017 - 2018
Santander Brasil
  • Automated tasks and developed Visual Basic for Applications (VBA) and SQL systems for streamlined operations.
  • Created systems for the bank's back-office operations, mainly focusing on frozen accounts and balances due to court orders.
  • Led the delivery of automated daily reports and dashboards for bank operations, significantly enhancing operational efficiency, reducing response time, and minimizing process error rates.
Technologies: SQL Server 2016, MySQL, Excel 2013, Visual Basic for Applications (VBA), Excel VBA, Microsoft Access, Microsoft Power BI, Python, SQL, Excel 365, Databases, Microsoft Excel, Automation, Data Analytics

Mechanical Technician

2009 - 2016
Mercedes-Benz do Brasil
  • Performed tasks related to materials engineering, prototypes, quality, and production.
  • Handled the assembly of light, medium, and heavy-duty trucks and buses for domestic production and export.
  • Conducted testing of automotive parts, including chassis, axles, and engines, analyzing raw materials and evaluating weld points and seams.
Technologies: Mechanics, Production

Experience

Mapping Urban Safety with Public Databases

In December 2018, my team and I won the "Data Battle" hackathon organized by Itaú, a Brazilian bank. We developed an idea for public safety, using public data from police incidents to map high-risk areas in urban regions. The concept integrated official police incident reports with general opinions about city streets, creating a map akin to Google Maps' offerings. Instead of streets turning red due to traffic, they would indicate insecurity, as reported by citizens. The purpose was to assist residents and individuals navigating unfamiliar areas to avoid potentially dangerous routes and times.

Education

2018 - 2019

Master of Business Administration (MBA) in Big Data and Data Science

Faculdade de Informática e Administração Paulista (FIAP) - São Paulo, Brazil

2015 - 2017

Master of Business Administration (MBA) in Project Management

Fundação Getulio Vargas (FGV) - São Caetano do Sul, Brazil

2010 - 2014

Bachelor's Degree in Materials Engineering

Centro Universitário Fundação Santo André (CUFSA) - Santo André, Brazil

Certifications

APRIL 2023 - PRESENT

Astronomer Certification for Apache Airflow Fundamentals

Astronomer

Skills

Libraries/APIs

Pandas, Beautiful Soup, PySpark, Polymer, Scikit-learn, Matplotlib, REST APIs

Tools

Microsoft Excel, Git, Spark SQL, Excel 2013, Microsoft Access, Apache NiFi, Cloudera, Microsoft Power BI, Tableau, Apache Airflow, Seaborn

Languages

Python, SQL, Visual Basic for Applications (VBA), Excel VBA, Snowflake, R

Paradigms

ETL, Automation, Agile Project Management, Scrum

Platforms

Databricks, Azure, Amazon Web Services (AWS), Google Cloud Platform (GCP)

Storage

Data Lakes, Databases, SQL Server 2016, MySQL, Apache Hive, NoSQL

Frameworks

Apache Spark, Selenium, Delta Live Tables (DLT), Metal, Hadoop, Django, Spark

Other

Data Analysis, Scope Management, Big Data, Data Warehousing, Data Engineering, Excel 365, Data Analytics, Unstructured Data Analysis, Mathematics, Data Architecture, ELT, Azure Data Factory (ADF), APIs, Mechanics, Machining, Calculus, Industrial Automation, Electric, Prototyping, Technical Drawing, Algebra, Metallurgy, Physics, Chemistry, Materials Science, Composite Materials, Materials Testing, Materials Failure Analysis, Metal Processing, Metals & Mining, IT Project Management, Projects, PMI, Budget Management, Feasibility Studies, Valuation, Data Science, Machine Learning, Production, Google BigQuery, Data

Collaboration That Works

How to Work with Toptal

Toptal matches you directly with global industry experts from our network in hours—not weeks or months.

1

Share your needs

Discuss your requirements and refine your scope in a call with a Toptal domain expert.
2

Choose your talent

Get a short list of expertly matched talent within 24 hours to review, interview, and choose from.
3

Start your risk-free talent trial

Work with your chosen talent on a trial basis for up to two weeks. Pay only if you decide to hire them.

Top talent is in high demand.

Start hiring