
Victor Martins
Verified Expert in Engineering
Data Engineer and Developer
Curitiba - State of Paraná, Brazil
Toptal member since June 24, 2020
Victor is a data engineer with six years of experience developing cloud-based data pipelines. He specializes in big data applications, data privacy, data-driven software, cloud architecture and he excels at extracting value from data. Two of Victor's largest projects involved creating a data lake for one of Latin America's top fintechs from scratch and developing financial intelligence for one of the largest Brazilian e-commerce startups.
Portfolio
Experience
- Python - 5 years
- SQL - 5 years
- Amazon Web Services (AWS) - 4 years
- Data Engineering - 4 years
- Tableau - 4 years
- Data Analytics - 4 years
- Snowflake - 3 years
- Big Data - 3 years
Availability
Preferred Environment
Amazon Web Services (AWS), Linux, PostgreSQL, MySQL, SQL, Python, Spark, Redshift, Snowflake
The most amazing...
...data lake I've developed was for one of Latin America's largest fintechs—completely serverless, seamlessly scales to 50x its size, and is 100% cloud-based.
Work Experience
Data Engineering Specialist
SimpleTire, LLC - Via Toptal
- Developed a MySQL (AWS Aurora) to Snowflake near real-time replication pipeline. It replicates business analytics-specific data from the core database to the current Snowflake implementation.
- Constructed a near-real-time order processing pipeline that uses AWS serverless architecture to deploy a train, test, and serve machine learning model environment. It's currently used to optimize the logistics tied to the business model.
- Built a Snowflake integration with third-party tools and providers, such as CRM systems, data providers, and other sales channels. This integration ships data from those services and models them accordingly.
Senior Back-end Engineer
Carta Healthcare
- Optimized Jenknis test infrastructure and code, reducing overall test time by 20% on the first iteration, and prepared test structure to further improvements.
- Implemented deploy metrics such as test coverage, code complexity, and other code quality-related KPIs. It covers both Python and Typescript codebases.
- Implemented a default restful Python API abstraction for all business entities.
Data Engineering Technical Lead
Pipefy
- Planned the technical roadmap to migrate from legacy BI systems (BigQuery and Airflow) to an Apache Kafka stream-based approach.
- Oversaw multiple customer-facing data product iterations, most notably improvements on the self-service analytics capability of the core product.
- Created an Apache Kafka-based near real-time analytics solution to replicate databases across the company's entire ecosystem.
Data Engineering Specialist
Pipefy
- Developed the second version of the back end serving the platform analytical tool that allows the customer to build custom dashboards with built-in OLAP capabilities. Built with AWS Redshift and Lambda.
- Led a project to deliver customer data into a segmented Redshift database, allowing every company to have a high-performance analytical database with all of its data.
- Architected the integration between the billing and analytical systems to understand better how customers behave before and after each billing or subscription change.
Senior Data Engineer
EBANX
- Developed the company's data lake from scratch. The first version of the system had 30+ integrations and 1,500 tables and processed around 2TB of compressed data per day, built on top of AWS S3 and Redshift, with Spark as the core processing engine.
- Built a database replication system to import all the database events using a CDC architecture. Built using AWS Kinesis and containers hosted on AWS ECS.
- Migrated a 1,500 dashboard and 200+ daily active users Tableau deployment from a legacy data warehouse infrastructure to a data lake approach.
Business Intelligence Analyst
Rentcars.com
- Architected the company's data warehouse and ETL workflows using AWS Lambda and AWS Redshift.
- Redesigned and standardized the Tableau structure, visualization layouts and access management hierarchy completely.
- Refactored the legacy ETL from a cron job-based system to an event batch-driven serverless architecture.
Business Intelligence Analyst
MadeiraMadeira
- Developed the company's financial data mart, enabling it to deliver faster and more accurate insights to its stakeholders. Made on top of MariaDB using Pentaho as the ETL engine.
- Created a process to ensure data quality in the customer satisfaction pipelines, helping the manager drive up CSAT by 10%.
- Structured the company dashboard tool (Tableau), making data distribution more reliable and governable across all stakeholders.
Experience
Database CDC Engine
I was the lead developer and architect, working with a team of six. The project took six months, from the initial idea to production deployment. The deliverables allowed the company to integrate new systems easily into its existing environments, effectively scale the query workload, and centralize the core of its pipeline in a homegrown application.
Analytical Engine
http://www.pipefy.comThe whole software was developed on top of AWS, using Redshift, S3, Lambda, DynamoDB, and EventBridge. It is completely event-driven and serverless, and it has seamless scaling capabilities.
If you want to check it out, head over to pipefy.com, create a Pipe, and explore its data on the Dashboards tab.
Certifications
AWS Certified Developer Associate
AWS
AWS Certified Solutions Architect Associate
AWS
Skills
Libraries/APIs
Node.js
Tools
Git, Tableau, Jenkins
Languages
SQL, Python, Snowflake, Python 3
Paradigms
ETL, Business Intelligence (BI), OLAP, Lambda Architecture, HL7 FHIR Standard
Platforms
Amazon Web Services (AWS), AWS Lambda, Linux, Pentaho, Kubernetes, Docker, Apache Kafka
Storage
Database Modeling, Redshift, Amazon Aurora, MySQL, Amazon DynamoDB, Relational Databases, Data Integration, PostgreSQL, Amazon S3 (AWS S3), Data Lakes, Elasticsearch
Frameworks
Django, Spark
Other
Data Visualization, Data Transformation, Data Analytics, Data Engineering, Tableau Server, Big Data, Cloud, Data Modeling, Serverless, CI/CD Pipelines, Big Data Architecture, Dashboard Design, Dashboards, Containers, Streaming, MinIO, Amazon Kinesis, Parquet, Data Architecture
How to Work with Toptal
Toptal matches you directly with global industry experts from our network in hours—not weeks or months.
Share your needs
Choose your talent
Start your risk-free talent trial
Top talent is in high demand.
Start hiring