
Alexandre Souza
DevOps Engineer and Developer
Alexandre is a DevOps and site reliability engineer with over 15 years of experience working with development, Infrastructure, networking, and architecture of on-premise and cloud-based solutions (AWS and GCP). He's worked with many tools, workflows, and languages including Java, Ruby, Python, Lua, SQL, and Bash.
Portfolio
Experience
DevOps - 15 yearsAWS CloudFormation - 5 yearsSite Reliability Engineering (SRE) - 5 yearsAWS Certified Solution Architect - 3 yearsTerraform - 2 yearsAmazon EKS - 1 yearAvailability
Preferred Environment
Cloud9, PyCharm, IntelliJ IDEA, Visual Studio Code (VS Code), Git, Linux
The most amazing...
...project I've implemented was an IaC solution that decreases the environment creation time from one week to 45 minutes,
Work Experience
DevOps Lead | Migration Expert (via Toptal)
Veea
- Migrated a group of applications from Mesos/Marathon to EKS (Kubernetes).
- Led a team of SRE/DevOps engineers to improve the metrics and alarms to the existing application.
- Increased the application security using AWS SecretManager instead of Git stored config files.
- Improved application deployment using Jenkins Pipeline Libraries applying Convention over Configuration based on the environment name.
AWS Cloud Formation Expert (via Toptal)
BAZZE & COMPANY
- Maintained the CloudFormation templates related to existing AWS resources.
- Created CDK (AWS Cloud Development Kit) project to automate new resource creation and update. And also imported the existing resources to the CDK project.
- Improved AWS resource maintainance adding all resources to an IaC (Infrastructure as Code) project, keeping them inventoried.
Senior CI/CD Engineer (via Toptal)
Code Particle
- Maintained the CloudFormation templates used to create and update the AWS Infrastructure.
- Improved the application deployment using CI/CD with rolling deployment and Blue/Green deployment technics.
- Created Terraform Infrastructure as Code (IaC) deployment to new projects and imported existing AWS resources to Terraform.
- Created Mongo Atlas clusters using Terraform and integrated them into the other projects.
Systems Engineer {AWS] (via Toptal)
Benetech
- Maintained Chef receipts, EC2 instances, and Nginx configurations.
- Fixed deployment problems and updated CI/CD pipelines.
- Updated AWS profiles and configured CloudWatch Alarms and Dashboards.
Systems Architect
Daitan
- Created and maintained a CI/CD system for a front-end application.
- Maintained a cloud infrastructure creation system suitable to AWS and GCP cloud providers, built using Python, CloudFormation, and deployment management.
- Maintained infrastructure-as-code tools written in Terraform.
- Coached and reviewed the performance of five SRE/DevOps engineers to increase the delivery quality and improve the time to market of new features.
- Wrote training documentation and structured the content among the different levels of knowledge and specialization skills to the newly hired and also to the existing SRE/DevOps Engineers.
Systems Specialist
iFood
- Maintained critical systems fixing performance and stability problems.
- Created a blue/green deployment process and application for the AWS-hosted company systems with configuration options for canary deployment.
- Led the API Gateway implementation initiative based on the Kong solutions, it involved all development teams where I was responsible to propagate the benefits of the initiative and managed the delivery schedule of each team.
- Planned and executed load and stress tests in company applications to verify bottleneck points and performance improvement.
- Maintained Terraform-based Infrastructure as Code solutions.
- Maintained Chef-based configuration management solutions.
Systems Architect
Daitan
- Led cross-functional teams about adopting DevOps culture and implement best practices related to Infrastructure As Code (IaC) development quality and decrease deployment time.
- Automated DevOps procedures by creating applications for business rules handling, CloudFormation (AWS), and Deployment Manager (GCP) template generation, as well as Cloud environment orchestration.
- Optimized the productivity from a five-day environment creation with several script calls done manually to it, taking 45 minutes with the call of a single script that orchestrates the entire process.
- Spearheaded the creation of Jenkins pipelines for unit and integration tests of both the environment creation procedures and infrastructure-testing of the environments, allowing infrastructure-as-a-code acceptance tests for the environments.
- Implemented dockerized environments to enable more parallelized infrastructure testing.
- Executed high-availability infrastructure migration projects (all services ran in all instances) to a clustered environment (each service runs on its own set of machines), using SaltStack as the configuration manager.
- Fixed problems in legacy infrastructure scripts/procedures.
- Architected AWS and GCP cloud resources usage in projects for new environment features.
- Developed improvements in cloud environments to improve performance and optimize costs.
Senior Performance Analyst (APM)
Inmetrics S/A
- Oversaw and responsible for software incident analysis, creating root cause reports.
- Developed and installed monitoring solutions based on Zabbix—customizing scripts and plugins to provide custom monitoring.
Systems Architect
Lumis EIP
- Automated and orchestrated Linux and Windows servers using configuration management software like Chef, Puppet, and Capistrano.
- Handled build management with Ant, Maven, Jenkins, and Nexus.
- Created the architecture of web systems and cloud high availability infrastructure (AWS and Azure).
- Worked as an infrastructure architect at Petrobras, responsible for the stability of the Lumis portal environment, analyzing and fixing problems in WebLogic and Oracle 11g-based systems.
- Worked as a software architect at SulAmérica, responsible for system migration projects, development of new solutions, performance optimization, and correction of problems on Lumis Portal and WebSphere systems using Oracle database. Deployed automation solutions in mixed environments and checked the production environment deployment control.
- Worked as an on-site analyst at R7.com, responsible for the stability of Lumis portal environment. Implemented new infrastructure solutions, maintained Windows operating systems and Linux, Java Server, and HTTP Server, cache servers and other back-end applications network, evaluating and resolving infrastructure problems, development, and performance.
- Designed and developed solutions for network administrators (Windows Server 2003 and 2008, Linux and Solaris), DBAs (SQL Server 2000, 2005 and 2008, Oracle 9i, 10g and 11g, MySQL 5 and DB2), and web administrators (Java, Tomcat Application Server, JBoss, WebSphere and WebLogic, and IIS).
- Developed and consulted on web solutions, procedures creation, reports, and statistics using Java code, Groovy, Python, Ruby, JavaScript, SQL, Shell, and .NET.
- Evaluated, fixed, and improved the performance of web solutions and batches with high availability and performance requirements.
Experience
Pod Provision
Monitoring Project
• Servers (using auto-deploy)
• Networking (using network devices auto-discovery)
• VoIP
• Links
• CloudWatch integration
API Gateway
Blue/Green Deployment
Skills
Languages
Python 3, Java, JavaScript, TypeScript, SQL, Python, Bash, Perl
Frameworks
Flask, AWS HA, Django, Spring, Spring Core, Spring Boot
Tools
Amazon Elastic Container Service (Amazon ECS), Apache HTTP Server, NGINX, Boto, Boto 3, AWS CLI, AWS ELB, AWS SDK, Amazon Elastic Container Registry (Amazon ECR), Google Compute Engine (GCE), Google Kubernetes Engine (GKE), Amazon EKS, Amazon Simple Email Service (SES), AWS CloudFormation, Amazon CloudFront CDN, Terraform, SaltStack, Amazon EBS, Git, IntelliJ IDEA, PyCharm, Chef, Apache Maven, Jenkins, Zabbix, Kong, Bitbucket, Amazon Virtual Private Cloud (VPC), AWS IAM
Paradigms
DevOps, Automated Testing, Agile, Scrum, Kanban
Platforms
Amazon EC2, Kubernetes, Java EE, AWS Lambda, Linux, Visual Studio Code (VS Code), Windows Server, Nexus, Amazon Web Services (AWS), AIX, JBoss EAP, WebSphere, Google Cloud Platform (GCP), Docker, AWS ALB, AWS NLB, HP-UX
Storage
Amazon S3 (AWS S3), Amazon EFS, PostgreSQL, Datadog, SQL Server 2000, SQL Server 2005, SQL Server 2010, SQL Server 2008 R2
Other
Sanic Web Server, Cloud9, AWS DevOps, Site Reliability Engineering (SRE), Cloud, AWS Certified Solution Architect, Identity & Access Management (IAM), Cloud Services, Spring Cloud, WebLogic, Cloud Security, Ant, Networking, Content Management, Telephony, ECS, CI/CD Pipelines, Amazon Route 53, AWS Cloud Development Kit (CDK), Opsgenie, Slackbot, AWS SSH Keys, AWS Secrets Manager
Libraries/APIs
jQuery, Hystrix
Education
Master's Degree in Computer Engineering
Universidade Estácio de Sá - Rio de Janeiro, Brazil