Marcelo Grebois
Verified Expert in Engineering
Cloud Architect and MLOps Developer
Berlin, Germany
Toptal member since February 4, 2021
Marcelo is an experienced, multilingual technology leader, infrastructure solutions expert, and open-source advocate. With more than 24 years of expertise in purpose-led high-availability infrastructure solutions, he has excelled in roles ranging from engineering to executive leadership across Europe and Latin America. Marcelo focuses on building highly automated systems and has consistently delivered exceptional results. He is an open-source enthusiast, CNCF contributor, AWS certified, and IETF writer.
Preferred Environment
Kubernetes, Amazon Web Services (AWS), GitLab, GitLab CI/CD, Terraform, Azure, Google Cloud Platform (GCP), AIOps, Machine Learning Operations (MLOps), Large Language Models (LLMs)
The most amazing...
...experience was helping many companies achieve their platform goals within tight deadlines.
Work Experience
Senior DevOps Consultant
ManagedKube
- Recreated the full infrastructure and trained the team for squelch.io.
- Deployed a full Apache Spark cluster on Kubernetes for leanplum.com.
- Revamped the full infrastructure and led the migration to Azure for Parsable.
- Created and deployed three enterprise Kong clusters with multiple staging environments.
DevOps Engineer
Dukkantek DMCC
- Implemented a zero-trust network to connect to all the POS kiosks using NetBird and Headscale.
- Designed and implemented a GitOps pipeline to manage 2k IoT K3s clusters.
- Set up their optimized ML pipelines using CUDA on Kubernetes and Kueue.
Site Reliability Engineer
Makersite GmbH
- Reconfigured the OrientDB database to be more reliable, testing different HA setups, including a Dockerized deployment.
- Restructured the Terraform scripts to work with multiple clouds and reduce their complexity.
- Reworked the Ansible scripts completely to be more robust, using flow control and ensuring everything was in sync with the IaC repository.
- Implemented a new backup system based on ZFS, which takes snapshots of the database and allows faster recovery in case of failure.
- Reconfigured most of the Azure and AWS resources to optimize cost.
- Implemented deployment pipelines using GitHub Actions and Jenkins.
- Mentored the new DevOps engineers in the areas of cloud providers and IaC.
Senior DevOps Engineer
Pienso
- Designed and implemented a scalable, high-availability infrastructure on Google Cloud Platform (GCP), ensuring robust and efficient operations for the company's core services.
- Led the migration of critical systems to GCP, resulting in enhanced performance and reliability.
- Developed and maintained GCP-based solutions, integrating services such as Compute Engine, Kubernetes, and BigQuery to optimize data processing and storage capabilities.
- Automated deployment processes using GCP tools, significantly reducing deployment times and improving system resilience.
- Collaborated with cross-functional teams to align infrastructure development with organizational goals, leveraging GCP for innovative solutions.
- Ensured compliance with industry best practices and security protocols within the GCP environment, enhancing overall system security.
- Monitored and optimized GCP resource usage continuously, achieving cost savings while maintaining high-performance standards.
- Provided technical guidance and training to team members on GCP services and best practices, fostering a culture of continuous learning and improvement.
DevOps Engineer (via Toptal)
medflex GmbH
- Migrated the complete Kubernetes-based platform from AWS to OVH.
- Set up provisioning of multiple clusters via GitOps using Flux.
- Oversaw and performed the setup of automatic provisioning of stages for QA and testing.
- Migrated Medflex's applications to OVH's OpenShift, achieving a 50% quicker deployment and 30% cost reduction, enhancing scalability and system reliability for better service management.
- Optimized Medflex's API management using 3scale on OVH's OpenShift, leading to a 40% increase in API throughput and 25% load reduction, significantly improving healthcare service performance.
Linux Systems Administrator
AirLift LLC
- Configured the server farm in the Hetzner cloud using Docker.
- Set up Locust testing cluster with auto-provisioning.
- Load-tested the website at over two million requests per second.
DevOps Engineer
Shift
- Migrated the complete solution to AWS Lambda and SQS.
- Configured the complete cloud infrastructure based on Amazon EKS.
- Set up deployment and provisioning using GitHub Actions.
Senior DevOps Engineer
Deutsche Bahn
- Migrated a full analytics pipeline based on BigQuery to Google Cloud Knative.
- Designed and migrated the new infrastructure from GCP to AWS.
- Implemented several deployment pipelines and monitoring tools.
Cloud Infra Tech Lead
Daimler Mobility Services GmbH
- Set up and configured all developer productivity tooling, including self-hosted HA GitLab, GitLab CI/CD, Kubernetes clusters, and AWS cross-account access.
- Migrated all our infrastructure across three different service providers, first from IBM to DHC and then from DHC to AWS.
- Set up data ingestion pipelines for several of our clusters.
- Deployed and provisioned several enterprise-grade clusters on bare metal using Kubespray and Kubeadm.
Infrastructure Architect and Cloud Developer
Telefónica NEXT
- Joined as an IT infrastructure developer to redesign the current AWS architecture and automate the CI/CD.
- Owned the production environments and redesigned the complete system to be AWS-agnostic and work with sidecar deployment of microservices, using Kafka, Consul, Linkerd, and Kubernetes.
- Successfully migrated all legacy software to AWS.
DevOps Engineer, Infrastructure Architect, Django/Python Developer
ProfitBricks Deutschland
- Improved the creation and provisioning of test environments for the CI/CD.
- Created a master SaltStack state for data center orchestration that was used company-wide.
- Supported and migrated most of the CI to Jenkins CI.
CTO and Founder
GutenChef
- Developed a PoC using Node.js and Python, including CI/CD, on AWS.
- Coordinated a remote development team for the mobile apps.
- Launched the operation in Berlin, including marketing and sales.
Managing Director of Information and Technology
SPPIN TV - Waimax Telecommunications
- Managed technology and infrastructure, advised about all the processes and procedures, designed and implemented different types of projects regarding the enterprise, and dealt with clients and providers.
- Coordinated work with my team of around 200 people, handling the department budget and making the final decisions on new implementations.
- Oversaw our LAN and an HFC WAN. We also provided triple-play services over HFC and FTTx, including the use of IPT and IPBX.
- Set up IPT over HFC via DOCSIS 3.0 as part of our triple-play package, which was the most challenging—in Brazil, not every operator is interconnected, and PSTN trunking must be done in-house.
- Automated and optimized infrastructure using Linux, MySQL, Security, Keepalived, Heartbeat, ldirectord, Pacemaker, Cacti, MRTG.
CEO and Founder
BuscoTurno
- Developed an online appointment booking system for healthcare institutions in Latin America, with its core engine based on HL7.
- Oversaw the development of the booking system's mobile app.
- Spent six months bootstrapping the project at Start-Up Chile, a seed accelerator created by the Chilean government and based in Santiago de Chile.
Software Project Manager
Huawei Technologies Co.
- Managed ringback tone (RBT) and long-distance and international (LDI) products as the project manager for the Claro account.
- Developed next-generation intelligent network (NGIN) projects for Argentina, Uruguay, and Paraguay.
- Handled VPN, dynamic tariff (DT), and Rich Communication Services (RCS) for Movistar.
Chief Technology Officer
Waimax Telecomunicaciones
- Started as a network engineer, but shortly after that, was promoted to project manager and later to CTO.
- Handled the migration from HFC to HPNA and FTTH and supported and developed DOCSIS solutions for 3rd-party companies.
- Promoted to CTO of Red Control after one year as CTO for Itapema. Red Control is part of the holding of Sppin Telecom/Waimax Telecomunicaciones, a Brazil-owned holding of telecommunications companies.
Information Technology Specialist
IBM
- Served as a Windows administrator for the Sanofi-Aventis account, handling Windows Servers administration, Citrix administration, SOX audit, NAV console administration, and security administration.
- Collaborated with other accounts like Novartis, creating a script to gather information from about 6,000 servers that was later used to automate several tasks.
- Headed the implementation of the local Remedy solution for ticketing management.
- Assisted the Missouri headquarters with the knowledge transfer of the Case New Holland account.
- Acted as a disaster recovery support technician for the Manpower account.
- Contributed to the server consolidation team as a VMware engineer.
Information Security Auditor and Designer
Penta Security Solutions SRL
- Developed and implemented security standards and procedures based on my security audits, following ISO or SOX standards according to the industry involved.
- Performed several pen tests, wireless security analyses, vulnerability tests, IDS and IDP implementations, honeypot log analyses, disaster recovery plans, and forensics.
- Traveled to Spain and provided in-person, hands-on support for several clients, including Petrobras Argentina, Banco Patagonia, SGC Spain, OXY Oil, and Aerolineas Argentinas.
- Designed an open-source IDP solution based on Snort for very complex network implementations—the paper is available for download.
Information Security Auditor
Pampa Energía
- Provided security consulting services to every project handled by IT, defining the security guidelines and writing the appropriate standards/procedures.
- Audited code, performed application vulnerability analysis, and pen-tested the internal network. Web security and physical security were also part of several implementations.
- Contributed to the implementation of ISO 17799 and SOX standards for the entire company. Designed an IDS solution and developed the rules for Proventia IDP products.
Information Security, Network Administrator, Server Administrator
Terramed
- Worked with 10 Windows servers as a system administrator.
- Designed, planned, and implemented an information security infrastructure that included ISA servers, IDSs, network antivirus, and a VPN server.
- Migrated the data center from Windows 2000 to Windows 2003.
Network Administrator, Server Administrator, Security Auditor
Toyota Argentina
- Supported and maintained over 1,000 workstations and over 400 pieces of network equipment.
- Configured and implemented two complete Dell racks with PowerEdge servers, migrated the Symantec antivirus server to the new farm, and updated the client on every workstation.
- Implemented a CiscoWorks instance to manage the entire network and participated in the design of the Windows 2000 to 2003 migration.
Technical Support, Network Administrator, Pre-sales Consulting
Canal Uno
- Focused on technical support, consulting, network administration, and server configuration.
- Implemented the company CRM and assisted the sales department as a pre-sales technician.
- Performed security audits of the internal network.
Technical Support, Network Administrator, Server Administrator
Red-Com Sistemas
- Provided broad technical support to the local network and small 3rd-party companies.
- Configured servers and networks, including security hardening.
- Successfully migrated the infrastructure from Windows to Linux.
Experience
GutenChef | CTO and Founder
BuscoTurno | CEO and Founder
Data-driven Real Estate Site
Education
Executive MBA in Finance
European School of Management and Technology - Berlin, Germany
Master's Degree in Computer Forensics
National Technological University (Universidad Tecnológica Nacional (U.T.N.)) - Buenos Aires, Argentina
Master's Degree in Information Security
CAECE University - Buenos Aires, Argentina
Bachelor's Degree in Computer Science
University of Buenos Aires - Buenos Aires, Argentina
Bachelor's Degree in Physics
University of Buenos Aires - Buenos Aires, Argentina
Certifications
AWS CloudFormation Workshop
tecRacer Group
Architecting on AWS (AWS-A)
tecRacer Group
Security Engineering on AWS
tecRacer Group
AWS Technical Professional
AWS
Cisco Certified Network Professional (CCNP)
Cisco
Certified Information Systems Security Professional (CISSP)
ISC2
VMware Certified Professional (VCP)
VMware
Skills
Libraries/APIs
Node.js, 3Scale API
Tools
GitLab, Terraform, Ansible, Google Kubernetes Engine (GKE), GitHub, CircleCI, Helm, GitLab CI/CD, VPN, NMap, Nessus, Wireshark, Interactive Disassembler (IDA) Pro, VMware, Jenkins, SaltStack, Keepalived, Pacemaker, Cacti, Kong, Keycloak, Dynatrace, Grafana, Azure Kubernetes Service (AKS)
Languages
YAML, Python, Java, TypeScript
Paradigms
DevOps, Continuous Integration (CI), DevSecOps, Azure DevOps, HIPAA Compliance
Platforms
Kubernetes, Amazon Web Services (AWS), Google Cloud Platform (GCP), Linux, Azure, Windows, Citrix, Apache Kafka, Windows XP, AWS Lambda, OpenStack, Docker, AWS IoT, Amazon, NVIDIA CUDA
Frameworks
Crossplane, Spark
Storage
Datadog, MySQL, Redis, OVH
Other
Kubernetes Operations (kOps), IT Security, CI/CD Pipelines, Infrastructure, DevOps Engineer, Configuration Management, Cloud Security, Infrastructure as Code (IaC), Kubespray, Cloud, Finance, Argo CD, Computer Science, Digital Forensics, CCNA, Networking, LAN, Dell PowerEdge Servers, Symantec, Cisco, SOX, IDS/IPS, ISO/IEC 17799, ISO 27001, SOX Compliance, Disaster Recovery Plans (DRP), People Management, IT Project Management, CTO, Dynamic Tariff (DT), Rich Communication Services (RCS), HFC, FTTx, PSTN, HL7, Mobile Apps, Web Development, Consul, Intrusion Detection Systems (IDS), Security, Heartbeat, ldirectord, Multi Router Traffic Grapher (MRTG), Prometheus, Technology, OpenStack Swift, Amazon API Gateway, Linux Administration, High Availability Disaster Recovery (HADR), Cloudflare, CMT, Machine Learning, STRIDE, Site Reliability Engineering (SRE), Large Language Models (LLMs), CUDA Kernel, AIOps, Machine Learning Operations (MLOps), Wireless Security, CCNP, Monitoring, K3s, Serverless