Saad Ali
Verified Expert in Engineering
DevOps Engineer and Developer
Lahore, Punjab, Pakistan
Toptal member since October 3, 2022
With 11+ years of experience in multiple roles, Saad is passionate about designing, deploying, and managing scalable, reliable, and secure cloud infrastructure for web applications and services. He has extensive experience with AWS, CloudFormation, Terraform, Ansible, Jenkins, ArgoCD, Docker, and Kubernetes, as well as various tools and frameworks for monitoring, logging, security, and automation. Saad is also CKAD certified. He ensures that he always follows best practices while working.
Portfolio
Experience
- Bash - 10 years
- Amazon Web Services (AWS) - 10 years
- Jenkins - 5 years
- Python - 5 years
- Ansible - 5 years
- Terraform - 4 years
- Kubernetes - 2 years
- Argo CD - 1 year
Availability
Preferred Environment
Terraform, Linux, Amazon Web Services (AWS), NGINX, Bash, Ansible, AWS CloudFormation, Kubernetes, Amazon EKS, Argo CD
The most amazing...
...thing I've contributed to is the open source project Argo CD helm chart with a ConfigMap that allows you to add the ConfigManagementPlugin configuration.
Work Experience
Senior Site Reliability Engineer
Arbisoft
- Provided DevOps best practices and recommendations to Edly projects.
- Used Tutor for deploying the Open edX platform in a containerized environment.
- Used Tutor for deploying the Open edX platform in a containerized environment.
DevOps Engineer and Team Lead | Site Reliability Engineer | Senior Site Reliability Engineer
Arbisoft (Pvt) Ltd
- Built and led a team of 20 DevOps engineers between December 2018 and May 2021 before being promoted as a site reliability engineer at edX, 2U.
- Decreased the Open edX deployment time from two weeks to two days through automation using Jenkins, Ansible, and Bash scripts.
- Enforced a highly available, scalable, and self-healing infrastructure through CloudFormation for Open edX deployments.
- Monitored Open edX deployments using CloudWatch, Datadog, and New Relic.
- Reduced the dependency on Splunk licenses by utilizing the Elasticsearch, Fluentd or Fluent B, and Kibana stack for the log aggregation of Open edX and other Python applications.
- Used Lambda functions in AWS for scheduled jobs on EC2s and routine cleanups of AMIs.
- Implemented Jenkins Master with an EKS slave cluster.
- Incorporated various automation tools to facilitate a CI/CD process for multiple projects at Arbisoft.
Senior DevOps Engineer
Unifonic
- Created a new RabbitMQ Helm chart that uses the RabbitMQ Operator to launch a new cluster in Kubernetes and manage custom RabbitMQ resources.
- Completed EKS upgrade analysis to upgrade from 1.24 to 1.29.
- Utilized Terraform to easily launch new Kubernetes cluster Node.js pools in the environment as needed.
Site Reliability Engineer
edX
- Updated application code in open-source Open edX Git repositories to make the Django user management command more generic for other Open edX Django applications.
- Containerized the in-house ChatOps application previously hosted on AWS Lambda to run it in a Kubernetes cluster.
- Wrote an Argo CD deployment pipeline for ChatOps and other internal applications in GoCD.
- Used the Jenkins Job DSL plugin with Groovy language to write Jenkins jobs in code.
- Managed the infrastructure at scale using Terraform.
- Reduced manual toil by working on other forms of automation using automated jobs with Jenkins or GoCD that run different scripts.
Senior Network Systems Engineer | Associate NOC Manager
Nextbridge (Pvt) Ltd
- Led the team as an associate network operations center (NOC) manager in 2016.
- Improved the inbound email reception of the company's self-hosted email service by implementing a backup mail exchange (MX).
- Replicated the email storage to a secondary IMAP server in the same email system, removing a single point of failure.
- Implemented anti-spoofing measures in the email system, effectively preventing email spoofing.
- Automated FreePBX outbound call reports via Python to keep track of phone calls.
- Deployed web applications in an auto-scaling environment using AWS Developer Tools.
- Incorporated OpenVAS for external vulnerability assessment.
- Used MySQL Galera replication in AWS for an eCommerce business required to keep data in the US, Germany, and Australia.
- Increased the security of the eCommerce legacy application built in PHP through an AppArmor.
System Administrator
Happy Hosts
- Managed, maintained, and troubleshot servers for shared and dedicated web and email hosting.
- Utilized Puppet for automating administrative tasks.
- Implemented MySQL replication for ISP services that depended on it, such as PowerDNS.
Associate Network Systems Engineer
Nextbridge (Pvt) Ltd
- Implemented cloud infrastructure for various customers on AWS and Rackspace Cloud.
- Enforced disaster recovery plans for the company and many of its customers.
- Managed Subversion, Git, VMware ESXi and Proxmox hypervisors, VPN, LAMP or LEMP stacks, ROR, and email servers.
- Utilized Nagios for monitoring company and customer nodes.
- Executed MySQL Master-Master replication for a set of applications deployed in multiple physical locations.
System Administrator
Self-employed
- Increased a boot server's availability using Oracle Enterprise Linux 5.7 and Oracle Clusterware for diskless client machines.
- Used Debian Linux kernel patched with the Kerrighed single-system image for the client machines.
- Booted the diskless systems as part of a cluster—a single system leveraging the combined processing power of all machines to do various tasks. The set up was used in the research for breaking RSA encryption.
System Administrator
CoreZee (Pvt) Ltd
- Assisted development teams with resolving package dependencies on Linux and BSD systems.
- Tested the software built atop Linux- or BSD-embedded system boards from Cavium Networks.
- Built a testing scenario for an intrusion prevention system, a network security product.
- Utilized the Scapy packet manipulation framework to build and test exploits on the Snort rule set.
- Reduced the time it takes to package a release on a compact-flash card through Bash scripts.
Experience
MIT Open Learning Library
https://openlearninglibrary.mit.edu/aboutPhilanthropy University
https://www.philanthropyu.org/UC San Diego Online
https://online.ucsd.eduEducation
Master of Engineering in Communication Systems and Networks
Mehran University of Engineering and Technology - Jamshoro, Sindh, Pakistan
Bachelor of Engineering in Computer Systems
Mehran University of Engineering and Technology - Jamshoro, Sindh, Pakistan
Certifications
CKAD: Certified Kubernetes Application Developer
The Linux Foundation
Skills
Libraries/APIs
Jenkins Pipeline, Jenkins Job DSL
Tools
Jenkins, NGINX, Postfix, Ansible, AWS CloudFormation, Terraform, AWS CodeBuild, Iptables, VPN, Nagios, Parallels Plesk Panel, Puppet, FreePBX, AppArmor, Kibana, Fluentd, Amazon EKS, Splunk, Amazon CloudWatch, Vault, Amazon OpenSearch, RabbitMQ, Helm, ELK (Elastic Stack)
Paradigms
DevOps, Continuous Integration (CI), Continuous Delivery (CD)
Platforms
Linux, Amazon Web Services (AWS), Open edX, Kubernetes, Docker, Amazon EC2, BSD, Oracle Linux, Apache2, Proxmox, Rackspace Cloud, AWS Lambda, Ubuntu, Apache Kafka, Drone CI
Languages
Bash, Python
Storage
MongoDB, Amazon S3 (AWS S3), MySQL, LAMP Server, Database Replication, Auto-scaling Cloud Infrastructure, Elasticsearch, Oracle Cloud
Frameworks
Django, Flask
Other
CI/CD Pipelines, Scalability, Load Balancers, Networks, Shell Scripting, High Availability Disaster Recovery (HADR), Dovecot, GitHub Actions, Argo CD, GoCD, Site Reliability Engineering (SRE), Amazon RDS, Amazon Route 53, Infrastructure as Code (IaC), Orchestration, Programming, Intrusion Detection Systems (IDS), Intrusion Prevention Systems (IPS), Network Exploitation, IT Automation, Kerrighed, VMware ESXi, Disaster Recovery Plans (DRP), Web Servers, PowerDNS, SMTP, IMAP, Vulnerability Assessment, AWS DevOps, Prometheus, Apache Cassandra, Argo Workflows
How to Work with Toptal
Toptal matches you directly with global industry experts from our network in hours—not weeks or months.
Share your needs
Choose your talent
Start your risk-free talent trial
Top talent is in high demand.
Start hiring