Fayaz Abdul, Developer in London, United Kingdom
Fayaz is available for hire
Hire Fayaz

Fayaz Abdul

Verified Expert  in Engineering

Azure Certified DevOps Engineer and Developer

London, United Kingdom
Toptal Member Since
October 24, 2019

Fayaz is a DevOps engineer and architect with experience in designing end-to-end solutions. He has extensive cloud and infrastructure expertise with multiple AWS, Azure, and Google Cloud certifications. He specializes in monitoring tools (ELK, Zabbix, and Prometheus) and automation with Python, Ansible, Terraform, and Docker. Fayaz has implemented Jenkins CI/CD pipelines to build, test, and deploy large apps, including microservices, with zero downtime.


Amazon Web Services (AWS), Puppet, Big Data, Web Application Firewall (WAF)...
Buddy Punch LLC
Azure, Security, IT Security, CI/CD Pipelines, Load Balancers, Scaling, DevOps...
St Jude Childrens Research Hospital - Bioinformatics
Python, ETL, Azure, Linux, Azure Data Factory, DevOps, Azure DevOps, Perl, Bash...




Preferred Environment

Visual Studio Code (VS Code), Windows, MacOS, CentOS, RHEL, Ubuntu

The most amazing...

...thing I've coded is a grid utilization tool, gathering data from 60,000 nodes and presenting the same in a usable graphical representation to help the business.

Work Experience

Senior DevOps Engineer

2016 - PRESENT
  • Migrated the servers (1,400) from on-premise data centers to the public cloud.
  • Converted monolithic applications to microservices.
  • Used IAC Terraform and Cloudformation to deploy the infrastructure.
  • Set up CI/CD pipelines starting from the creation of images, rolling out the environments to pushing the application code changes/features to production.
  • Built many monitoring solutions for different projects including ELK, Zabbix, and Prometheus. Involved in creating and managing Kubernetes clusters, ELK clusters, Kafka clusters for many projects.
  • Architected the end-to-end solutions securely including logging, alerting, and auditing(ITHC).
  • Coded in multiple languages, well versed in serverless programming and front end using React, Amplify, AppSync, GraphQL, DynamoDB.
  • Migrated and managed Atlassian Product Suite to the Cloud.
  • Migrated SAP HANA workloads to the public AWS Cloud.
Technologies: Amazon Web Services (AWS), Puppet, Big Data, Web Application Firewall (WAF), Bitbucket, Artifactory, SonarQube, Apache Maven, Ruby, Git, Groovy, AWS Lambda, SAP, Squid, Zabbix, Heroku, Rancher, Kubernetes, ELK (Elastic Stack), Vault, Consul, Packer, Docker, AWS CloudFormation, Jenkins, Ansible, Terraform, Python, Google Cloud Platform (GCP), Azure, Machine Learning, Data Science, Azure Synapse, Azure Machine Learning, Project Management, IT Project Management, Azure IaaS, .NET, Amazon CloudFront CDN, Amazon Aurora, Amazon EC2, Amazon RDS, Vulnerability Identification, Vulnerability Scanning, Vulnerability Management, Cyber Defense

Azure Expert

2023 - 2023
Buddy Punch LLC
  • Improved the security posture of all infrastructure resources.
  • Introduced web application firewall to protect the application from distributed denial-of-service (DDoS) attacks and make the application internet of things (IoT) secure.
  • Conducted a comprehensive audit of the estate and remediated vulnerabilities.
Technologies: Azure, Security, IT Security, CI/CD Pipelines, Load Balancers, Scaling, DevOps, Vulnerability Identification, Vulnerability Scanning, Vulnerability Management, Cyber Defense, Cybersecurity

DataOps Engineer

2022 - 2023
St Jude Childrens Research Hospital - Bioinformatics
  • Created a Python framework to replace the existing Perl library to do the analysis runs.
  • Containerized the application using Singularity and used the existing data folders.
  • Developed the framework to be configuration-driven for easy introduction of new class types.
Technologies: Python, ETL, Azure, Linux, Azure Data Factory, DevOps, Azure DevOps, Perl, Bash, Bash Script

SRE Consultant

2021 - 2022
Toptal - Visma
  • Automated and rolled out Debezium and changed the data capture framework on GKE.
  • Automated the creation of resources using Terraform and Kubernetes manifest files, including CloudKarafka (a Kafka SAAS offering provider).
  • Created CI/CD pipelines using Spinnaker and set up monitoring using Datadog.
Technologies: Google Cloud Platform (GCP), Google Kubernetes Engine (GKE), Kubernetes, Terraform, Microsoft SQL Server, Debezium, Apache Kafka, Linux, Spinnaker, Datadog, Project Management, IT Project Management

DevOps Engineer

2020 - 2021
  • Automated the RabbitMQ rollout across multiple environments using Terraform.
  • Converted RabbitMQ and Jenkins from server-based to microservices using Kubernetes.
  • Created CI/CD pipelines in Jenkins to create the environments and deploy the applications seamlessly across multiple environments.
Technologies: Amazon Web Services (AWS), Amazon EKS, Helm, Kubernetes, Jenkins, RabbitMQ, Terraform, IT Project Management, Azure IaaS, Amazon CloudFront CDN, Amazon Aurora, Amazon EC2, Amazon RDS, Vulnerability Identification

DevOps Engineer for System Migration to IBM Cloud and AWS Cloud

2020 - 2020
  • Migrated the entire framework from bare metal to IBM and AWS.
  • Implemented hybrid cloud framework for DR and BCP, coded in such a way it can run across multiple regions in case of disaster recovery.
  • Implemented CI-CD pipelines in Jenkins, which will create and manage the deployments.
  • Implemented EBS NFS along with EFS solution to speed up the jobs and parallel processing has been used for modeling. Explored many DB solutions including DynamoDB, Redshift, Aurora, and more.
Technologies: Amazon Web Services (AWS), AWS Lambda, NFS, Amazon CloudWatch, Amazon EFS, Amazon EBS, Amazon S3 (AWS S3), IBM Cloud, Python 3, Project Management, IT Project Management, Amazon CloudFront CDN, Amazon EC2

Linux DevOps for Chromium Project

2020 - 2020
  • Built a framework to build Chromium source code using multiple nodes.
  • Set up CI/CD pipelines in Jenkins that will get triggered automatically when a commit has been made. It will pull and merge the custom changes to the original source code and build both for Linux and macOS variants.
  • Stored artifacts in AWS S3 and AWS EC2 instances were launched and terminated after the job has been done.
Technologies: Amazon Web Services (AWS), Linux, Chromium, MacOS, DevOps, Amazon S3 (AWS S3), IT Project Management, Amazon EC2

Azure DevOps with Security Focus

2020 - 2020
  • Architected and managed end-to-end application from cloud and DevOps point of view.
  • Implemented Terraform and Terragrunt and created many environments.
  • Implemented monitoring using Azure Monitor and prepared the solution for SOC 2 audit compliance.
  • Implemented proper segregation of subscriptions, groups.
  • Implemented and secured Azure Postgresql and integrated the same with application code through environment variables using Ad Ops.
Technologies: Microservices, Monitoring, Azure Blob Storage API, Azure Cosmos DB, Azure Key Vault, Terraform, Penetration Testing, Content Delivery Networks (CDN), Azure Logic Apps, Azure Functions, Azure SQL Databases, Azure DevOps, .NET, Vulnerability Identification, Vulnerability Scanning, Vulnerability Management, Cyber Defense, Cybersecurity

Azure DevOps

2020 - 2020
  • Implemented Azure ARM templates for IAC and created many ephemeral environments.
  • Implemented and developed many Azure Functions in Python, LogicApps, and built the whole ecosystem.
  • Architected the solution which gets triggered when an alert is detected on Azure monitor which triggers Logic App and fires up the connected Azure functions.
  • Contributed to the development of Azure functions that will interact with different databases and use Sendgrid to send emails to users to fix things according to standards.
Technologies: Monitoring, Azure Logic Apps, Python 3, Terraform, Terragrunt, Azure, .NET

Azure DevOps Using Kubernetes

2019 - 2020
  • Architected and implemented the AKS Kubernetes cluster from scratch.
  • Automated the creation of several environments using Infrastructure As Code/IAC(Terraform).
  • Deployed the whole solution using Azure DevOps pipelines.
  • Ensured the scaling of the cluster in real-time when more machine learning jobs were submitted.
Technologies: Azure DevOps, Distributed Computing, PostgreSQL, Django, Elasticsearch, Dask, Python 3, Azure Kubernetes Service (AKS), Kubernetes, Docker, Azure, .NET

Vice President

2015 - 2016
Bank of America Merrill Lynch
  • Developed ILM solution to be compatible with Dodd-Frank.
  • Implemented Ansible Tower to manage all the server estate.
  • Implemented ELK solution and Graphite, Grafana monitoring tools.
  • Architected the components to be highly available for DR and BC purposes.
  • Implemented the grid utilization tool which helped in saving millions of dollars on infrastructure spending.
Technologies: Heroku, Graphite, Grafana, Mercurial, Ansible, JavaScript, ELK (Elastic Stack), Cloud Foundry, Django, Python, Amazon EC2, Amazon RDS

Monitoring Team Lead

2011 - 2015
  • Monitored the whole of the online technology group.
  • Architected the solution by placing multiple monitoring collectors in each data center for HA and latency purposes.
  • Extended the Zenoss tool by coding custom Zenpacks for application monitoring.
  • Migrated the applications from data centres to AWS cloud.
  • Analyzed many monitoring tools to find which solution meets the requirements and better value for money.
  • Introduced the configuration management tools such as puppet and Ansible to manage 60 monitoring servers and around 3,500 servers used for different applications.
Technologies: Amazon Web Services (AWS), Ansible, Puppet, Monitoring, Linux, VMware, Confluence, Jira, Jenkins, ELK (Elastic Stack), Splunk, JavaScript, Load Balancers, Docker, MySQL, Twisted, Nagios, Zabbix, Zenoss, Python, Amazon CloudFront CDN, Amazon EC2, Amazon RDS

Data Center Migration (1,400 Servers)

Worked on a public sector digital transformation project where we migrated around 1,400 servers to the public cloud. Started with a lift and shift approach, but in the process we automated every aspect from creating images using Packer, creating infrastructure using Terraform, coding Ansible playbooks for configuring the servers, writing Jenkins groovy pipelines for CI/CD, storing the secrets in vault, using consul for service discovery, moving to serverless approach where feasible, converting monolithic applications to microservices, moved from pet to cattle model, containerization using Docker, Docker-Compose, using container orchestration tools such as Rancher, Docker Swarm, Kubernetes, and implemented and improved monitoring tools such as ELK, Zabbix, and Prometheus.

Architected, designed, and implemented the application, tooling, monitoring layers from infra and DevOps perspective to support the big data applications to be redundant, highly available, and scalable with blue-green/zero downtime deployments. As an end result, migrated around 21 environments successfully without compromising on security, adhering to CIS standards, and passing all ITHC.

Did many upgrades and POCs on hybrid cloud, PCF, and private clouds.

AKS Automated Zero Down Time Deployment

Created the Azure Kubernetes Service along with the whole eco-system such as resource groups, virtual networks, storage accounts, etc.., using Terraform and used Terragrunt to reuse the same code to deploy for different environments.

Setup the Azure DevOps build pipelines and release pipelines which will get triggered on developer commits and build the required docker images and push the same to Azure container registry with appropriate tags, generate the artifacts and pushed the same as part of rolling updates in release pipelines.

Enabled the logging, and used different patterns of deployment in Kubernetes as this is for AI/ML application. Used node pools and enabled autoscaling for the same and made sure for ML GPU workloads only one pod runs on a given node(with Nvidia drivers installed as part of Daemonset) and based on the resource limits spin up a new node for a new ML GPU workload pod.

Subscription-based Mobile Trading App

Architected and implemented a low latency trading platform that can execute a trading order in all the client accounts at the same time under 500ms. Used Kafka with many consumers one per user and single or limited producers. Complexity is to be able to make it compatible for all different types of accounts, designing an order management system.
2004 - 2006

Master's Degree in Information Technology

International Institute of Information Technology - Bangalore, India


Certified Kubernetes Administrator

The Cloud Native Computing Foundation


Atlassian Certified Confluence Administrator



Atlassian Certified System Administrator



Atlassian Certified Jira Service Desk Administrator


JUNE 2019 - JUNE 2021

Google Cloud - Professional Cloud Architect



Atlassian Certified Jira Administrator



Atlassian Certified Jira Administrator



AWS Certified SysOps Administrator - Associate



AWS Certified DevOps Engineer - Professional



ISTQB - BCS Certified Tester Foundation Level


APRIL 2017 - APRIL 2020

AWS Certified Solution Architect - Professional



Prince2 Agile Practitioner



Prince2 Foundation Certificate in Project Management



Microsoft Specialist: Implementing Microsoft Azure Infrastructure Solutions



Microsoft Specialist: Architecting



Microsoft Certified Solutions Developer: Azure Solutions Architect



Microsoft Certified Solutions Expert: Cloud Platform and Infrastructure



Microsoft Certified Solutions Associate: Cloud Platform


MARCH 2017 - APRIL 2020

AWS Certified Solution Architect - Associate



Python IKM Assessment



REST APIs, Jira REST API, Terragrunt, Node.js, React, jQuery, AWS Amplify, Dask, Azure Blob Storage API


AWS ELB, AWS CloudFormation, Amazon Simple Queue Service (SQS), Azure DevOps Services, Ansible, Docker Compose, Azure Kubernetes Service (AKS), Jira, Confluence, GitHub, Jenkins, Atlassian Suite, ELK (Elastic Stack), Zenoss, Zabbix, Subversion (SVN), GitLab CI/CD, TeamCity, Cucumber, Grafana, SonarQube, Terraform, Packer, Vault, Amazon CloudFront CDN, Amazon EKS, Kibana, VPN, NGINX, Logging, HashiCorp, AWS IAM, BigQuery, Helm, Logstash, Puppet, Docker Swarm, Google Kubernetes Engine (GKE), Bitbucket, Ansible Tower, Chef, SaltStack, Mesos, Splunk, CircleCI, BOSH, Kafka Streams, RabbitMQ, Nagios, AWS Glue, Tableau, Microsoft Power BI, Apache, AWS AppSync, Squid, Git, Apache Maven, Artifactory, Mercurial, VMware, AWS Fargate, Amazon Simple Notification Service (Amazon SNS), Azure Logic Apps, Azure Key Vault, Amazon EBS, Amazon CloudWatch, Amazon SageMaker, LoadRunner, Apache Sqoop, Kong, NVIDIA Grid SDK, Bamboo, Azure Machine Learning


Django, Selenium, Spark, Ruby on Rails (RoR), Windows PowerShell, Hadoop, React Native, .NET, Twisted, .NET Core


Python, Python 2, Python 3, YAML, SQL, Go, JavaScript, Ruby, Bash, C++, Java, PHP, HTML, CSS, TypeScript, Groovy, Perl, Bash Script


ETL, Data Science, Rapid Application Development (RAD), DevOps, Agile Software Development, Test-driven Development (TDD), Behavior-driven Development (BDD), PRINCE2, Continuous Integration (CI), Continuous Deployment, Continuous Delivery (CD), DevSecOps, Continuous Development (CD), ITIL, Service-oriented Architecture (SOA), Distributed Computing, Azure DevOps, Penetration Testing, Microservices


Amazon EC2, Docker, Kubernetes, Linux CentOS 7, CentOS 6, Ubuntu Linux, Alpine Linux, AWS Lambda, Google Cloud Platform (GCP), Azure, Amazon Web Services (AWS), Azure IaaS, Rancher, Heroku, VMware Tanzu Application Service (TAS) (Pivotal Cloud Foundry (PCF)), Oracle, MacOS, Azure Synapse, Ubuntu, CentOS, Windows, Linux, Apache Kafka, Azure Functions, DigitalOcean, AWS IoT, Spinnaker, Visual Studio Code (VS Code), Microsoft


Auto-scaling Cloud Infrastructure, On-premise, JSON, Amazon S3 (AWS S3), Elasticsearch, Azure Cloud Services, Data Pipelines, Redshift, MySQL, PostgreSQL, MongoDB, Redis, Memcached, Amazon Aurora, Amazon DynamoDB, MariaDB, Google Cloud, Azure SQL Databases, Azure Cosmos DB, Amazon EFS, Neo4j, Apache Hive, Microsoft SQL Server, Datadog

Industry Expertise

Project Management, Cybersecurity


Cloud Platforms, Cloud Infrastructure, Security, AWS DevOps, Infrastructure, Hybrid Cloud Infrastructure, Architecture, IT Governance, IT Infrastructure, GAP Analysis, Implementation, Product Roadmaps, Organization, Resource Allocation, APIs, Site Reliability, Site Reliability Engineering (SRE), Linux Server Administration, Web Scalability, Scalability, Software Architecture, Containers, Shell Scripting, AWS Certified DevOps Engineer, Serverless, Groovy Scripting, Lambda Functions, Cloud, Certified ScrumMaster (CSM), AWS Certified Solution Architect, AWS Certified SysOps Administrator, MCSA, MCSE | Private Cloud, Consul, Linux Administration, Web Application Firewall (WAF), SecOps, Sprint Planning, Sprint Retrospectives, Dashboard Development, Data Reporting, ETL Development, ETL Tools, Prometheus, Monitoring, IT Networking, Cloud Architecture, AWS Cloud Architecture, SSL Configurations, SSL, IT Security, Web App Security, Documentation, NFS, SMTP, Infrastructure Monitoring, Log Management, Network Monitoring, Jira Administration, Jira Administrator, Build Pipelines, CI/CD Pipelines, ITSM, IT Service Management (ITSM), ITIL V3 Foundation Certified, AWS Secrets Manager, Azure Data Factory, IT Project Management, Amazon RDS, Benchmark Analysis, Amazon Kinesis, Networking, Proxies, Ruby Gems, Kubernetes Operations (kOps), High Availability Disaster Recovery (HADR), Cloud Foundry, Big Data, Big Data Architecture, WebSockets, GPU Computing, Machine Learning, Writing & Editing, IBM Cloud, SOAP, Nomad, Azure Databricks, Vulnerability Identification, Vulnerability Scanning, Vulnerability Management, Cyber Defense, RHEL, SAP, Graphite, Load Balancers, Amazon API Gateway, Content Delivery Networks (CDN), Chromium, Apache Flume, Firewalls, SAP ERP, GitHub Actions, Debezium, Scaling

Collaboration That Works

How to Work with Toptal

Toptal matches you directly with global industry experts from our network in hours—not weeks or months.


Share your needs

Discuss your requirements and refine your scope in a call with a Toptal domain expert.

Choose your talent

Get a short list of expertly matched talent within 24 hours to review, interview, and choose from.

Start your risk-free talent trial

Work with your chosen talent on a trial basis for up to two weeks. Pay only if you decide to hire them.

Top talent is in high demand.

Start hiring