Vikas Aggarwal, Developer in Chandigarh, India

Vikas Aggarwal

Verified Expert in Engineering

DevOps Developer

Location
Chandigarh, India
Toptal Member Since
March 30, 2020

Vikas is a DevOps engineer with more than eight years of experience. He is an expert in cloud, container, and Kubernetes platforms, has a strong automation background, and is a firm believer in an automation-first approach. Vikas is an excellent debugger and problem solver who has helped clients such as BrowserStack, Helpshift, and Booking.com scale their apps up to 100-fold. His clients have consistently praised his exceptional problem-solving skills.

Portfolio

Self-employed
GitLab CI/CD, Site Reliability, Terraform, Amazon EC2, Redis, Vault, Kubernetes...
Helpshift Technologies
Site Reliability, Amazon EC2, Redis, Vault, Kubernetes, MongoDB...
BrowserStack
Terraform, Amazon EC2, MongoDB, Amazon Web Services (AWS), Python, DevOps...

Experience

Availability

Part-time

Preferred Environment

Zoom, BlueJeans, Google Docs, Confluence, Jira, Slack, Vim Text Editor, Ubuntu

The most amazing...

...thing I've done is save my client money while boosting performance by implementing custom layered zonal caching.

Work Experience

DevOps Leader

2019 - PRESENT
Self-employed
  • Migrated infrastructure from cloud to on-premises, including large-sized databases and multi-stage pipelines.
  • Designed and coded ML pipelines to aid ML apps with data ingestion, model training, and model serving.
  • Brought infrastructure under version control as code using Terraform.
  • Deployed and managed a multi-tenant Kubernetes cluster with support for disposable clusters.
  • Managed and improved production build and deployment pipelines for durability and resumability.
  • Designed and implemented a centralized logging solution for multi-account AWS infrastructure.
  • Implemented auto-scaling for RDS databases with AZ control and a cache layer to achieve both a performance boost and cost savings.
  • Performed live, zero-downtime DB migrations for PostgreSQL, Redis, and MongoDB as part of cloud migrations.
  • Created the automation framework for teams to keep the code deployment smooth and reliable.
Technologies: GitLab CI/CD, Site Reliability, Terraform, Amazon EC2, Redis, Vault, Kubernetes, MongoDB, Amazon Web Services (AWS), Python, DevOps, Ansible, Amazon Route 53, Docker, GitLab, Amazon CloudWatch, Amazon EKS, AWS Elastic Beanstalk, Amazon Elastic Container Service (Amazon ECS), AWS Lambda, Envoy Proxy, Packer, NGINX, Site Reliability Engineering (SRE), Grafana, Amazon Virtual Private Cloud (VPC), Amazon S3 (AWS S3), Amazon CloudFront CDN, CI/CD Pipelines, Prometheus, GitHub Actions, Istio, WireGuard, Oracle Cloud, Containers, Helm, K3s, RabbitMQ, DNS, Machine Learning Operations (MLOps), Prefect, Bash, Linux, Elasticsearch, GPU Computing, Apache Kafka, Cloud Migration, Database Migration, Buildah, Podman, Kubernetes Operations (kOps), Infrastructure as Code (IaC), Microservices, Serverless Architecture, REST, REST APIs, Amazon RDS, Git, DevSecOps, Amazon Firewall, AWS DevOps, Cloud Services, Unix, Linux Server Administration, Resource Management, Networking, Linux Administration, Autoscaling, DevOps Engineer, Shell Scripting, MySQL, Datadog, CircleCI, AWS Cloud Architecture, AWS ELB, Amazon Aurora, Security, IT Infrastructure, Architecture, Cloud Security, CloudOps, AWS CLI, AWS IAM, Cloud Architecture, Redshift, Cloud Native, Azure DevOps, Continuous Deployment, IaaS, Elastic, Infrastructure, Azure Kubernetes Service (AKS), Ansible Tower

Production Engineer II

2017 - 2019
Helpshift Technologies
  • Paid down tech debt through system upgrades, auto-failover for database systems, and immutable infrastructure.
  • Led a Kubernetes project and helped developers shift 80% of the workload from a monolith to SOA.
  • Scaled MongoDB infrastructure to support a ten-terabyte data set used by more than 90% of production applications.
  • Implemented caching for passive workloads to offload DB pressure.
  • Debugged issues with Azure cloud engineers on Azure’s end.
  • Implemented a highly performant job scheduler using Jenkins.
  • Managed Hadoop infrastructure and implemented Druid to support the data engineering team in their experiments.
Technologies: Site Reliability, Amazon EC2, Redis, Vault, Kubernetes, MongoDB, Amazon Web Services (AWS), Python, DevOps, Ansible, Amazon Route 53, Consul, PostgreSQL, Sensu, Continuous Integration (CI), Jenkins Job DSL, Pipeline Programming, Jenkins, Azure, HAProxy, Docker, Vim Text Editor, NGINX, Site Reliability Engineering (SRE), Grafana, Amazon Virtual Private Cloud (VPC), Amazon S3 (AWS S3), Amazon CloudFront CDN, Jenkins Pipeline, Infrastructure as Code (IaC), Microservices, REST, REST APIs, Amazon RDS, Git, Amazon Firewall, Cloud Services, Unix, Linux Server Administration, Resource Management, Linux, Linux Administration, Autoscaling, DevOps Engineer, Shell Scripting, MySQL, SQL, Datadog, AWS Cloud Architecture, AWS ELB, Amazon Aurora, Security, IT Infrastructure, Architecture, Cloud Security, CloudOps, AWS CLI, AWS IAM, Cloud Architecture, Cloud Native, Continuous Deployment, IaaS, Elastic, Infrastructure, Azure Kubernetes Service (AKS), Ansible Tower

Infrastructure Engineer

2015 - 2017
BrowserStack
  • Contributed to system design and architecture and created the infrastructure from scratch.
  • Supported the company growth from 1,000 rpm to 100,000 rpm in one year.
  • Implemented a real-time monitoring system for more than 6,000 servers deployed in the hybrid cloud across six different geographically separated locations.
  • Developed a tool for controlled, reversible fleet-wide deployments with a feedback loop and CI hooks.
  • Implemented a highly scalable instrumentation system to record application and system metrics to support real-time system and application profiling.
  • Automated hybrid cloud management using Ansible, heavily customized for a specialized use case.
  • Developed and managed extensive in-house tooling for housekeeping, third-party integrations, and a disposable dev environment.
  • Started an SOA initiative and delivered 40% of workloads in a well-managed, self-designed microservices framework.
Technologies: Terraform, Amazon EC2, MongoDB, Amazon Web Services (AWS), Python, DevOps, Ansible, Amazon Route 53, Nagios, SmokePing, Go, Grafana, Google Cloud Platform (GCP), Graphite, InfluxDB, Docker, Site Reliability, Vim Text Editor, Site Reliability Engineering (SRE), Amazon Virtual Private Cloud (VPC), Amazon S3 (AWS S3), SOX Compliance, Infrastructure as Code (IaC), Microservices, REST, REST APIs, Amazon RDS, Git, Amazon Firewall, Cloud Services, Unix, Linux Server Administration, Resource Management, Networking, Linux, Linux Administration, Autoscaling, DevOps Engineer, Shell Scripting, MySQL, SQL, AWS Cloud Architecture, AWS ELB, Amazon Aurora, Security, IT Infrastructure, Architecture, Cloud Security, React, CloudOps, AWS CLI, AWS IAM, Cloud Architecture, Cloud Native, Laravel, Continuous Deployment, IaaS, Elastic, Infrastructure, Ansible Tower

DevOps Intern

2013 - 2013
Healthkart
  • Created a tool to load-test and benchmark applications with near-realistic user traffic.
  • Helped the team automate infrastructure provisioning.
  • Designed a graphing tool to monitor service health using Node.js and Highcharts, integrated with HAProxy, vmstat, and Linux probes.
Technologies: DevOps, KVM, Vagrant, Highcharts, HAProxy, Node.js, Vim Text Editor, Amazon Virtual Private Cloud (VPC), REST, REST APIs, Git, Cloud Services, Unix, Linux Server Administration, Linux, DevOps Engineer, Shell Scripting, IT Infrastructure, AWS CLI, AWS IAM, Laravel, Infrastructure

Ansible 2 Cloud Automation Cookbook

https://www.oreilly.com/library/view/ansible-2-cloud/9781788295826/
• Authored the "Ansible 2 Cloud Automation Cookbook."
• The book is a guide for people learning Ansible and covers recommended ways to manage and automate cloud infrastructure.
• It was published by Packt Publishing in 2018.
Tech Stack: Ansible

Deployment Tool for Kubernetes Application

Developed a tool to deploy applications on the Kubernetes platform. Kubectl, the default tool, does not have a built-in controlled deployment strategy, which is required for every business-critical application. This tool allowed developers to deploy applications in a more guarded and controlled fashion.
Tech Stack: Python, Kubectl, Git
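
As a rough illustration of what a guarded rollout can look like, the sketch below deploys in small batches, checks health after each batch, and rolls everything back on the first failure. The function name and the `deploy`, `healthy`, and `rollback` callables are hypothetical placeholders, not the actual tool's interface.

```python
from typing import Callable, List, Sequence


def controlled_rollout(
    targets: Sequence[str],
    deploy: Callable[[str], None],
    healthy: Callable[[str], bool],
    rollback: Callable[[str], None],
    batch_size: int = 2,
) -> bool:
    """Deploy batch by batch; undo every deployed target if a batch is unhealthy.

    All callables are hypothetical hooks supplied by the caller.
    """
    done: List[str] = []
    for start in range(0, len(targets), batch_size):
        batch = list(targets[start:start + batch_size])
        for target in batch:
            deploy(target)
            done.append(target)
        # Feedback step: stop and revert as soon as a batch fails its check.
        if not all(healthy(target) for target in batch):
            for target in reversed(done):
                rollback(target)
            return False
    return True
```

The caller decides what "deploy" and "healthy" mean (for example, applying a manifest and polling a readiness endpoint); the sketch only captures the batching and rollback order.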

Service Level Objectives/Indicators Framework

I developed a framework, covering the languages officially supported in the company, that generates default SLOs and SLIs from a minimal YAML config. The idea was to encourage teams to populate their SLIs and to help the SREs on each team maintain healthy SLOs.
Tech Stack: Perl, Python, Graphite
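
To make the "defaults plus minimal config" idea concrete, here is a minimal sketch that assumes the YAML has already been parsed into a dictionary. The default values, key names, and function names are hypothetical and are not taken from the actual framework.

```python
from typing import Any, Dict

# Hypothetical defaults applied to every service unless a team overrides them.
DEFAULT_SLO: Dict[str, Any] = {
    "availability_target": 0.999,
    "latency_p99_ms": 500,
    "window_days": 30,
}


def build_slo(service_config: Dict[str, Any]) -> Dict[str, Any]:
    """Fill in defaults for whatever the team's minimal config omits."""
    slo = {**DEFAULT_SLO, **service_config.get("slo", {})}
    slo["service"] = service_config["service"]
    return slo


def error_budget_minutes(slo: Dict[str, Any]) -> float:
    """Minutes of allowed unavailability within the SLO window."""
    window_minutes = slo["window_days"] * 24 * 60
    return window_minutes * (1.0 - slo["availability_target"])
```

With this shape, a team only needs to name its service and list the values that differ from the defaults; everything else is inherited.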

Auto Failover Service

I developed a service that fails customers over to a nearby data center when network issues occur. Colocation hosting networks can be unreliable at times, which hurts the customer experience; the service seamlessly reconnects the user to a nearby data center provider.
Tech Stack: Python, a couple of integrations with the application.

Uptime Monitoring Service

Created an internal service to display the uptime of a product based on a weighted error calculation. The tool was built on top of the instrumentation system and used a decision tree to report up/down status every minute.

Tech Stack: Python, Go, InfluxDB, Redis, Status.io
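
For illustration only, a weighted error calculation of this kind could be sketched as follows. The component names, weights, and the 5% threshold are hypothetical, and a single threshold stands in for the decision tree used by the real service.

```python
from typing import Dict

# Hypothetical per-component weights: how much each component's errors
# count toward overall product health. Not taken from the real service.
WEIGHTS: Dict[str, float] = {"api": 0.5, "dashboard": 0.3, "docs": 0.2}


def weighted_error_rate(error_rates: Dict[str, float],
                        weights: Dict[str, float] = WEIGHTS) -> float:
    """Combine per-component error rates (0.0 to 1.0) into one weighted rate."""
    total_weight = sum(weights.values())
    weighted_sum = sum(weights[name] * error_rates.get(name, 0.0)
                       for name in weights)
    return weighted_sum / total_weight


def minute_status(error_rates: Dict[str, float], threshold: float = 0.05) -> str:
    """Report "up" or "down" for one minute of measurements."""
    return "down" if weighted_error_rate(error_rates) > threshold else "up"
```

Weighting lets a failure in a critical component mark the product as down while the same error rate in a minor component does not.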

Access Management Tool

Developed an internal access management tool that was deeply integrated with every internal and external component used by the company to provide on-demand access. As the firm grew, granting access to critical and non-critical services had become hard to control. With this system, everyone from the CTO to the intern was granted access on demand.

Tech Stack: Python, Perl, LDAP, a couple of integrations

Languages

Python, Bash, SQL, Go, Ruby, Perl, Python 3, C

Libraries/APIs

Jenkins Job DSL, REST APIs, Jenkins Pipeline, Node.js, Highcharts, React

Tools

GitLab, Ansible, Terraform, Amazon Virtual Private Cloud (VPC), Amazon CloudFront CDN, Vagrant, GitLab CI/CD, NGINX, Git, Amazon Firewall, CircleCI, AWS ELB, CloudOps, AWS CLI, AWS IAM, Elastic, Azure Kubernetes Service (AKS), Nagios, Amazon Elastic Container Service (Amazon ECS), Amazon EKS, Packer, Envoy Proxy, Vault, GitHub, Istio, WireGuard, Helm, RabbitMQ, AWS Fargate, Ansible Tower, Vim Text Editor, Slack, Jira, Confluence, Google Docs, Zoom, Grafana, SmokePing, Jenkins, Amazon CloudWatch, Statuspage.io

Paradigms

Continuous Integration (CI), DevOps, Microservices, REST, Azure DevOps, Continuous Deployment, Serverless Architecture, DevSecOps, Pipeline Programming

Platforms

Azure, Kubernetes, Docker, AWS Lambda, Amazon EC2, Linux, Amazon Web Services (AWS), Cloud Native, Google Cloud Platform (GCP), Ubuntu, CentOS, Apache Kafka, Unix, Sensu, AWS Elastic Beanstalk, KVM

Storage

Redis, Amazon S3 (AWS S3), MySQL, MongoDB, PostgreSQL, Oracle Cloud, Elasticsearch, Google Cloud, Database Migration, Datadog, Amazon Aurora, Redshift, InfluxDB, Database Management Systems (DBMS)

Other

Amazon Route 53, Site Reliability Engineering (SRE), CI/CD Pipelines, Machine Learning Operations (MLOps), GitHub Actions, Containers, Buildah, Podman, Infrastructure as Code (IaC), Amazon RDS, AWS DevOps, Cloud Services, Linux Server Administration, Resource Management, Autoscaling, DevOps Engineer, Shell Scripting, AWS Cloud Architecture, IT Infrastructure, Architecture, Cloud Architecture, IaaS, Infrastructure, Consul, HAProxy, Site Reliability, SOX Compliance, Prometheus, K3s, Cloud Migration, GPU Computing, Linux Administration, Access Control, Clustering, User Permissions, Kubernetes Operations (kOps), Computer Networking, Networking, Security, Cloud Security, BlueJeans, Graphite, DNS, Prefect, Operating Systems, Data Structures, Algorithms, LDAP

Frameworks

gRPC, Laravel

Education

2011 - 2015

Bachelor of Technology Degree in Computer Science

The LNMIIT - India
