
HarmanjotSingh Bhatia
Verified Expert in Engineering
DevOps Developer
Ahmedabad, Gujarat, India
Toptal member since January 19, 2026
Harman is a Senior Cloud Architect and DevOps Lead with deep expertise in AWS and GCP cloud architecture, large-scale migrations, and platform engineering. He specializes in designing, rebuilding, and migrating production cloud environments with emphasis on security, reliability, scalability, and cost. Harman has hands-on experience leading end-to-end cloud migrations, while establishing production-grade CI/CD pipelines and Infrastructure as Code (IaC) for repeatable, auditable deployments.
Portfolio
Experience
- DevOps - 7 years
- Cloud Infrastructure - 6 years
- Amazon Web Services (AWS) - 6 years
- Technical Architecture - 6 years
- GCP DevOps - 6 years
- Cloud Architecture - 6 years
- Infrastructure as Code (IaC) - 5 years
- Terraform Cloud - 4 years
Preferred Environment
Kubernetes, GCP DevOps, Machine Learning Operations (MLOps), CI/CD Pipelines, GitHub, Cloud Architecture, AWS Cloud Architecture, AWS DevOps, Amazon Web Services (AWS), YAML Pipelines
The most amazing...
...thing I’ve done is architect a self-healing DevSecOps pipeline that enforces SOC 2 compliance through automated SAST, DAST, and SCA.
Work Experience
DevOps Specialist
Freelance Clients
- Executed Pulumi AWS lift and shift environment migration between traditional AWS accounts and AWS organizations with IAM identity centre.
- Implemented GCP security suite with Static/Dynamic Application Security Testing, Software composition Analysis, VAPT, Server hardening, etc.
- Migrated traditional CI/CD to a self-healing OCID, AWS CDK, and Pipeline using AWS Amplify service.
Solutions Architect (AI and Cloud) | Technical Lead
Creole Studios
- Architected and developed end-to-end, enterprise-grade AI solutions (including LLM-based agents and traditional computer vision models), leading projects from POC to end products.
- Led the technical design of scalable and resilient cloud architectures for complex AI/ML workloads, with a focus on establishing best practices for MLOps, scalability, and continuous improvement.
- Collaborated directly with teams of data scientists, software engineers, and product managers to align AI solutions with business objectives.
- Mentored junior engineers on AI/ML development best practices.
- Led cloud architecture and DevOps for multi-environment production workloads on AWS (and GCP where needed), standardizing IaC, CI/CD, and platform reliability.
Cloud Solution Architect
9Series
- Led and mentored a high-performing DevOps team, establishing the strategy for cloud infrastructure management, CI/CD, and automation across diverse client projects.
- Spearheaded the implementation of a comprehensive DevSecOps program, integrating security tooling into CI/CD pipelines, which reduced vulnerabilities by 40% and was instrumental in achieving SOC 2 type-2 compliance.
- Drove a minimum 20% reduction in overall infrastructure costs through rigorous benchmarking, architectural reviews, and strategic automation initiatives.
- Implemented observability by integrating metrics, logs, and traces using Prometheus, Grafana, OpenTelemetry, and ELK, and defined service-level objectives and alerting.
- Implemented CI/CD pipelines using GitHub Actions and IaC, incorporating plan and apply gates, policy checks, and artifact versioning.
Sr. DevOps Engineer/DevOps Engineer
Global Garner & MindzTeq Solutions
- Designed, implemented, and managed scalable cloud infrastructure on AWS and GCP, ensuring high availability and reliability for business-critical applications.
- Developed complex CI/CD pipelines incorporating blue-green deployments and zero-downtime rollbacks, increasing deployment success rates by 70%.
- Gained foundational experience in data engineering practices (ETL/ELT) and SRE principles, which informed subsequent architectural approaches to reliability and data handling.
Experience
Zero Downtime Blue-green Deployment Pipeline
I implemented blue-green deployment strategies using project-specific options, including AWS ALB, Kubernetes, and Elastic IPs, to ensure zero downtime and manage potential rollbacks. As a result of my efforts, we achieved zero downtime during deployments, improved deployment stability, and reduced rollback incidents by 60%.
Serverless RAG-based Q&A System for Unstructured Data
Key technologies: RAG, Amazon Bedrock, AWS Lambda, S3, Pinecone, FastAPI, Titan Embeddings.
Multi-agent Customer Support System
Self-healing CI/CD Pipelines
DevSecOps Pipeline Implementation
One-click SaaS Environment Pipeline
Unified Infrastructure Observability Stack
Logistics Optimization Bot with Hybrid Data Access
Intranet Deployment for Cloud Environments
Multi-code Repo Hosting and Deployment Pipeline
Automated Application Testing Implementation
AI-powered Media Intelligence & PR Opportunity Platform
Cloud Deployment and Operations of Moodle LMS on AWS
I prepared automated cloud installers and provisioned the environment on Linux-based Amazon EC2 instances with database services on RDS.
Responsibilities included environment setup, Moodle deployment, database migration, and debugging application issues during rollout.
Operational management and troubleshooting were performed using Moodle CLI tools, where some includes commands like:
• php admin/cli/upgrade.php –non-interactive
• php admin/cli/purge_caches.php
• php admin/cli/cron.php
• mysqldump -u moodleuser -p moodledb > moodle_backup.sql
I also handled file permissions for the Moodle data directory, plugin compatibility checks, and performance troubleshooting during the migration process.
Healthcare Staffing SaaS Platform
Solution: Led the full migration from Terraform to Pulumi V3, re-platforming the entire production stack onto a clean, dedicated AWS account. Implemented an active-passive multi-region disaster recovery strategy using Route 53 health checks and EventBridge-orchestrated failover. Deployed edge-based WAF v2 protection on CloudFront and secured cross-account data migration (150+ GB) using KMS re-encryption and AWS DMS with change data capture (CDC) for near-zero downtime cutover.
Tech stack: Pulumi V3, AWS ECS Fargate, RDS PostgreSQL 17.6, CloudFront, AWS WAF v2, Route 53, AWS DMS, AWS Transfer Family, IAM Identity Center, KMS, SQS FIFO, Python.
Outcome: Reduced monthly infrastructure spend by $2,921 ($35,000+ annually). Achieved a 5–15 minute RTO with automated failover. Completed a zero-drift migration of 153+ managed resources with 0 unplanned downtime.
Education
Bachelor's Degree in Computer Science
Ahmedabad Institute of Technology - Ahmedabad, India
Certifications
Google AI Professional
Google Career Certificates
Responsible AI With Amazon Bedrock
A Cloud Guru | A Pluralsight Company
Azure AI Engineer Associate (AI-102): Azure AI Fundamentals, Planning, and Management
A Cloud Guru | A Pluralsight Company
Deploying Applications with AWS CDK
A Cloud Guru | A Pluralsight Company
Cloud AI Security Principles
A Cloud Guru | A Pluralsight Company
AWS Certified Solutions Architect – Associate
Amazon Web Services
IBM Full-stack Software Developer
Coursera
Skills
Libraries/APIs
Node.js, REST APIs, React, Playwright, Newman, NumPy, Pandas
Tools
GitLab CI/CD, Terraform, GitLab, GitHub, AWS Deployment, Amazon CloudWatch, Amazon EKS, Jenkins, Helm, NGINX, Kubernetes HorizontalPodAutoscaler (HPA), Amazon Elastic Container Service (ECS), Amazon CloudFront, AWS CloudFormation, IBM MQ, Grafana, Sentry, AWS IAM, Moodle, AWS Fargate, Amazon ElastiCache, Google Kubernetes Engine (GKE), Docker Compose, Claude, Cloud Development Kit for Terraform (CDKTF), Shell, Amazon SageMaker, Prefect, Jira, Git, Amazon Elastic Container Registry (ECR), Azure App Service, AWS Cloud Development Kit (CDK), Observability Tools, ELK (Elastic Stack), Grafana k6, Pingdom, CircleCI, Postman, Vitest, AWS ELB, Istio, Codex, Kong, Dynatrace, Splunk, Ansible
Languages
Python, SQL, JavaScript, TypeScript, Go, Bash, Groovy, Rust, CSS, HTML, Java, PHP
Paradigms
DevOps, Role-based Access Control (RBAC), Continuous Delivery (CD), Azure DevOps, Automation, Testing, HIPAA Compliance, DevSecOps, Unit Testing
Platforms
AWS IoT, Kubernetes, Amazon Web Services (AWS), Azure, Linux, Google Cloud Platform (GCP), Docker, AWS Lambda, Vercel, Amazon EC2, DigitalOcean, Langfuse, Cloud Native, AWS ALB, Firebase, LangSmith, Red Hat OpenShift, Vertex AI, Apache Kafka, Kubeflow, OpenShift, NVIDIA CUDA, Apigee X, Jakarta EE (Java EE or J2EE)
Storage
NoSQL, On-premise, MongoDB, Redis, PostgreSQL, MySQL, Microsoft SQL Server, Datadog, MariaDB, Amazon S3 (AWS S3)
Frameworks
Next.js, Windows PowerShell, Django, Spring, Bedrock, LangGraph, Selenium, Jest, Flux, Hibernate, SST
Industry Expertise
Cybersecurity, Healthcare
Other
Machine Learning Operations (MLOps), Machine Learning, Scripting, CI/CD Pipelines, Cloud Architecture, Cloud Infrastructure, Infrastructure, AWS DevOps, DevOps Engineer, Artificial Intelligence (AI), GitHub Actions, Amazon Bedrock, Large Language Models (LLMs), Configuration Management, Software Development Lifecycle (SDLC), Networking, Virtual Private Cloud (VPC), Site Reliability Engineering (SRE), AWS Certified Solution Architect, Linux Administration, Performance, Disaster Recovery (DR), YAML Pipelines, Domain DNS Setup, IT Infrastructure, Web Hosting, Code Review, DevOps Automation, AI Voice Agents, AI Research, Debugging, GCP DevOps, Programming, Software Development, Technical Architecture, Infrastructure as Code (IaC), Full-stack, Azure Cloud Security, Cloudflare, Microsoft Azure, Security, Terraform Cloud, SOC 2, Content Delivery Networks (CDN), SSL Configurations, Debugging Tools, Disaster Recovery Plans (DRP), Middleware, Agile DevOps, Data Engineering, Cloud, Virtualization, Containerization, Amazon RDS, AWS Auto Scaling, AWS Cloud Security, AWS Cloud Operations, Amazon Machine Learning, Virtual Machines, DataOps, Server Optimization, Podman, Transport Layer Security (TLS), AI Architecture, Prompt Engineering, Disaster Recovery Automation, Monitoring, APIs, API Gateways, Migration, Distributed Systems, Domain Migration, Architecture, AI Agent Orchestration, Cursor AI, AI Agents, AI Automation, Agentic AI, System Architecture, Data Annotation, Voice Activity Detection (VAD), Speech Analytics, Natural Language Processing (NLP), HIPAA, Supabase, TimescaleDB, Incident Response, Healthcare Software, Healthcare Services, IT Security, Learning Management Systems (LMS), AI Engineering, Agentic AI Systems, Full-stack Development, OpenTelemetry, GitOps, elastic ip, Computer Science, Blue-green Deployment, Agentic RAG Systems, Pinecone, FastAPI, Titan Embeddings, Cohere Embeddings, AWS Cloud Architecture, Ai Guardrails, azure ai, Generative Artificial Intelligence (GenAI), OpenAI, AWS Certified Cloud Practitioner, OpenID Connect (OIDC), Prometheus, VAPT, Vulnerability Triage, LogRocket, AWS ECS Fargate, Azure Stack, Akamai, Anthos, Platform Engineering, Shell Scripting, Argo CD, MLflow, AI Model Training, Responsible AI, Large-scale Projects, General Data Protection Regulation (GDPR), LangChain, Voice Analysis, Speech Recognition, Pulumi
How to Work with Toptal
Toptal matches you directly with global industry experts from our network in hours—not weeks or months.
Share your needs
Choose your talent
Start your risk-free talent trial
Top talent is in high demand.
Start hiring