
Ram Bitra
Verified Expert in Engineering
DevOps and DevSecOps Developer
Maple Shade Township, NJ, United States
Toptal member since November 27, 2024
Ram is a site reliability engineer (SRE) with over 10 years of experience in automating, deploying, and managing scalable infrastructure across cloud and on-premise environments. Skilled in CI/CD pipelines, cloud platforms (AWS, GCP, Azure), container orchestration (Kubernetes, Docker), and configuration management (Ansible, Terraform, Chef), he enhances system reliability, performance, and scalability. Ram is also experienced in monitoring tools, including Splunk, DataDog, and Dynatrace.
Portfolio
Experience
- Python - 8 years
- CI/CD Pipelines - 8 years
- Amazon Web Services (AWS) - 7 years
- Kubernetes - 7 years
- Crossplane - 6 years
- Terraform - 6 years
- Argo CD - 6 years
- Grafana - 6 years
Availability
Preferred Environment
Linux, Windows, PyCharm, Visual Studio Code (VS Code), Jira, Amazon Web Services (AWS), Google Cloud Platform (GCP), Azure, Argo CD, Crossplane
The most amazing...
...project I've undertaken reduced infrastructure costs by 30% and deployment time by 40%, allowing the team to roll out new features faster.
Work Experience
Senior Site Reliability Engineer
TD Bank Group
- Managed weekly deployments using Jenkins and Spinnaker while leveraging Splunk and Grafana for real-time monitoring and troubleshooting of Kubernetes environments.
- Designed and implemented dynamic scaling strategies across AWS, GCP, and Azure, ensuring optimal resource utilization and system performance.
- Defined and managed SLOs and automated recovery processes using Go, enhancing system resilience and minimizing downtime.
- Developed automation tools and processes to manage incidents, ensuring rapid recovery and minimizing system downtime using Dynatrace and Datadog for proactive performance monitoring.
- Deployed machine learning models using Vertex AI for predictive analytics and anomaly detection, integrating real-time data processing with Dataflow to optimize business decisions.
- Leveraged Terraform and AWS CloudFormation to automate the provisioning and management of cloud infrastructure, ensuring consistency and scalability across environments.
- Conducted in-depth analysis and optimization of system performance metrics, including CPU, memory, and network usage, to ensure efficient resource allocation and application stability.
- Integrated Crossplane and Argo CD to achieve GitOps-driven infrastructure provisioning and application deployment.
- Automated the creation of Crossplane providers, composites, and CRDs using GitOps pipelines. Used Crossplane composition revisions to version and update resources automatically whenever there was a change in the underlying definitions.
- Used Argo CD to manage Helm-based deployments and Kustomize overlays for environment-specific configurations. Developed developer self-service for infrastructure provisioning using Crossplane.
Site Reliability and DevOps Engineer
Elevance Health
- Built and maintained CI/CD pipelines using Jenkins, GitHub, and Ansible to streamline deployment processes.
- Designed and deployed scalable cloud infrastructure on AWS, GCP, and Azure with Terraform and ARM templates.
- Automated data ingestion and transformation pipelines using BigQuery, Google Cloud Dataflow, and Google Cloud Functions.
- Deployed ML models using Vertex AI for predictive analytics and anomaly detection.
- Leveraged Dynatrace, Datadog, and Google Cloud Monitoring for proactive system performance management.
- Managed Kubernetes clusters for scalable and reliable application deployments.
- Implemented GitOps workflows using Argo CD to manage Kubernetes manifests and ensure continuous reconciliation of cluster state.
- Configured Crossplane to provision cloud infrastructure (AWS, GCP, Azure) as declarative YAML manifests. Automated the end-to-end lifecycle of cloud resources with a CI/CD pipeline using GitHub Actions, Jenkins, and GitLab CI to update Crossplane.
- Used Crossplane to automatically provision environments for development, QA, and production as part of a continuous delivery pipeline. Monitored Crossplane controllers and resource provisioning workflows using Argo CD dashboards and Prometheus.
- Troubleshot provider connectivity issues, failed resource creation, and drift detection errors in Crossplane using Crossplane logs and Kubernetes events.
Site Reliability and DevOps Engineer
Cox Automotive
- Designed and automated CI/CD pipelines using Jenkins, Git, Maven, and Ant for continuous integration and deployment.
- Managed containerized applications with Docker and Kubernetes, ensuring efficient scaling and high availability.
- Configured and maintained AWS and GCP resources like Amazon EC2, Amazon S3 (AWS S3), and Amazon RDS using Terraform.
- Implemented monitoring with Splunk, Dynatrace, and Datadog to track system health and automated incident response.
- Leveraged ELK (Elastic Stack) for centralized log management and real-time analysis.
- Automated server provisioning and application deployments with Ansible and Chef for streamlined operations.
- Deployed and optimized Java applications on JBoss and Apache Tomcat servers, integrating REST APIs for seamless back-end communication.
- Monitored the state of Argo CD syncs and Crossplane-managed infrastructure to ensure alignment with the desired state. Set up Argo CD and Crossplane from scratch in a production environment, enabling GitOps-driven control for Kubernetes and cloud.
- Configured Crossplane to provision cloud infrastructure (AWS, GCP, Azure) as declarative YAML manifests. Monitored sync status and health checks using Argo CD UI and Grafana dashboards.
- Used Crossplane Composition (XRDs and XRCs) to define infrastructure as blueprints, enabling on-demand provisioning of complete cloud environments.
DevOps Engineer and Linux Engineer
HCL
- Developed and managed CI/CD pipelines using Jenkins, Git, and Maven, automating build, test, and deployment processes.
- Provisioned and optimized cloud resources on AWS and Azure using Terraform and AWS CloudFormation.
- Leveraged Docker for containerization and Kubernetes for orchestrating scalable, resilient infrastructure.
- Automated infrastructure configurations using Puppet and Chef, creating and maintaining cookbooks and modules.
- Implemented Nagios, Splunk, and Amazon CloudWatch for proactive monitoring, alerting, and incident management.
- Leveraged ELK (Elastic Stack) for log aggregation and performance troubleshooting.
- Implemented AWS IAM roles and security best practices, ensuring compliance and secure cloud access.
- Wrote Shell, Python, and Windows PowerShell scripts to automate operational tasks, system configurations, and cloud management.
Experience
Site Reliability Engineering for Cloud Infrastructure at TD Bank
Cloud Infrastructure Optimization and ML Deployment for Elevance Health
GitOps-driven Multi-cloud Infrastructure Management with Argo CD and Crossplane
Education
Master's Degree in Computer Science
Wilmington University - Wilmington, DE, USA
Skills
Libraries/APIs
GCM
Tools
Terraform, PyCharm, IntelliJ IDEA, Slack, Amazon EKS, Jira, Git, Grafana, Ansible, Jenkins, Splunk, Azure Monitor, WildFly, Dynatrace, GitHub, Apache Maven, BigQuery, ANTs, TFS, ELK (Elastic Stack), Chef, AWS IAM, AWS CloudFormation, Puppet, Nagios, Amazon CloudWatch, Shell, Cloud Dataflow, Google Compute Engine (GCE), Looker, Helm, GitLab CI/CD, HashiCorp, HashiCorp Vault
Languages
Python, Bash, SQL, Go
Frameworks
Crossplane, .NET, Windows PowerShell
Paradigms
DevOps, DevSecOps, Automation
Platforms
Google Cloud Platform (GCP), Amazon Web Services (AWS), Azure, Kubernetes, Docker, Linux, Windows, Spinnaker, New Relic, AWS Lambda, Nexus, Vertex AI, Amazon EC2, JBoss, OpenShift, Visual Studio Code (VS Code)
Storage
Datadog, Amazon S3 (AWS S3)
Other
CI/CD Pipelines, GitHub Actions, Security, Argo CD, Debugging, Troubleshooting, Prometheus, Monitoring, SAP System Landscape Optimization (SLO), Google Cloud Dataflow, Shell Scripting, Amazon RDS, Azure Resource Manager (ARM), Policy as code (PaC), Large Language Model Operations (LLMOps), ARM, Virtual Private Cloud (VPC), RHEL, Computer Science
How to Work with Toptal
Toptal matches you directly with global industry experts from our network in hours—not weeks or months.
Share your needs
Choose your talent
Start your risk-free talent trial
Top talent is in high demand.
Start hiring