Prasanna Venkataraman, Developer in Chennai, Tamil Nadu, India
Prasanna is available for hire
Hire Prasanna

Prasanna Venkataraman

Verified Expert  in Engineering

DevOps Engineer and Developer

Chennai, Tamil Nadu, India

Toptal member since April 14, 2020

Bio

Prasanna is a DevOps, SRE, and full-stack engineer with 14+ years of experience building scalable, reliable, and secure systems. A quick learner and agile practitioner, he thrives in dynamic environments, delivering solutions aligned with industry best practices. With expertise in Kubernetes, cloud infrastructure (AWS, GCP), and monitoring systems, Prasanna's key accomplishments involve migrating services to containerized environments and developing fault-tolerant systems leveraging CNCF tools.

Portfolio

HireSRE AI
Large Language Models (LLMs), LangChain, Python 3, React, OpenAI API, AI Agents...
Pi42 Inc
Agile Software Development, Kubernetes, Google Kubernetes Engine (GKE)...
Dacio AI
Kubernetes, Amazon Web Services (AWS), GitHub Actions, Prometheus, Grafana...

Experience

  • Docker - 7 years
  • Python - 7 years
  • Amazon Web Services (AWS) - 7 years
  • Kubernetes - 6 years
  • Infrastructure as Code (IaC) - 5 years
  • PostgreSQL - 5 years
  • Terraform - 5 years
  • Prometheus - 3 years

Availability

Part-time

Preferred Environment

Amazon Web Services (AWS), Kubernetes, Go, Terraform, Google Cloud Platform (GCP), Docker, Python 3, TypeScript

The most amazing...

...thing I've set up was an infrastructure using Kubernetes that handles millions of transactions for one of the biggest consulting service companies.

Work Experience

AI & Infrastructure Engineer

2024 - 2024
HireSRE AI
  • Built an on-call infrastructure support platform leveraging GenAI models, including Llama and GPT, and implemented a retrieval-augmented generation (RAG) system using PineconeDB to enhance information retrieval and response accuracy.
  • Configured and deployed GenAI models to support on-premise AI setups using Llama, ensuring secure and efficient integration within client environments.
  • Developed AI agents using LangChain and LangGraph to proactively monitor infrastructure, enabling automated issue detection and first-level resolutions.
  • Implemented AgenticOps to monitor LLM models, tracking token usage and call costs, optimizing model performance and cost efficiency for production-grade applications.
Technologies: Large Language Models (LLMs), LangChain, Python 3, React, OpenAI API, AI Agents, Bash, Shell

Lead DevOps Engineer

2024 - 2024
Pi42 Inc
  • Migrated services from a virtual machine-based system to Kubernetes-based infrastructure, achieving improved scalability, operational efficiency, and reduced maintenance overhead.
  • Developed and deployed a dynamic rate-limiting system to throttle requests based on user plans, enhancing user experience while maintaining system reliability and fairness.
  • Managed infrastructure supporting $100,000 worth of transactions daily, ensuring high availability, performance, and compliance with operational standards.
  • Engineered a resilient, fault-tolerant system in Google Cloud using CNCF tools like Jaeger and Istio, delivering robust observability and service mesh capabilities for efficient issue diagnosis and traffic management.
Technologies: Agile Software Development, Kubernetes, Google Kubernetes Engine (GKE), Kubernetes Operators, Google Cloud, GitHub Actions, Amazon Elastic Container Registry (ECR), Bash, Shell, Cloud Migration, Amazon Aurora, AWS IAM

Lead DevOps Engineer

2023 - 2023
Dacio AI
  • Designed and implemented infrastructure for real-time video streaming platform using Cloudflare Streams, R2, Amazon Kinesis, and Amazon S3, ensuring seamless playback and high availability.
  • Architected and deployed a scalable and cost-optimized cloud infrastructure using the AWS Well-Architected Framework, achieving measurable cost reductions and improved performance.
  • Established and maintained CI/CD pipelines for Kubernetes-based infrastructure leveraging GitHub Actions, ensuring zero downtime deployments and streamlined release cycles.
  • Implemented and optimized monitoring and centralized logging solutions with Prometheus, Grafana, and the ELK stack, providing actionable insights and reducing issue resolution time by 40%.
Technologies: Kubernetes, Amazon Web Services (AWS), GitHub Actions, Prometheus, Grafana, Amazon Elastic Container Registry (ECR), Bash, Shell, Amazon ElastiCache, Amazon Aurora, AWS IAM

Lead DevOps Engineer

2020 - 2022
nDimensional Inc
  • Contributed to the migration of the whole infrastructure from VM-based to Kubernetes with support for zero-downtime deployment, option to roll back deployments, and canary-based deployments. Revamped the entire CI/CD process using Jenkins.
  • Managed three DevOps engineers following the practice of Agile-based methodologies along with support for managing the production system. Set up a production-ready system with Prometheus, Papertrail, Jaeger, and Istio.
  • Did capacity and AWS cost estimation for a big data application with workloads for real-time data of 3 GB/min and storage of 2 TB in Cassandra. Managed distributed systems, including Kafka, Cassandra, Spark, Flink, and Akka Clusters in Kubernetes.
Technologies: Kubernetes, Amazon Web Services (AWS), Terraform, Spark, Scala, Jenkins, Flink, Infrastructure as Code (IaC), Prometheus, Web Security, DevOps, CI/CD Pipelines, Continuous Deployment, Linux, Python 3, Docker, Test-driven Development (TDD), Elasticsearch, Redis, Python, Amazon EKS, Amazon RDS, Amazon EC2, Amazon API, IntelliJ IDEA, Helm, Continuous Integration (CI), Monitoring, Microservices, Cloud Services, Scalability, Load Balancers, Pulumi, TypeScript, API Testing, Cost Reduction & Optimization (Cost-down), Cloud Infrastructure, Datadog, Logging, AWS Lambda, Amazon S3 (AWS S3), GitHub, Cloud, Containerization, Orchestration, Scripting Languages, Security, ELK (Elastic Stack), AWS VPN, FastAPI, NGINX, Cloud Migration, JavaScript, Grafana, GitHub Actions, Automation, Confluence, Site Reliability Engineering (SRE), Google Cloud Platform (GCP), Platform Engineering, Vercel, Cloud Architecture, Architecture, AWS Cloud Architecture, Shell Scripting, Argo CD, GitOps, AWS Cloud Development Kit (CDK), Amazon Elastic Container Registry (ECR), Bash, Shell, Amazon ElastiCache, AWS IAM

Architect | Lead DevOps Engineer

2019 - 2020
Francium Technologies
  • Set up a multi-cloud infrastructure leveraging services from AWS, GCP including orchestrating 30+ microservices using Kubernetes and creating CD pipelines which significantly reduced the time to put services to production.
  • Established a proper monitoring-and-tracing system and scaled the product to handle 50x load with half the infrastructure cost. Set up the fitness function to track cost as one of the metrics for infrastructure (AWS and GCP).
  • Architected and led the company in high-performance DevOps culture by following principles of XP/lean for DevOps and automating the entire infrastructure using Terraform.
Technologies: Amazon Web Services (AWS), Kubernetes, Elasticsearch, Prometheus, Google Cloud Platform (GCP), Terraform, Go, Ruby on Rails (RoR), Infrastructure as Code (IaC), DevOps, CI/CD Pipelines, Heroku, Continuous Deployment, Linux, Docker, Test-driven Development (TDD), Redis, Python 3, Python, Amazon EKS, Amazon Elastic Container Service (ECS), Amazon RDS, Amazon EC2, Amazon API, IntelliJ IDEA, Helm, Google Kubernetes Engine (GKE), Continuous Integration (CI), Monitoring, Microservices, Cloud Services, Scalability, Load Balancers, Azure, Monorepos, TypeScript, Node.js, Cost Reduction & Optimization (Cost-down), Cloud Infrastructure, Logging, AWS Lambda, Amazon S3 (AWS S3), GitHub, Cloud, Containerization, Orchestration, Scripting Languages, ELK (Elastic Stack), FastAPI, NGINX, DigitalOcean, Grafana, GitHub Actions, Automation, Docker Swarm, Networking, MySQL, Confluence, AWS CloudFormation, Platform Engineering, Google Cloud, Vercel, Cloud Architecture, Architecture, AWS Cloud Architecture, Shell Scripting, Argo CD, GitOps, AWS Cloud Development Kit (CDK), AWS Batch, Bash, Shell, AWS IAM

Technical Lead

2015 - 2019
ThoughtWorks
  • Worked as part of multiple engagements ranging from startups to massive enterprises and led many projects involving various technology stacks. Followed TDD and agile (XP/lean) methodologies in all teams and tech stacks.
  • Set up an authentication system for a popular bank in Spain and handled many requests. Chose the tools to track performance and utilize the infrastructure efficiently. Used Kubernetes, Go, Prometheus, Redis, and Elasticsearch.
  • Worked at a client's location (UK) to refactor their core pricing engine. Followed a strangulation approach and, in the process, improved their CI/CD pipelines.
Technologies: Amazon Web Services (AWS), Kubernetes, Ruby on Rails (RoR), Ruby, Terraform, Go, PostgreSQL, Infrastructure as Code (IaC), CI/CD Pipelines, Continuous Deployment, Linux, Docker, Test-driven Development (TDD), Heroku, Python 3, Python, Amazon EKS, Amazon Elastic Container Service (ECS), Amazon RDS, Amazon EC2, Amazon API, IntelliJ IDEA, Helm, Continuous Integration (CI), Monitoring, Microservices, Cloud Services, Load Balancers, Node.js, Cloud Infrastructure, Logging, Amazon S3 (AWS S3), GitHub, Cloud, Containerization, Orchestration, Scripting Languages, GitLab CI/CD, NGINX, JavaScript, Automation, Networking, MySQL, AWS CloudFormation, Full-stack Development, Google Cloud, AWS Cloud Architecture, Shell Scripting, AWS Batch

Senior Software Engineer

2011 - 2015
Pramati Technologies
  • Built a type of warehouse management software for one of the enterprise companies and was responsible for developing the back-end system using Java, Spring, and JPA technology stack.
  • Developed a high-performance, real-time bidding engine which handles millions of requests in Scala and Akka framework.
  • Set up virtual machines using Ansible and Bash scripts and used Jenkins for continuous integration and deployment.
Technologies: Ansible, Docker, JPA, Spring, Scala, Java, Test-driven Development (TDD), Continuous Integration (CI), Cloud Services, Load Balancers, GitHub, Scripting Languages, Automation, MySQL, Full-stack Development, Google Cloud

Application Developer

2009 - 2011
Oracle
  • Contributed to the Oracle E-Business Suite development and handled the EU region's payroll system.
  • Implemented performance fine-tuning for a Java virtual machine (JVM) and analyzed and optimized SQL query performance.
  • Handled multiple clients for the EU region and was in charge of a complete module.
Technologies: Oracle Database, JPA, Spring, Java, Test-driven Development (TDD), Full-stack Development

Experience

Authentication System for Openbank SA

I built an authentication engine in Go for a distributed microservices system. It can handle the scale of 1 million requests/second with a latency of 30 milliseconds. The entire infrastructure was created using Terraform. Kubernetes is used as the container orchestrator, Jaeger is used to trace the performance bottleneck, and Prometheus to monitor and set alarms.

CI/CD Setup for Monolith to Microservice Migration

https://nd.com
AI/ML monolith platform migration. I led the DevOps migration from a Scala-based monolith platform to a microservice platform. Infrastructure was created using Terraform (Iac), Jenkins was used as the CI/CD tool, and Kubernetes was used for container orchestration. I suggested and implemented tools to manage configs and secrets, the IaC approach, and security best practices to be followed.

Infrastructure Setup for the Consulting Group

I set up the entire Infrastructure for one of Boston's most prominent consulting groups. I used best practices in CI/CD, Infrastructure as Code, distributed logging and tracing, observability, and monitoring for infra and application.

I analyzed and suggested pros and cons for the entire DevOps stack, implemented the right tool for each and effectively communicated with developers to set up seamless auto-deployment and one-click deployment for the apps.

Took sessions with the development team on the proper usage of tools for logging and monitoring.

Education

2005 - 2009

Bachelor's Degree in Computer Science

Thiagarajar College of Engineering - Madurai, India

Certifications

NOVEMBER 2021 - NOVEMBER 2023

Certified Kubernetes Security Specialist

Cloud Native Computing Foundation

APRIL 2020 - APRIL 2023

Certified Kubernetes Administrator

The Cloud Native Computing Foundation

MARCH 2020 - MARCH 2023

AWS Certified Solutions Architect — Associate

Amazon Web Services (AWS)

Skills

Libraries/APIs

Amazon API, Vue 2, Node.js, React, OpenAI API

Tools

Terraform, IntelliJ IDEA, Ansible, Helm, Google Kubernetes Engine (GKE), Amazon EKS, Amazon CloudFront CDN, Logging, GitHub, NGINX, AWS CloudFormation, Amazon Elastic Container Registry (ECR), Shell, Amazon ElastiCache, AWS IAM, Grafana, Amazon Elastic Container Service (ECS), AWS CodeBuild, ELK (Elastic Stack), GitLab CI/CD, Docker Swarm, Confluence, AWS Cloud Development Kit (CDK), AWS Batch, Vim Text Editor, Jenkins, Flink, Mesos, CircleCI, Traefik, Papertrail, Kubernetes Operators

Languages

Java, Python, Bash, Go, Scala, Python 3, TypeScript, JavaScript, Ruby

Frameworks

Ruby on Rails (RoR), Spring 5, JPA, Spring, Spark, Akka, Apache Spark, gRPC

Paradigms

Agile Software Development, Test-driven Development (TDD), Continuous Deployment, Continuous Integration (CI), DevOps, Microservices, Automation, Functional Programming

Platforms

Docker, Kubernetes, Amazon Web Services (AWS), Google Cloud Platform (GCP), Linux, Amazon EC2, Vercel, Heroku, Dokku, DigitalOcean, AWS Lambda, Oracle Database, Apache Kafka, Azure

Storage

PostgreSQL, Redis, Datadog, Amazon S3 (AWS S3), Google Cloud, Elasticsearch, MySQL, Amazon Aurora

Other

Prometheus, Infrastructure as Code (IaC), Shell Scripting, CI/CD Pipelines, API Testing, AWS Cloud Architecture, AWS DevOps, Networking, Cloud Architecture, Cost Control, Amazon RDS, AWS Certified Solution Architect, Cloud Services, Scalability, Load Balancers, Cost Reduction & Optimization (Cost-down), Cloud Infrastructure, Cloud, Containerization, Orchestration, Scripting Languages, Cloud Migration, GitHub Actions, Full-stack Development, Platform Engineering, Architecture, Argo CD, GitOps, Certified Kubernetes Administrator (CKA), Monitoring, Kubernetes Operations (kOps), Immutable Infrastructure, Cloudflare, AWS CodePipeline, Security, AWS VPN, FastAPI, Site Reliability Engineering (SRE), Web Security, Pulumi, Monorepos, Large Language Models (LLMs), LangChain, AI Agents

Collaboration That Works

How to Work with Toptal

Toptal matches you directly with global industry experts from our network in hours—not weeks or months.

1

Share your needs

Discuss your requirements and refine your scope in a call with a Toptal domain expert.
2

Choose your talent

Get a short list of expertly matched talent within 24 hours to review, interview, and choose from.
3

Start your risk-free talent trial

Work with your chosen talent on a trial basis for up to two weeks. Pay only if you decide to hire them.

Top talent is in high demand.

Start hiring