Dmitry Kireev, Developer in New York, NY, United States

Dmitry Kireev

Verified Expert in Engineering

Bio

Dmitry is a cloud architect and site reliability engineer with nearly two decades of intense professional experience strictly adhering to the DevOps methodology. He has architected and built numerous scalable infrastructures for modern cloud systems from scratch. Dmitry has a proven track record of hands-on operations in high-scale environments. He is also proficient with IaC, Kubernetes, automation, scripting, as well as monitoring and observability.

Portfolio

Shift lab
Serverless, Amazon Web Services (AWS), AWS CloudFormation, Amazon Cognito...
HazelOps
Amazon Elastic Container Service (ECS), AWS DevOps, GNU Make...
Sema Technologies, Inc
DevOps, GitHub, GitHub Actions, CI/CD Pipelines, Startups...

Experience

  • Bash - 15 years
  • Linux - 14 years
  • Amazon Web Services (AWS) - 13 years
  • DevOps - 10 years
  • Terraform - 8 years
  • Docker - 6 years
  • Amazon Elastic Container Service (ECS) - 5 years
  • Amazon EKS - 1 year

Availability

Full-time

Preferred Environment

Terraform, Linux, Docker, Amazon Web Services (AWS), DevOps, Serverless, Amazon Elastic Container Service (ECS), Amazon EKS, CI/CD Pipelines, Cloud Engineering

The most amazing...

...thing I've architected, deployed, and managed was a scalable, highly available cloud for an IoT home security product that was later acquired by Moen.

Work Experience

Principal Cloud Engineer

2022 - PRESENT
Shift lab
  • Designed and implemented multi-environment AWS architecture following AWS Well-Architected principles using Terraform.
  • Developed a CI/CD system to deploy and roll back applications with zero downtime.
  • Created a multi-service monorepo for enhanced operational efficiency.
Technologies: Serverless, Amazon Web Services (AWS), AWS CloudFormation, Amazon Cognito, CI/CD Pipelines, Amazon Elastic Block Store (EBS), Amazon Redshift, Identity & Access Management (IAM), DigitalOcean, Cloud Engineering, AWS CLI, Cloud Security, DevOps Engineer, Networks, AWS Deployment, Cron, JavaScript, Slack API, Observability Tools, AWS Cloud Computing Services, Reliability, Reliability Engineering, Operational Control, Operational Excellence, Operational Readiness, API Observability, Disaster Recovery Consulting

Principal Cloud Engineer | Consultant

2015 - PRESENT
HazelOps
  • Founded and led a site reliability engineering and DevOps consultancy, working on projects ranging from a week to multiple years.
  • Built scalable, multi-environment infrastructures with infrastructure as code, self-healing, and predictable environments on AWS.
  • Deployed numerous projects on Ruby on Rails, Django, Flask, and Node.js to EKS, ECS, and Serverless.
  • Implemented CI/CD pipelines using GitHub Actions, GitLab, Jenkins, and CircleCI, with Docker and multi-stage builds.
  • Audited infrastructure performance and produced dozens of full-cycle reports covering key performance factors, each with proposed action items.
  • Helped software engineers implement DevOps, including close communication, strategy, and process improvement.
  • Created OPS procedures in customers' environments, including service-based alerting, on-call rotation, and escalations.
  • Designed and implemented monitoring and alerting systems using Datadog, New Relic, and Prometheus/Grafana.
  • Deployed and maintained distributed systems, including full-cycle management via Terraform, Kubernetes, and Docker.
Technologies: Amazon Elastic Container Service (ECS), AWS DevOps, GNU Make, Amazon Web Services (AWS), Grafana, HAProxy, Python, WordPress, PHP, Java, Serverless, ECS, Docker Swarm, Docker, Ansible, Terraform, NGINX, DevOps, SSL Certificates, Digital Certificates, Traefik, JVM, Flask, Linux, Bash, SQL, PostgreSQL, Amazon RDS, Ruby on Rails (RoR), Amazon EKS, AWS NAT Gateway, Datadog, Software as a Service (SaaS), Docker Compose, Agile, MySQL/MariaDB, Redis, Agile Software Development, Jira, Confluence, SSL, Containerization, TypeScript, Containers, Deployment, Helm, Azure, Kubernetes, Azure DevOps, Cloud, AWS Cloud Architecture, Cloud Architecture, APIs, Infrastructure as Code (IaC), CI/CD Pipelines, Amazon EC2, VPN, Node.js, Google Cloud Platform (GCP), Container Orchestration, Amazon CloudWatch, Amazon Simple Queue Service (SQS), Amazon Aurora, AWS Fargate, AWS Lambda, Amazon DynamoDB, AWS CodeBuild, GitHub, Git, Continuous Integration (CI), Agile DevOps, AWS CloudFormation, Amazon API, Monitoring, Infrastructure Monitoring, Application Monitoring, Enterprise Architecture, Continuous Delivery (CD), Microservices, Microservices Architecture, Django, RabbitMQ, Celery, AWS Elastic Beanstalk, ChatGPT, Machine Learning, Serverless Architecture, Cloud Infrastructure, Architecture, Infrastructure, Cybersecurity, Network Security, Redis Cache, Load Balancers, Heroku, Site Reliability Engineering (SRE), Scaling, Amazon CloudFront CDN, AWS IAM, Amazon Virtual Private Cloud (VPC), CORS, Cloud Migration, Amazon S3 (AWS S3), Go, Startups, System Administration, GitHub Actions, Amazon Elastic Block Store (EBS), Identity & Access Management (IAM), AWS Amplify, Cloud Engineering, AWS CLI, AWS Cloud Development Kit (CDK), Google Compute Engine (GCE), Cloud Security, Windows PowerShell, DevOps Engineer, Networks, AWS Deployment, Cron, JavaScript, Slack API, Observability Tools, AWS Cloud Computing Services, Reliability, Reliability Engineering, Operational Control, Operational Excellence, Operational Readiness, API Observability, Disaster Recovery Consulting

Principal Cloud Engineer

2024 - 2024
Sema Technologies, Inc
  • Architected and deployed AWS infrastructure for an AI Code Scanning Tool using Terraform and AWS Well-Architected principles across multiple environments.
  • Developed a fully automated CI/CD pipeline for seamless delivery and rollback of the AI Code Scanning Tool.
  • Implemented an Amazon SQS-based self-healing worker system.
Technologies: DevOps, GitHub, GitHub Actions, CI/CD Pipelines, Startups, System Administration, Machine Learning Operations (MLOps), Amazon Web Services (AWS), Amazon Elastic Block Store (EBS), Identity & Access Management (IAM), AWS Amplify, Cloud Engineering, AWS CLI, Cloud Security, DevOps Engineer, Networks, AWS Deployment, Cron, Observability Tools, AWS Cloud Computing Services, Reliability, Reliability Engineering, Operational Control, Operational Excellence, Operational Readiness, API Observability

Principal Cloud Engineer

2023 - 2023
SimplyWise, Inc.
  • Decomposed a complex transient bug into a series of hypotheses.
  • Analyzed the current EKS/Django configuration on AWS with minimal documentation.
  • Tracked down the source of odd errors in Django/EKS using troubleshooting methods with Datadog and CloudWatch.
  • Proposed an optimal solution to the issue and supported the team in implementing it.
Technologies: Kubernetes, Amazon EKS, Django, Datadog, Scaling, AWS IAM, Amazon Virtual Private Cloud (VPC), Cloud Migration, Amazon S3 (AWS S3), Startups, Amazon Elastic Block Store (EBS), Identity & Access Management (IAM), Cloud Engineering, AWS CLI, Cloud Security, DevOps Engineer, Networks, AWS Deployment, Cron, Observability Tools, AWS Cloud Computing Services, Reliability, Reliability Engineering, Operational Control, Operational Excellence, Operational Readiness, API Observability

Principal Cloud Engineer

2022 - 2023
SofaBet Co.
  • Improved the current infrastructure to pass the GLI certification.
  • Implemented IAM and role-based PostgreSQL password-less access managed via Terraform.
  • Improved remote access patterns; migrated to OpenVPN.
Technologies: Amazon Web Services (AWS), Serverless, Container Orchestration, Amazon CloudWatch, Amazon Simple Queue Service (SQS), Amazon Aurora, AWS Fargate, AWS Lambda, GitHub, Git, Continuous Integration (CI), Agile DevOps, Terraform, Amazon RDS, Monitoring, Infrastructure Monitoring, Application Monitoring, Continuous Delivery (CD), CI/CD Pipelines, Containerization, Microservices, Microservices Architecture, Celery, ECS, Serverless Architecture, Cloud Infrastructure, Architecture, Infrastructure, Cybersecurity, Network Security, Redis Cache, Load Balancers, Site Reliability Engineering (SRE), Scaling, Amazon CloudFront CDN, AWS IAM, Amazon Virtual Private Cloud (VPC), CORS, Cloud Migration, Amazon S3 (AWS S3), Startups, System Administration, GitHub Actions, Amazon Elastic Block Store (EBS), Identity & Access Management (IAM), AWS Amplify, Cloud Engineering, AWS CLI, Cloud Security, DevOps Engineer, Networks, AWS Deployment, Cron, Observability Tools, AWS Cloud Computing Services, Reliability, Reliability Engineering, Operational Control, Operational Excellence, Operational Readiness, API Observability

Principal Cloud Engineer

2021 - 2022
Chicks Gold Inc.
  • Improved the security and reliability of the current EKS cluster.
  • Designed and implemented the CI/CD system to deploy/roll back the application with zero downtime.
  • Handled troubleshooting and maintenance support for legacy systems.
  • Designed security improvements for the infrastructure overall.
  • Designed branching and staging improvements to facilitate faster QA.
Technologies: Amazon Web Services (AWS), Amazon EKS, Amazon EC2, .NET, Rancher, Azure DevOps, Cloudflare, VPN, Container Orchestration, Amazon CloudWatch, Amazon Aurora, AWS Fargate, AWS Lambda, GitHub, Git, Continuous Integration (CI), Agile DevOps, AWS CloudFormation, Terraform, Amazon RDS, Amazon API, Monitoring, Infrastructure Monitoring, Application Monitoring, Continuous Delivery (CD), CI/CD Pipelines, Containerization, Microservices, Microservices Architecture, Celery, ECS, Serverless Architecture, Cloud Infrastructure, Architecture, Infrastructure, Cybersecurity, Network Security, Redis Cache, Load Balancers, Heroku, Site Reliability Engineering (SRE), Scaling, Amazon CloudFront CDN, AWS IAM, Amazon Virtual Private Cloud (VPC), CORS, Cloud Migration, Amazon S3 (AWS S3), Startups, System Administration, GitHub Actions, Amazon Elastic Block Store (EBS), Identity & Access Management (IAM), Cloud Engineering, AWS CLI, Cloud Security, DevOps Engineer, Networks, AWS Deployment, Cron, Observability Tools, AWS Cloud Computing Services, Reliability, Reliability Engineering, Operational Control, Operational Excellence, Operational Readiness, API Observability

Principal Cloud Engineer

2021 - 2022
ONFO, LLC
  • Designed and implemented a multi-environment AWS architecture.
  • Designed and implemented a CI/CD system to deploy/roll back the application with zero downtime.
  • Updated and migrated a .NET application to the new environment.
  • Updated and migrated a TypeScript/Serverless application to the new environment.
  • Designed and deployed an immutable infrastructure for private Stellar nodes.
Technologies: Amazon Web Services (AWS), DevOps, Stellar SDK, .NET, Serverless, Docker, Docker Compose, Container Orchestration, Amazon CloudWatch, AWS Fargate, AWS Lambda, TypeScript, Amazon DynamoDB, GitHub, Git, Continuous Integration (CI), Agile DevOps, AWS CloudFormation, Terraform, Amazon RDS, Monitoring, Infrastructure Monitoring, Application Monitoring, Continuous Delivery (CD), CI/CD Pipelines, Containerization, Microservices, Microservices Architecture, ECS, Serverless Architecture, Cloud Infrastructure, Architecture, Infrastructure, Cybersecurity, Network Security, Redis Cache, Load Balancers, Site Reliability Engineering (SRE), Scaling, Amazon CloudFront CDN, AWS IAM, Amazon Virtual Private Cloud (VPC), CORS, Cloud Migration, Amazon S3 (AWS S3), Startups, System Administration, GitHub Actions, Amazon Elastic Block Store (EBS), Identity & Access Management (IAM), Cloud Engineering, AWS CLI, Cloud Security, DevOps Engineer, Networks, AWS Deployment, Cron, AWS Cloud Computing Services, Reliability, Reliability Engineering, Operational Control, Operational Excellence, Operational Readiness, API Observability

Principal Cloud Engineer

2020 - 2021
Wizard Inc
  • Designed and implemented a multi-environment, multi-account AWS architecture using AWS Well-Architected principles and Terraform.
  • Developed a CI/CD system for zero-downtime deployment and rollback.
  • Dockerized and migrated a Python and CUDA-based application to Amazon Elastic Container Service (ECS).
  • Provided troubleshooting and maintenance support for legacy systems.
  • Designed and implemented monitoring of all crucial systems using Datadog.
Technologies: Terraform, ECS, Docker, CI/CD Pipelines, Amazon Elastic Block Store (EBS), Identity & Access Management (IAM), Cloud Engineering, AWS CLI, Cloud Security, DevOps Engineer, Networks, AWS Deployment, Cron, Observability Tools, AWS Cloud Computing Services, Reliability, Reliability Engineering, Operational Control, Operational Excellence, Operational Readiness, API Observability

Principal Cloud Engineer

2020 - 2021
Tatango, Inc.
  • Designed and implemented a multi-environment AWS architecture.
  • Designed and implemented a CI/CD system to deploy/roll back the application with zero downtime.
  • Updated and migrated a Rails application to the new environment.
  • Updated and migrated a TypeScript/Serverless application to the new environment.
Technologies: Amazon Web Services (AWS), SQL, AWS CodeDeploy, AWS DevOps, Redshift, Datadog, ECS, Serverless, Ruby on Rails (RoR), Terraform, VPN, Container Orchestration, Amazon CloudWatch, AWS Fargate, AWS Lambda, GitHub, Git, Continuous Integration (CI), Agile DevOps, Amazon RDS, Amazon API, Monitoring, Infrastructure Monitoring, Application Monitoring, Network Monitoring, Enterprise Architecture, Continuous Delivery (CD), CI/CD Pipelines, Containerization, Serverless Architecture, Cloud Infrastructure, Architecture, Infrastructure, Cybersecurity, Network Security, Redis Cache, Load Balancers, Site Reliability Engineering (SRE), Scaling, Amazon CloudFront CDN, AWS IAM, Amazon Virtual Private Cloud (VPC), CORS, Cloud Migration, Amazon S3 (AWS S3), Startups, System Administration, GitHub Actions, Amazon Elastic Block Store (EBS), Amazon Redshift, Identity & Access Management (IAM), Cloud Engineering, AWS CLI, Cloud Security, DevOps Engineer, Networks, AWS Deployment, Cron, AWS Cloud Computing Services, Reliability, Reliability Engineering, Operational Control, Operational Excellence, Operational Readiness, API Observability

Principal Cloud Engineer

2019 - 2020
Patron Technology, Inc.
  • Designed and implemented scalable AWS-based infrastructure with Terraform.
  • Worked as a part of a core team to iterate on the infrastructure side of product ideas.
  • Designed and implemented a scalable CI/CD system on Amazon ECS.
Technologies: Amazon EC2, Docker, Kubernetes, Docker Swarm, Elasticsearch, Terraform, Ansible, Apache Airflow, Amazon Elastic Block Store (EBS), Identity & Access Management (IAM), Cloud Engineering, AWS CLI, Cloud Security, DevOps Engineer, Networks, AWS Deployment, Cron, Observability Tools, AWS Cloud Computing Services, Reliability, Reliability Engineering, Operational Control, Operational Excellence, Operational Readiness, API Observability

Lead Site Reliability Engineer

2016 - 2019
Moen
  • Designed and executed complex IoT infrastructure from scratch on AWS: multi-tier, multi-subnet scalable cloud AWS infrastructure, multi-app stateless stack with ECS and Docker, platform-agnostic local workspaces with Docker.
  • Created and administered Ansible infrastructure: idempotent plays and roles to support infrastructure needs and wrote community-available roles for multiple platforms under Apache Foundation.
  • Designed and implemented CI/CD: complete application lifecycle with green deployments of high-traffic services, platform-agnostic framework to support SaaS or hosted CI servers, and hassle-free pipelines for software engineers.
  • Developed monitoring solutions using ELK for log aggregation, TICK and Grafana for on-prem monitoring, and Datadog and New Relic for SaaS monitoring.
  • Devised operational procedures, including service-oriented OLAs and a "Service Owner First" policy with PagerDuty integration.
  • Created and maintained an upgrade procedure for critical distributed systems that allowed upgrades with no downtime and no data loss for the entire three-year period.
Technologies: AWS DevOps, GNU Make, Amazon Web Services (AWS), Transport Layer Security (TLS), Linux, CircleCI, Docker, TICK Stack, ELK (Elastic Stack), GitLab, Apache Kafka, Ansible, AWS CloudFormation, Terraform, DevOps, SSL Certificates, Digital Certificates, Grafana, JVM, InfluxDB, Bash, SQL, Internet of Things (IoT), Amazon RDS, AWS NAT Gateway, Datadog, Software as a Service (SaaS), Docker Compose, Agile, Redis, Agile Software Development, Jira, Confluence, SSL, Containerization, Containers, Deployment, Cloud, AWS Cloud Architecture, Cloud Architecture, APIs, Infrastructure as Code (IaC), CI/CD Pipelines, Amazon EC2, VPN, Amazon CloudWatch, Amazon Simple Queue Service (SQS), AWS Lambda, TypeScript, Amazon DynamoDB, GitHub, Git, GitLab CI/CD, Continuous Integration (CI), Agile DevOps, Amazon API, Monitoring, Infrastructure Monitoring, Application Monitoring, Network Monitoring, Enterprise Architecture, Continuous Delivery (CD), Microservices, Microservices Architecture, AWS Elastic Beanstalk, Cloud Infrastructure, Architecture, Infrastructure, Cybersecurity, Network Security, Redis Cache, Load Balancers, Site Reliability Engineering (SRE), Scaling, Amazon CloudFront CDN, AWS IAM, Amazon Virtual Private Cloud (VPC), Cloud Migration, Amazon S3 (AWS S3), Startups, System Administration, GitHub Actions, Amazon Elastic Block Store (EBS), Time Series, Identity & Access Management (IAM), AWS Amplify, Cloud Engineering, AWS CLI, Cloud Security, DevOps Engineer, Networks, AWS Deployment, Slack API, Observability Tools, AWS Cloud Computing Services, Reliability, Reliability Engineering, Operational Control, Operational Excellence, Operational Readiness, API Observability

Senior Member of Technical Staff

2016 - 2017
Delphix
  • Architected and implemented multi-tier hybrid cloud AWS infrastructure for a new project for a high-scale testing framework.
  • Constructed log and data aggregation from multiple sources (ELK).
  • Created a virtual and bare-metal host provisioning system (Foreman).
  • Designed and implemented Nmap-based inventory software.
  • Contributed to company-wide IT processes and improvements.
  • Contributed major portions of the on-call rotation, monitoring, SOA, and OLA designs and implementations.
Technologies: AWS DevOps, Amazon Web Services (AWS), Python, AWS CloudFormation, Foreman, Ansible, ELK (Elastic Stack), Jenkins, Terraform, DevOps, SSL Certificates, Digital Certificates, Grafana, Telegraf, JVM, Linux, Bash, SQL, Amazon RDS, AWS NAT Gateway, New Relic, Datadog, Software as a Service (SaaS), Docker Compose, Travis CI, Elasticsearch, Agile, MySQL/MariaDB, Redis, Agile Software Development, Jira, Confluence, SSL, On-premise, Containerization, Containers, Deployment, Cloud, AWS Cloud Architecture, Cloud Architecture, APIs, Infrastructure as Code (IaC), CI/CD Pipelines, Amazon EC2, VPN, Amazon CloudWatch, GitHub, Git, GitLab CI/CD, Continuous Integration (CI), Agile DevOps, Monitoring, Infrastructure Monitoring, Application Monitoring, Network Monitoring, Enterprise Architecture, Continuous Delivery (CD), Windows, RabbitMQ, AWS Elastic Beanstalk, Cloud Infrastructure, Architecture, Infrastructure, Network Security, Redis Cache, Load Balancers, Site Reliability Engineering (SRE), Scaling, AWS IAM, Amazon Virtual Private Cloud (VPC), Amazon S3 (AWS S3), Startups, System Administration, GitHub Actions, Amazon Elastic Block Store (EBS), Identity & Access Management (IAM), Cloud Engineering, AWS CLI, Cloud Security, Windows PowerShell, DevOps Engineer, Networks, AWS Deployment, Cron, Observability Tools, Reliability, Reliability Engineering, Operational Excellence, Operational Readiness, API Observability

Senior DevOps Engineer

2013 - 2016
Intuit
  • Managed a hybrid cloud with around 300 nodes: AWS, VMware, and bare metal.
  • Implemented automation, configuration management, and provisioning, bringing 90% of the environment under Puppet and Git.
  • Managed the lifecycle of legacy systems in .NET and C# and the automation of manually deployed systems.
  • Provided CI in configuration management and IaC: GitFlow, reusable code, and open-source contribution.
  • Managed and mentored junior IT staff, including separation of concerns and easy onboarding.
  • Led most of the post-acquisition infrastructure integration projects.
Technologies: AWS DevOps, Amazon Web Services (AWS), Foreman, Git, TeamCity, ELK (Elastic Stack), Puppet, DevOps, SSL Certificates, Digital Certificates, Grafana, Telegraf, JVM, Linux, Bash, Amazon RDS, New Relic, Software as a Service (SaaS), Docker Compose, Travis CI, Elasticsearch, Agile, MySQL/MariaDB, Redis, Agile Software Development, Jira, Confluence, SSL, On-premise, Containerization, Containers, Deployment, Cloud, AWS Cloud Architecture, Cloud Architecture, APIs, Infrastructure as Code (IaC), CI/CD Pipelines, Amazon EC2, VPN, Amazon CloudWatch, GitHub, Continuous Integration (CI), Agile DevOps, AWS CloudFormation, Monitoring, Application Monitoring, Network Monitoring, Enterprise Architecture, Continuous Delivery (CD), Windows, RabbitMQ, Cloud Infrastructure, Architecture, Infrastructure, Network Security, Redis Cache, Scaling, Amazon CloudFront CDN, Amazon Virtual Private Cloud (VPC), Amazon S3 (AWS S3), Startups, System Administration, GitHub Actions, Amazon Elastic Block Store (EBS), Identity & Access Management (IAM), Cloud Engineering, AWS CLI, Cloud Security, Windows PowerShell, DevOps Engineer, Networks, AWS Deployment, Cron, Observability Tools, AWS Cloud Computing Services, Reliability, Reliability Engineering, Operational Readiness

DevOps Engineer

2011 - 2013
Docstoc (Acquired by Intuit)
  • Supported colocation with 180+ Windows and Linux dedicated servers as well as new server deployment.
  • Managed network security and performance (Juniper SSG, SRX firewalls, A10 Networks load balancer, RADIUS, IPsec, NAT, and Amazon EC2 VPC).
  • Implemented proactive monitoring using Nagios, ELK, and New Relic.
  • Optimized Linux and Windows server performance for high scale.
  • Deployed and maintained on-premise MySQL databases.
  • Introduced and implemented an ELK stack comprising Elasticsearch, Logstash, and Kibana.
Technologies: Amazon Web Services (AWS), AWS DevOps, Nagios, Python, MongoDB, MySQL, LB, Juniper, DevOps, SSL Certificates, Digital Certificates, Grafana, Telegraf, JVM, Linux, Bash, Amazon RDS, New Relic, Software as a Service (SaaS), Infrastructure as Code (IaC), Docker Compose, Travis CI, Elasticsearch, Agile, MySQL/MariaDB, Redis, Agile Software Development, Jira, SSL, On-premise, Containerization, Containers, Deployment, Cloud, AWS Cloud Architecture, APIs, Amazon EC2, VPN, Amazon CloudWatch, GitHub, Git, Continuous Integration (CI), Agile DevOps, AWS CloudFormation, Monitoring, Network Monitoring, Continuous Delivery (CD), CI/CD Pipelines, Windows, RabbitMQ, Cloud Infrastructure, Architecture, Infrastructure, Network Security, Load Balancers, Scaling, AWS IAM, Amazon Virtual Private Cloud (VPC), Amazon S3 (AWS S3), System Administration, GitHub Actions, Amazon Elastic Block Store (EBS), Identity & Access Management (IAM), Cloud Engineering, AWS CLI, Cloud Security, Windows PowerShell, DevOps Engineer, Networks, AWS Deployment, Cron, Observability Tools, AWS Cloud Computing Services, Reliability, Operational Readiness, API Observability

IZE Infrastructure Tool

https://github.com/hazelops/ize
This tool is designed as a simple wrapper around popular tools so that they can be easily integrated into one infra: Terraform, ECS deployment, Serverless, and others.

It combines infra, build, and deploy workflows in one and is too simple to be considered sophisticated. So let's not do it but rather embrace the simplicity and minimalism.

ECS App Terraform Module

https://github.com/hazelops/terraform-aws-ecs-app
This Terraform module creates and manages Amazon ECS applications in a clean, abstract way. The module is actively maintained and covered by multiple end-to-end tests to prevent regressions.

Features:
• Deploy Worker application: No Application Load Balancer (ALB) is required.
• Deploy web application: Includes ALB, AWS Certificate Manager, and Route 53 integration.
• ECR repo Management.
• Naming convention.
• Deployment: Supports deployment via Terraform and external tools like ecs-deploy or ize.
• Datadog integration.
• Autoscaling: Supports both scheduled and trigger-based autoscaling.
• Supports EC2 or Fargate.
• Supports Elastic IP assignment.
• Resource configuration: Configurable CPU and memory resources.
• Manages Elastic File System mounts and shares.
• Supports GPU-based ECS instances.
• Multiple ECS network modes.
• Root block device configuration.
• Automatic Nginx Proxy.
• Firelens/Datadog log driver.
• ECS Exec (console into the container).
• Supports temporary file storage configuration.

Article: Runner Experience Design

https://automationd.com/developer-experience-design/
I'm an adherent of the Credo of Phoenix approach when discussing infrastructure design. Whatever you build should be rebuildable with little to no effort, over and over again, by anyone or anything with sufficient permissions.

While such a poetic way of describing idempotent infrastructure has many important technical characteristics, this time I'd like to talk about the other side of it: the "anyone or anything with sufficient permissions," that is, the runners and their experience.

Article: How to Avoid Human Bottlenecks in Production

https://automationd.com/how-to-avoid-human-bottlenecks-in-production/
There is no doubt we've all heard of the term "bottleneck." A bottleneck is one process in a chain of processes such that its limited capacity reduces the capacity of the whole chain (Wiki).

Generally speaking, running a larger business requires multiple people to handle ideation, design, project management, development, QA, marketing, and infrastructure operations. When a single person limits the capacity of a team, that person becomes a human bottleneck.

In this post, I'd like to highlight two distinct types of human bottlenecks, which can both make a negative impact on the productivity of the team from the perspective of operations and site reliability.
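
The bottleneck definition above can be illustrated with a minimal sketch. It assumes a simple model in which each stage of a delivery chain has a fixed capacity and the chain's throughput equals that of its slowest stage. The stage names, the numbers, and the `chain_capacity` helper are hypothetical and are not taken from the article.

```python
from typing import Dict, Tuple


def chain_capacity(stage_capacity: Dict[str, float]) -> Tuple[str, float]:
    """Return the limiting stage and the resulting throughput of the chain.

    Assumes a purely sequential chain, so overall throughput is the
    minimum of the individual stage capacities.
    """
    if not stage_capacity:
        raise ValueError("at least one stage is required")
    bottleneck = min(stage_capacity, key=stage_capacity.get)
    return bottleneck, stage_capacity[bottleneck]


# Hypothetical weekly capacities (work items per week), for illustration only.
stages = {
    "design": 12,
    "development": 10,
    "qa": 9,
    "infrastructure review": 3,  # e.g. a single person approving every change
}

name, throughput = chain_capacity(stages)
print(f"'{name}' limits the whole chain to {throughput} items per week")
```

In this toy model, raising the capacity of any stage other than the limiting one leaves the overall throughput unchanged, which is why a single overloaded person can cap the output of an entire team.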

Windows Imaging Toolkit

https://github.com/AutomationD/wimaging
WImaging is a set of scripts to prepare WIM images and templates for Foreman to provision Windows hosts. It relies largely on official Microsoft deployment tools, mostly dism.exe.

All relevant configuration files like unattend.xml are rendered by Foreman and downloaded at build time.

Education

2006 - 2009

Bachelor's Degree in Business Communication (English)

Tula State University - Tula, Russia

2004 - 2009

Bachelor's Degree in Business Administration

Tula State University - Tula, Russia

Libraries/APIs

AWS Amplify, Amazon API, Node.js, Slack API

Tools

Git, GNU Make, Ansible, AWS CloudFormation, ELK (Elastic Stack), GitLab, GitLab CI/CD, Terraform, Docker Compose, Grafana, Telegraf, CircleCI, Travis CI, Traefik, Amazon CloudWatch, Amazon Elastic Container Service (ECS), GitHub, VPN, AWS Fargate, Amazon CloudFront CDN, AWS IAM, Amazon Virtual Private Cloud (VPC), Amazon Elastic Block Store (EBS), AWS CLI, AWS Deployment, Cron, Observability Tools, Docker Swarm, NGINX, Puppet, Jenkins, Amazon EKS, Amazon Simple Queue Service (SQS), RabbitMQ, Celery, TeamCity, Nagios, Makefile, AWS CodeDeploy, Jira, Helm, Confluence, Splunk, Stellar SDK, AWS CodeBuild, ChatGPT, Apache Airflow, Amazon Elastic Container Registry (ECR), AWS Parameter Store, Amazon Cognito, AWS Cloud Development Kit (CDK), Google Compute Engine (GCE)

Languages

Python, Bash, Java, PHP, Markdown, Go, JavaScript, SQL, TypeScript

Paradigms

Agile, Continuous Delivery (CD), Continuous Integration (CI), DevOps, Microservices, Microservices Architecture, Serverless Architecture, API Observability, Azure DevOps, Automation, Agile Software Development, Testing

Platforms

Linux, Docker, Amazon Web Services (AWS), AWS Elastic Beanstalk, Amazon EC2, Kubernetes, AWS Lambda, Windows, AWS Cloud Computing Services, Apache Kafka, JVM, Heroku, DigitalOcean, Azure, WordPress, New Relic, Windows Server, Google Cloud Platform (GCP), Blockchain, Rancher, AWS ALB

Storage

Datadog, Amazon Aurora, Amazon S3 (AWS S3), MySQL, MongoDB, InfluxDB, Redis, On-premise, Amazon DynamoDB, Redis Cache, Elasticsearch, MySQL/MariaDB, Databases, PostgreSQL, Redshift, Amazon EFS, SQL Server Management Studio (SSMS)

Industry Expertise

Network Security, Cybersecurity

Frameworks

Flask, Django, Ruby on Rails (RoR), Windows PowerShell, .NET, Serverless Framework

Other

Site Reliability Engineering (SRE), GitHub Actions, AWS DevOps, SSL Certificates, Digital Certificates, CI/CD Pipelines, Amazon RDS, Software as a Service (SaaS), Infrastructure as Code (IaC), SSL, Cloud, Containerization, AWS Cloud Architecture, Containers, Deployment, Cloud Architecture, Container Orchestration, Agile DevOps, Monitoring, Infrastructure Monitoring, Application Monitoring, Cloud Infrastructure, Architecture, Infrastructure, Load Balancers, Scaling, Cloud Migration, Startups, System Administration, Identity & Access Management (IAM), Cloud Engineering, Cloud Security, DevOps Engineer, Networks, Reliability, Reliability Engineering, Operational Excellence, Operational Readiness, Networking, Internet of Things (IoT), AWS NAT Gateway, APIs, Enterprise Architecture, CORS, Operational Control, Disaster Recovery Consulting, TICK Stack, Transport Layer Security (TLS), Foreman, Juniper, LB, ECS, Serverless, HAProxy, Communication, English, Business, Economics, Software Development, Business Planning, Cloudflare, Hospitality, Network Monitoring, Machine Learning, Machine Learning Operations (MLOps), Autoscaling, Amazon Route 53, SSH, Time Series, Amazon Redshift
