Ievgen (Eugene) Morokin, Site Reliability Engineer and Developer in Fremont, CA, United States
Ievgen (Eugene) Morokin

Site Reliability Engineer and Developer in Fremont, CA, United States

Member since July 23, 2020
Eugene is an accomplished GTD DevOps and site reliability engineer (SRE) with six years of experience that include old-school Linux and an extensive array of technologies and tools. His professionalism stands on three pillars: attention to small details, perfectionism, and the ability to predict the unpredictable. Eugene is a quick study who excels at identifying the best technologies and solutions for each situation.
Ievgen is now available for hire

Portfolio

  • Pango
    Amazon Web Services (AWS), Teams, Agile, Tableau...
  • Pango
    Amazon Web Services (AWS), Docker Compose, RabbitMQ, Teams...
  • CyderSoft
    Amazon Web Services (AWS), Apache ZooKeeper, Consul, GitLab, Microservices...

Experience

Location

Fremont, CA, United States

Availability

Part-time

Preferred Environment

Amazon Web Services (AWS), Geohash, OpenVAS, Git, PagerDuty, Opsgenie, Rsyslog, Fluentd, ELK (Elastic Stack), Tableau, Apache Airflow, Zeppelin, AWS EMR, Hadoop, Spark, ClickHouse, Tarantool, Memcached, Redis, PostgreSQL, MariaDB, MySQL, Okta, RabbitMQ, NATS, Envoy Proxy, HAProxy, Apache, OpenResty, Nginx, Google Cloud Platform (GCP), Apache ZooKeeper, Consul, Vault, Grafana, Prometheus, Jenkins, Groovy, Bash, Lua, Python, Docker, Ansible, Terraform, AWS

The most amazing...

...thing I've implemented from the ground up is an SLA monitoring and reporting system for a geographically distributed VPN infrastructure.

Employment

  • Lead Site Reliability Engineer

    2019 - 2020
    Pango
    • Implemented a CD pipeline for automated deployment of a VPN stack to a production fleet (over 600 hosts), drastically reducing the time spent in toil work.
    • Improved visibility, service quality, and customer experience, and reduced incident resolution time by implementing SLA monitoring and reporting.
    • Implemented Geohash technology for proximity searches.
    • Troubleshot and resolved complex tasks by providing a higher level of tech support for team members.
    • Led a geographically distributed team of multilingual engineers in multiple countries, coached and mentored team members, and motivated people to achieve business and personal goals in a timely manner.
    • Conducted on-site onboarding of a contractor team located in Costa Rica and Bolivia.
    • Planned projects and sprints, conducted retrospectives and performance reviews for team members, ensured team success and efficiency, and reported to stakeholders.
    Technologies: Amazon Web Services (AWS), Teams, Agile, Tableau, Amazon Elastic MapReduce (EMR), Zeppelin, Redshift, Hybrid Cloud Infrastructure, On-premise, HAProxy, Envoy Proxy, Nginx, Python, Google Cloud Platform (GCP), Terraform, Ansible, ELK (Elastic Stack), Fluentd, CI/CD Pipelines, Grafana, Prometheus, Geohash, VPN, Okta, Vault, Consul, AWS, Docker, Jenkins, SLA monitoring
  • Site Reliability Engineer

    2018 - 2019
    Pango
    • Dockerized and migrated key parts of an on-site Hadoop/Spark cluster to AWS. Fine-tuned AWS EMR to increase stability, improve performance (faster ETL jobs processing), and reduce costs.
    • Migrated an on-site legacy Tableau server to AWS and retained data. Implemented automated provisioning/deployment with Terraform and Ansible and monitoring with Prometheus. Drastically improved stability, performance, and report quality as a result.
    • Collaborated with the SecOps team to implement golden images and drove end-to-end deployment across the production fleet in both cloud and bare metal, thereby significantly improving security and stability.
    • Drove end-to-end implementation/deployment of a standardized naming schema across the production fleet.
    • Trained and assisted team members on various topics, including best practices and documentation writing.
    • Troubleshot networking and performance issues across production and worked closely with vendors and developers on resolutions.
    Technologies: Amazon Web Services (AWS), Docker Compose, RabbitMQ, Teams, ELK (Elastic Stack), Fluentd, PagerDuty, Opsgenie, GitHub, Jira, Hybrid Cloud Infrastructure, On-premise, Tableau, Amazon Elastic MapReduce (EMR), Zeppelin, Spark, Hadoop, SecOps, Apache, MySQL, Okta, Python, VPN, Nginx, HAProxy, Grafana, Prometheus, Ansible, Terraform, Docker, Vault, Consul, Google Cloud Platform (GCP), AWS
  • DevOps Engineer

    2016 - 2018
    CyderSoft
    • Developed and supported a custom AWS Cloud orchestration solution. Particularly responsible for an EC2 Spot instances Auto Scaling module that significantly reduced infrastructure costs.
    • Performed migrations from shell script-based automation to Ansible and Terraform, continuously developed new roles and modules for application deployment and infrastructure provisioning.
    • Designed Grafana dashboards based on InfluxDB, Prometheus, and ClickHouse data sources for advanced monitoring and troubleshooting, effective cost control, and for BI and product teams.
    Technologies: Amazon Web Services (AWS), Apache ZooKeeper, Consul, GitLab, Microservices, High-load, Autoscaling, Node.js, Ansible, Terraform, NATS, InfluxDB, Grafana, Prometheus, Twemproxy, Aerospike, ClickHouse, MySQL, Redis, Linux, Packer, Python, Bash, Lua, OpenResty, Proxies, Nginx, AWS
  • Operations Engineer

    2014 - 2015
    iMesh
    • Automated Hybrid Cloud (AWS, on-premise KVM) operations, significantly reducing time spent on toil work.
    • Performed a vulnerability assessment with OpenVAS, including issues analysis and security hardening on production hosts, thereby drastically reducing the number of security incidents.
    • Optimized backup procedures of a MySQL server fleet, thereby reducing backup time.
    Technologies: Amazon Web Services (AWS), Apache, Nginx, Redis, MongoDB, MySQL, Ansible, SSL Configurations, SSL Certificates, OpenVAS, Bash, DNS Servers, Akamai, Content Delivery Networks (CDN), CentOS, Linux, KVM/Qemu, AWS

Experience

  • AWS EC2 Spot Instances Auto Scaling Solution

    I developed a Python and Ansible-based service for autoscaling and replacement of AWS EC2 Spot instances. The key feature was the ability to scale a specific instance type from the range of allowed types in a particular availability zone according to the current state of the spot market and availability of the allowed instance types.

Skills

  • Tools

    Terraform, Ansible, Jenkins, Grafana, Vault, Apache ZooKeeper, Nginx, Apache, Envoy Proxy, RabbitMQ, Apache Airflow, Tableau, ELK (Elastic Stack), Fluentd, Rsyslog, Git, KVM/Qemu, Packer, GitLab, VPN, Amazon Elastic MapReduce (EMR), Jira, GitHub, Docker Compose
  • Platforms

    Amazon Web Services (AWS), Docker, Google Cloud Platform (GCP), OpenResty, Zeppelin, PagerDuty, Linux, CentOS
  • Other

    AWS, Prometheus, Consul, HAProxy, NATS, Okta, Opsgenie, Geohash, Content Delivery Networks (CDN), Akamai, DNS Servers, SSL Certificates, SSL Configurations, Proxies, Twemproxy, Autoscaling, High-load, SecOps, Hybrid Cloud Infrastructure, Teams, SLA monitoring, CI/CD Pipelines, Infrastructure Security, Application Security
  • Languages

    Python, Lua, Bash, Groovy
  • Frameworks

    Spark, Hadoop, AWS EMR, OpenVAS
  • Libraries/APIs

    Node.js
  • Paradigms

    Microservices, Agile
  • Storage

    MySQL, MariaDB, PostgreSQL, Redis, Memcached, Tarantool, ClickHouse, MongoDB, Aerospike, InfluxDB, On-premise, Redshift
  • Industry Expertise

    Network Security, IT Security

Certifications

  • Network Security Expert
    MAY 2013 - PRESENT
    Technion - Israel Institute of Technology

To view more profiles

Join Toptal
Share it with others