Ramil Gataullin, Developer in Kazan, Tatarstan, Russia
Ramil is available for hire
Hire Ramil

Ramil Gataullin

Data Engineer and Developer

Kazan, Tatarstan, Russia

Toptal member since May 11, 2021

Bio

Ramil is a data engineer with 9+ years of experience with distributed data processing systems and deep knowledge of SQL and relational databases. Ramil is skilled in designing, implementing, and maintaining ETL processes using technologies like Spark and Airflow. He also constructs ML applications using tools like XGBoost, scikit-learn, and SparkML in AWS and GCP cloud environments. Ramil also has some hands-on experience in team and product management. He follows and promotes DevOps principles.

Portfolio

Sayt
Full-stack, DevOps, Docker, Flask, GitLab, GitLab CI/CD, Metabase...
Rollee SaS
Python, Apache Airflow, Docker, GitLab CI/CD, SQL, PostgreSQL, Metabase
EPAM Systems
Spark, Foundry, SQL, ETL, Git, Data Engineering, Apache Spark, PySpark

Experience

  • SQL - 8 years
  • Python - 7 years
  • Flask - 6 years
  • Git - 5 years
  • Data Engineering - 5 years
  • Docker - 4 years
  • Apache Airflow - 4 years
  • Spark - 4 years

Preferred Environment

Python, Git, Linux, Docker, Apache Airflow, Metabase, Flask, GitLab CI/CD, DevOps

The most amazing...

...thing is that I've mastered my infra design skill using Docker, GitLab, Flask, Metabase, and Airflow to build MVP for several products in a month last year.

Work Experience

Co-founder and Technical Lead

2022 - PRESENT
Sayt
  • Designed, implemented, and maintained a web platform that collects and analyses voice feedback that helps to improve customer service in HoReCa.
  • Participated in business development and testing hypothesis, acting as a CTO to find the product's market fit.
  • Built and managed a team of developers, overseeing their growth through mentoring, conducting code reviews, and providing technical guidance.
Technologies: Full-stack, DevOps, Docker, Flask, GitLab, GitLab CI/CD, Metabase, IT Management, Business

Lead Data Engineer

2021 - 2023
Rollee SaS
  • Built and maintained a data platform to support employers' income data in Europe.
  • Built dashboards (metabase) to support business decision-making.
  • Supervised a team of data engineers, offering mentorship, conducting code reviews, and providing technical guidance.
Technologies: Python, Apache Airflow, Docker, GitLab CI/CD, SQL, PostgreSQL, Metabase

Data Engineer

2021 - 2021
EPAM Systems
  • Assisted in building and maintaining clinical trial pipelines, including cleansing, enrichment, and automation tasks.
  • Designed and implemented an ETL pipeline to support the semantic mapping process.
  • Guided and participated in bug fixes with the Foundry Git repository.
Technologies: Spark, Foundry, SQL, ETL, Git, Data Engineering, Apache Spark, PySpark

CTO | Technical Lead

2019 - 2020
SpecSharing
  • Developed and maintained a special machinery rental web platform (Django).
  • Built and managed the developer team, mentoring, conducting code reviews, and providing technical guidance.
  • Organized and automated startup business processes, including CRM integrations, document flow management, and corporate messenger integrations.
Technologies: Python, Django, PostgreSQL, Redis, Docker, GitLab CI/CD, Git

Data Science Engineer

2015 - 2020
Provectus
  • Designed, implemented, and maintained ETL processes for the project enriching (SQL and machine learning) RTB data with the devices' household information.
  • Designed, implemented, and maintained a reporting system API using Amazon EMR, AWS Lambda, Amazon VPC, Amazon RDS, Amazon SQS, Spark, and Hydrosphere Mist.
  • Built and maintained Airflow DAGs in an eCommerce project using Apache Airflow (with Great Expectation data suits), Snowflake, Amazon S3, and the data vault methodology.
Technologies: Hadoop, Spark, Apache Airflow, Docker, Apache Hive, Amazon Web Services (AWS), Amazon Elastic MapReduce (EMR), AWS Lambda, Amazon Virtual Private Cloud (VPC), SQL, Scikit-learn, XGBoost, Snowflake, ETL, Machine Learning, Database Design, Git, Data Engineering, Apache Spark, EMR, PySpark

Researcher and Software Engineer

2013 - 2020
Institute of Applied Semiotics, Tatarstan Academy of Sciences
  • Implemented a morphological analyzer for the Tatar language.
  • Assisted in designing and implementing the Tatar national text corpus. Designed the database architecture, ETL processes, and web application.
  • Conducted R&D on a morphological disambiguation task. Defended my PhD thesis with its results. Thesis titled: "Morphological Disambiguation in Text-Corpus (on the example of the Tatar language)."
Technologies: Python, PostgreSQL, Flask, Django, Docker, GitLab CI/CD, SQL, Natural Language Processing (NLP), Generative Pre-trained Transformers (GPT), Machine Learning, Database Design, JSON, Git

Experience

Flask Web UI for HFST (NLP Tool)

https://gitlab.com/ipsanrt/hfst_ui
An easy-to-use web interface that helps to compile, test, and use HFST models (the Helsinki Finite State Transducer, natural language processing tool) not from the command line but user-friendly UI. I designed and implemented the Flask web application (built on top of Flask app builder), and integrated the HFST tool.

KazanRent: Web App for Rental Platform

https://kazanrent.ru
A web application that helps customers to book and rent photos, videos, and game equipment daily or weekly. As a full-stack developer, I've designed and implemented a CRM and REST API for TgBot and some kind of ERP system covering most renting business processes. Additionally, I created the BI dashboard with all business KPIs using Metabase. The project is still in progress, and now I maintain the system and lead the development.

AWS AutoML Pipeline for Fashion Startup

All-in-one fashion collection forecasting and management AI platform. I helped with AWS services by designing and executing an AutoML pipeline on the AWS platform (client's data automatically picked up from S3, based on which machine learning model is trained and the forecast made), using Step Functions, SageMaker, and Lambda.

Education

2013 - 2017

PhD in Computer Science

Kazan Federal University - Kazan, Russia

2008 - 2013

Bachelor's Degree in Computer Science

Kazan Federal University - Kazan, Russia

Skills

Libraries/APIs

PySpark, Scikit-learn, XGBoost

Tools

Apache Airflow, Git, Sublime Text, Amazon Elastic MapReduce (EMR), Amazon Virtual Private Cloud (VPC), GitLab CI/CD, Celery, Amazon SageMaker, GitLab

Languages

Python, SQL, HTML, Snowflake, CSS, JavaScript

Frameworks

Flask, Apache Spark, Hadoop, Spark, Django

Paradigms

ETL, Database Design, Object-oriented Programming (OOP), REST, Web Application Architecture, Business Intelligence (BI), DevOps

Platforms

Docker, Amazon Web Services (AWS), AWS Lambda, Linux

Storage

Apache Hive, PostgreSQL, Database Architecture, Redis, JSON

Other

Data Engineering, Natural Language Processing (NLP), Generative Pre-trained Transformers (GPT), Algorithms, Statistics, Discrete Mathematics, Research, Statistical Modeling, Structural Design, Article Design, Foundry, HFST, Machine Learning, Metabase, EMR, Business, IT Management, Full-stack

Collaboration That Works

How to Work with Toptal

Toptal matches you directly with global industry experts from our network in hours—not weeks or months.

1

Share your needs

Discuss your requirements and refine your scope in a call with a Toptal domain expert.
2

Choose your talent

Get a short list of expertly matched talent within 24 hours to review, interview, and choose from.
3

Start your risk-free talent trial

Work with your chosen talent on a trial basis for up to two weeks. Pay only if you decide to hire them.

Top talent is in high demand.

Start hiring