
Ramil Gataullin
Verified Expert in Engineering
Data Engineer and Developer
Kazan, Tatarstan, Russia
Toptal member since May 11, 2021
Ramil is a data engineer with 9+ years of experience with distributed data processing systems and deep knowledge of SQL and relational databases. Ramil is skilled in designing, implementing, and maintaining ETL processes using technologies like Spark and Airflow. He also constructs ML applications using tools like XGBoost, scikit-learn, and SparkML in AWS and GCP cloud environments. Ramil also has some hands-on experience in team and product management. He follows and promotes DevOps principles.
Portfolio
Experience
- SQL - 8 years
- Python - 7 years
- Flask - 6 years
- Git - 5 years
- Data Engineering - 5 years
- Docker - 4 years
- Apache Airflow - 4 years
- Spark - 4 years
Preferred Environment
Python, Git, Linux, Docker, Apache Airflow, Metabase, Flask, GitLab CI/CD, DevOps
The most amazing...
...thing is that I've mastered my infra design skill using Docker, GitLab, Flask, Metabase, and Airflow to build MVP for several products in a month last year.
Work Experience
Co-founder and Technical Lead
Sayt
- Designed, implemented, and maintained a web platform that collects and analyses voice feedback that helps to improve customer service in HoReCa.
- Participated in business development and testing hypothesis, acting as a CTO to find the product's market fit.
- Built and managed a team of developers, overseeing their growth through mentoring, conducting code reviews, and providing technical guidance.
Lead Data Engineer
Rollee SaS
- Built and maintained a data platform to support employers' income data in Europe.
- Built dashboards (metabase) to support business decision-making.
- Supervised a team of data engineers, offering mentorship, conducting code reviews, and providing technical guidance.
Data Engineer
EPAM Systems
- Assisted in building and maintaining clinical trial pipelines, including cleansing, enrichment, and automation tasks.
- Designed and implemented an ETL pipeline to support the semantic mapping process.
- Guided and participated in bug fixes with the Foundry Git repository.
CTO | Technical Lead
SpecSharing
- Developed and maintained a special machinery rental web platform (Django).
- Built and managed the developer team, mentoring, conducting code reviews, and providing technical guidance.
- Organized and automated startup business processes, including CRM integrations, document flow management, and corporate messenger integrations.
Data Science Engineer
Provectus
- Designed, implemented, and maintained ETL processes for the project enriching (SQL and machine learning) RTB data with the devices' household information.
- Designed, implemented, and maintained a reporting system API using Amazon EMR, AWS Lambda, Amazon VPC, Amazon RDS, Amazon SQS, Spark, and Hydrosphere Mist.
- Built and maintained Airflow DAGs in an eCommerce project using Apache Airflow (with Great Expectation data suits), Snowflake, Amazon S3, and the data vault methodology.
Researcher and Software Engineer
Institute of Applied Semiotics, Tatarstan Academy of Sciences
- Implemented a morphological analyzer for the Tatar language.
- Assisted in designing and implementing the Tatar national text corpus. Designed the database architecture, ETL processes, and web application.
- Conducted R&D on a morphological disambiguation task. Defended my PhD thesis with its results. Thesis titled: "Morphological Disambiguation in Text-Corpus (on the example of the Tatar language)."
Experience
Flask Web UI for HFST (NLP Tool)
https://gitlab.com/ipsanrt/hfst_uiKazanRent: Web App for Rental Platform
https://kazanrent.ruAWS AutoML Pipeline for Fashion Startup
Education
PhD in Computer Science
Kazan Federal University - Kazan, Russia
Bachelor's Degree in Computer Science
Kazan Federal University - Kazan, Russia
Skills
Libraries/APIs
PySpark, Scikit-learn, XGBoost
Tools
Apache Airflow, Git, Sublime Text, Amazon Elastic MapReduce (EMR), Amazon Virtual Private Cloud (VPC), GitLab CI/CD, Celery, Amazon SageMaker, GitLab
Languages
Python, SQL, HTML, Snowflake, CSS, JavaScript
Frameworks
Flask, Apache Spark, Hadoop, Spark, Django
Paradigms
ETL, Database Design, Object-oriented Programming (OOP), REST, Web Application Architecture, Business Intelligence (BI), DevOps
Platforms
Docker, Amazon Web Services (AWS), AWS Lambda, Linux
Storage
Apache Hive, PostgreSQL, Database Architecture, Redis, JSON
Other
Data Engineering, Natural Language Processing (NLP), Generative Pre-trained Transformers (GPT), Algorithms, Statistics, Discrete Mathematics, Research, Statistical Modeling, Structural Design, Article Design, Foundry, HFST, Machine Learning, Metabase, EMR, Business, IT Management, Full-stack
How to Work with Toptal
Toptal matches you directly with global industry experts from our network in hours—not weeks or months.
Share your needs
Choose your talent
Start your risk-free talent trial
Top talent is in high demand.
Start hiring