Ilmira Terpugova, Developer in Innopolis, Tatarstan, Russia

Ilmira Terpugova

Verified Expert in Engineering

Machine Learning Developer

Innopolis, Tatarstan, Russia

Toptal member since May 7, 2019

Bio

Ilmira is a data scientist with a strong mathematical and programming background. She is hard-working, responsible, and conscientious about meeting deadlines. Ilmira produces valuable findings while also providing thorough, concise explanations and visualizations of the results.

Portfolio

ChangeDynamix, Inc.
Plotly, Seaborn, Scikit-learn, Pandas, Jupyter, Spark, Python
Eurecat, Centre Tecnològic de Catalunya
Parquet, HDFS, Flask, MLlib, Apache Spark, Python, Scala
SoftPlus CJSC
Selenium, Bootstrap, jQuery, JavaScript, MySQL, Java, Groovy, Grails

Experience

Availability

Part-time

Preferred Environment

Git, PyCharm, Linux, MacOS

The most amazing...

...algorithm I've coded is based on a term-weighting scheme and a centroid-based classifier, and it won a Kaggle competition for classifying genetic mutations.

Work Experience

Data Scientist

2017 - 2017
ChangeDynamix, Inc.
  • Implemented unsupervised user clustering on unstructured network activity data collected inside a client network, including feature construction, preprocessing and selection, clustering algorithm selection and evaluation, and visualization.
Technologies: Plotly, Seaborn, Scikit-learn, Pandas, Jupyter, Spark, Python

Data Scientist

2016 - 2017
Eurecat, Centre Tecnològic de Catalunya
  • Implemented an ML classification pipeline for predicting complex diseases (type II diabetes) from genomic and environmental data in a distributed environment, including imputation of missing values, feature selection (out of 755,000 features), and analysis of feature importance.
  • Created a web server for running the automated pipeline and presenting its results.
  • Wrote an implementation of chi-squared feature selection that improved on the one in MLlib.
Technologies: Parquet, HDFS, Flask, MLlib, Apache Spark, Python, Scala
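
As an illustration of the chi-squared feature selection mentioned above, here is a minimal single-machine sketch in plain Python. It is not the distributed implementation described in this profile, and it does not use MLlib; the function names (`chi2_score`, `select_top_k`) and the toy feature names and values are hypothetical. Each categorical feature is scored with Pearson's chi-squared statistic against the class labels, and the highest-scoring features are kept.

```python
from collections import Counter


def chi2_score(feature, labels):
    """Pearson chi-squared statistic between one categorical feature and the labels."""
    n = len(labels)
    observed = Counter(zip(feature, labels))  # joint counts (feature value, class)
    feature_totals = Counter(feature)
    label_totals = Counter(labels)
    score = 0.0
    for f, f_count in feature_totals.items():
        for c, c_count in label_totals.items():
            expected = f_count * c_count / n  # count expected under independence
            score += (observed[(f, c)] - expected) ** 2 / expected
    return score


def select_top_k(columns, labels, k):
    """Return the names of the k features with the highest chi-squared score."""
    scores = {name: chi2_score(values, labels) for name, values in columns.items()}
    return sorted(scores, key=scores.get, reverse=True)[:k]


# Toy data (hypothetical): genotype-like features coded 0/1/2 and a binary outcome.
columns = {
    "snp_a": [0, 0, 1, 1, 2, 2, 0, 2],
    "snp_b": [1, 1, 1, 1, 1, 1, 1, 1],  # constant, so it carries no signal
    "snp_c": [0, 1, 0, 1, 0, 1, 0, 1],
}
labels = [0, 0, 0, 1, 1, 1, 0, 1]

print(select_top_k(columns, labels, k=2))
```

With several hundred thousand features, the per-feature scores are independent of one another, which is what makes this step a natural fit for distributed execution.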

Software Engineer

2012 - 2014
SoftPlus CJSC
  • Designed and implemented new features for a control panel for Internet-Hosting LLC.
  • Integrated payment systems, social networks, domain name registrars, and an SMS sending service.
Technologies: Selenium, Bootstrap, jQuery, JavaScript, MySQL, Java, Groovy, Grails

Software Engineer

2011 - 2012
Science Research Institute of Measuring Technology — Radio Systems
  • Contributed to the C/C++ and Java software development for landing systems.
Technologies: JavaFX, Java, Qt Quick, Qt

Classifying Clinically Actionable Genetic Mutations

https://www.kaggle.com/c/msk-redefining-cancer-treatment/leaderboard
I worked on this Kaggle multiclass classification problem for text articles containing mutation descriptions. I implemented a term-weighting scheme and a centroid-based classifier, which won the competition.
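
To make the idea of a term-weighted, centroid-based classifier concrete, here is a minimal sketch in plain Python. It assumes standard smoothed TF-IDF as the term-weighting scheme and cosine similarity to class centroids; the profile does not say which weighting scheme the winning solution used, so treat both choices, the function names, and the toy sentences as assumptions.

```python
import math
from collections import Counter, defaultdict


def weigh(doc, idf):
    """Weight one tokenized document with an IDF table and scale it to unit length."""
    vec = {t: tf * idf[t] for t, tf in Counter(doc).items() if t in idf}
    norm = math.sqrt(sum(w * w for w in vec.values())) or 1.0
    return {t: w / norm for t, w in vec.items()}


def tfidf_vectors(docs):
    """Build a smoothed IDF table from tokenized docs and return (vectors, idf)."""
    n = len(docs)
    df = Counter(term for doc in docs for term in set(doc))
    idf = {t: math.log((1 + n) / (1 + count)) + 1 for t, count in df.items()}
    return [weigh(doc, idf) for doc in docs], idf


def fit_centroids(vectors, labels):
    """Average the vectors of each class into one centroid per class."""
    sums, counts = defaultdict(Counter), Counter(labels)
    for vec, label in zip(vectors, labels):
        for t, w in vec.items():
            sums[label][t] += w
    return {lab: {t: w / counts[lab] for t, w in s.items()} for lab, s in sums.items()}


def predict(vec, centroids):
    """Pick the class whose centroid has the highest cosine similarity to vec."""
    def cosine(a, b):
        dot = sum(w * b.get(t, 0.0) for t, w in a.items())
        norm_b = math.sqrt(sum(w * w for w in b.values())) or 1.0
        return dot / norm_b  # a is already unit length (or empty)
    return max(centroids, key=lambda lab: cosine(vec, centroids[lab]))


# Toy training set (hypothetical sentences and class names).
train = [
    ("gain of function mutation activates kinase".split(), "gain"),
    ("mutation activates signaling and increases kinase activity".split(), "gain"),
    ("loss of function mutation truncates protein".split(), "loss"),
    ("deletion truncates protein and abolishes activity".split(), "loss"),
]
docs, labels = zip(*train)
vectors, idf = tfidf_vectors(docs)
centroids = fit_centroids(vectors, labels)

query = weigh("variant activates kinase signaling".split(), idf)
print(predict(query, centroids))
```

A centroid classifier stores only one vector per class, so prediction cost grows with the number of classes rather than with the number of training documents.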

Prediction of Complex Diseases Using Genomic and Environmental Data

This project centered around the prediction of type II diabetes using genomic and epidemiological data collected from European residents. The project was challenging due to the large datasets where the number of features was much higher than the number of samples.

I implemented the distributed pipeline for the classification task of predicting a complex disease, in this case type II diabetes, which is believed to depend on a combination of several genes together with lifestyle and environmental factors.

Right Whale Recognition

https://arxiv.org/abs/1604.05605
This tool helps to identify individual right whales (Eubalaena glacialis) from aerial photographs using a CNN architecture.

Protein Classification From Primary Structures in the Context of Database Biocuration

https://upcommons.upc.edu/bitstream/handle/2117/106701/124491.pdf
I implemented a type of protein classification using the word2vec model to transform protein primary structures. The project included two main parts: an unsupervised transformation of protein sequences of arbitrary length into fixed-size real-valued vectors, and classification of the proteins using the transformed vectors.
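
The first part, turning a sequence of arbitrary length into a fixed-size vector, can be sketched as follows. This is only one common way to do it and is an assumption here: the sequence is split into overlapping k-mers ("words"), and the vectors of those k-mers are averaged. The tiny `kmer_vectors` table is hypothetical; in practice a trained word2vec model would supply the vectors.

```python
def kmers(sequence, k=3):
    """Split a protein sequence into overlapping k-mers ("words")."""
    return [sequence[i:i + k] for i in range(len(sequence) - k + 1)]


def embed(sequence, kmer_vectors, k=3):
    """Average the vectors of the known k-mers into one fixed-size vector."""
    dim = len(next(iter(kmer_vectors.values())))
    known = [kmer_vectors[w] for w in kmers(sequence, k) if w in kmer_vectors]
    if not known:
        return [0.0] * dim  # no known k-mer: fall back to the zero vector
    return [sum(column) / len(known) for column in zip(*known)]


# Hypothetical 2-dimensional k-mer vectors standing in for a trained model.
kmer_vectors = {"MKV": [1.0, 0.0], "KVL": [0.0, 1.0], "VLA": [1.0, 1.0]}

print(embed("MKVL", kmer_vectors))      # short sequence
print(embed("MKVLAAGG", kmer_vectors))  # longer sequence, same output size
```

Whatever the sequence length, the output has the dimensionality of the k-mer vectors, so any standard classifier can consume it in the second part.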

2015 - 2017

Master's Degree in Artificial Intelligence

Universitat Politecnica de Catalunya (UPC), Universitat de Barcelona (UB), and Universitat Rovira i Virgili (URV) - Barcelona, Spain

2006 - 2011

Specialist's Degree in Applied Mathematics

South Ural State University - Cheliabinsk

Libraries/APIs

Pandas, jQuery, Scikit-learn, Keras, TensorFlow, MLlib

Tools

PyCharm, Git, Jupyter, TensorBoard, Seaborn, Plotly

Languages

Python, Scala, Groovy, Java, JavaScript, Java 7, C++

Frameworks

Spark, Flask, Grails, Bootstrap, Selenium, Qt Quick, Apache Spark, Qt

Platforms

MacOS, Linux, JavaFX

Storage

MySQL, HDFS

Other

Data Science, Data Analysis, Data Analytics, Statistics, Machine Learning, Neural Networks, Parquet, Visualization
