### Machine Learning and Deep Learning Consultant

2018 - PRESENT, Self-employed

- Developed a recommendation system for an activity startup. The model was integrated into a personalized email generation system. Results showed a 30 percent increase in the matching rate.
- Created a solar panel crack detection model. Developed supporting scripts for image alignment and grid-to-cell cropping, an automatic relabeling tool for supervise.ly, and a memory-efficient forecasting pipeline with 96 percent accuracy.
- Built a dental X-ray (bitewing) disease detection model. It uses semantic segmentation with extended augmentation techniques and reaches 88 percent accuracy for the cavity class.
- Developed an automatic detection model for children's speech defects. Performed extensive feature engineering and data cleaning. The average F1 score across 60 tasks was 0.90.

Technologies: Word2vec, Unsupervised Learning, Unstructured Data Analysis, Test Automation, TensorBoard, Supervised Learning, Statistics, Statistical Data Analysis, Statistical Analysis, SpaCy, Software Development, Scrum, Scikit-learn, Scikit-image, SciPy, SQL, Research, Requirements Analysis, Recurrent Neural Networks, Random Forests, REST APIs, PyCharm, Project Management, Probability Theory, Plotly, Pandas, Optimization, OpenCV, Object Recognition, Object Detection, Natural Language Processing (NLP), NLTK, MySQL, Modeling, Matplotlib, Machine Learning Automation, Linux, Jupyter Notebook, Jupyter, JSON, Gradient Boosting, Gradient Boosted Trees, Google Cloud Platform (GCP), Google APIs, GitHub, Git, Geometry, Feature Analysis, Exploratory Data Analysis, EDA, Docker, Deep Neural Networks, Decision Trees, Decision Tree Classification, Data Science, Automated Data Processing, Data Processing Automation, Data Processing, Data Preprocessing, Data Preparation, Database Modeling, Data Mining, Data Collection, Data Cleaning, Convolutional Neural Networks, Computer Vision Algorithms, Complex Data Analysis, Communication, Classification Algorithms, Classification, Bash Scripting, Bash Script, Bash, Artificial Intelligence (AI), Analytics, Amazon Web Services (AWS), Amazon API, Algorithms, Agile, API Development, Deep Learning, Machine Learning, Signal Filtering, Signal Analysis, Audio Processing, Time Series Analysis, Time Series, Digital Signal Processing, Scripting, Image Recognition, Image Processing, XGBoost, LightGBM, NumPy, REST, Flask, Back-end, Recommendation Systems, Keras, TensorFlow, Computer Vision, Python 3, Python 2, Python

### Data Science Consultant

2017 - PRESENT, Self-employed

- Developed an approach for detecting common patterns in time series. The algorithm is flexible enough to be used across different data domains (FMCG, energy consumption data, and others).
- Created an efficient model for estimating employee adaptation quality from sparse and short data. Real-time predictions show that the model detects signs of poor adaptation one to three months earlier than line managers.
- Integrated an optimal wake-up and sleep time detection model into a habit tracker app. The model combined several manually defined rules with ML approaches.
- Built a recurrent transaction detection approach based on 2D token representation with projection onto a Poincaré disk model. As a result, the algorithm efficiently detected transaction subgroups at different depth levels.
- Developed an optimal selling price detection model that incorporated the following features: sellers' data, supply data, ask-demand curves, and manually defined rules.

Technologies: Word2vec, Web Scraping, Unstructured Data Analysis, Trading, Test Automation, Statistical Data Analysis, Statistical Analysis, SpaCy, Software Development, Numerical Simulations, Simulations, Signal Filtering, Signal Analysis, Selenium, Scrum, Scripting, Scikit-learn, SQL, Research, Requirements Analysis, Reports, Reporting, Regression Models, Redis, Recommendation Systems, Random Forests, REST APIs, REST, Python 3, Python 2, Python, PyCharm, Plotly, Pandas, PDF Scraping, NoSQL, NLTK, MySQL, MongoDB, Modeling, Matplotlib, Discrete Mathematics, Mathematics, Mathematical Models, Machine Learning Automation, Machine Learning, Linux, Jupyter Notebook, Jupyter, JSON, High-frequency Trading (HFT), HTML, H2O AutoML, Graph Theory, Google Cloud Platform (GCP), Google APIs, GitHub, Git, Gensim, Flask, Financial Markets, Feature Analysis, Exploratory Data Analysis, EDA, Docker, Digital Signal Processing, Decision Trees, Relational Database Design, Database Schema Design, Database Design, Data Visualization, Data Scraping, Data Science, Data Reporting, Automated Data Processing, Data Processing Automation, Data Processing, Data Preprocessing, Data Preparation, Database Modeling, Data Modeling, Data Mining, Data Collection, Data Cleaning, Data Analysis, Web Dashboards, Dask, Dashboards, Computer Science, Complex Data Analysis, Communication, Clustering Algorithms, CSS, Classification Algorithms, Decision Tree Classification, Text Classification, Classification, Beautiful Soup, Bayesian Statistics, Bash Scripting, Bash Script, Bash, Artificial Intelligence (AI), Anomaly Detection, Data Analytics, Analytics, Amazon Web Services (AWS), Amazon API, Agile, API Development, Recurrent Neural Networks, Unsupervised Learning, Supervised Learning, Predictive Analytics, Predictive Modeling, Optimization, Geometry, Applied Mathematics, Mobile App Development, Back-end, Statistics, Probability Theory, CatBoost, Gradient Boosted Trees, XGBoost, LightGBM, Gradient Boosting, PyMC, SciPy, NumPy, Algorithms, Cluster Analysis, Clustering, Time Series Analysis, Time Series, Natural Language Processing (NLP), TensorFlow

### Senior Data Scientist

2014 - 2016, Power Industry

- Developed a clustering tool for power entities (more than 10,000 items; various types of data). The results made it possible to simplify and speed up the graph model used for simulation.
- Created an NLP classification tool for power-industry news (including a scraping pipeline). This helped the department build a database of current news for each entity type and region. This data was later used successfully in various models.
- Prepared technical reports and presentations and defended them before semi-technical and non-technical audiences.
- Provided mentorship and supervision for junior analysts and commercial projects.

Technologies: Time Series Analysis, Time Series, Test Automation, Statistics, Statistical Analysis, Software Development, Signal Filtering, Signal Analysis, Scripting, Scikit-learn, SciPy, Research, Regression Models, Random Forests, RStudio, R, PyCharm, Probability Theory, Predictive Modeling, Predictive Analytics, PowerPoint Design, Pandas, Optimization, NumPy, NLTK, MySQL, Modeling, Microsoft PowerPoint, Matplotlib, Discrete Mathematics, Mathematics, Mathematical Models, Machine Learning Automation, Linux, Linear Regression, Jupyter Notebook, Jupyter, JSON, Feature Analysis, EDA, Digital Signal Processing, Relational Database Design, Database Schema Design, Database Design, Data Visualization, Data Reporting, Data Processing Automation, Data Processing, Data Preprocessing, Data Preparation, Data Modeling, Data Mining, Data Collection, Data Cleaning, Scientific Data Analysis, Unstructured Data Analysis, Exploratory Data Analysis, Statistical Data Analysis, Data Analysis, Analytical Dashboards, Web Dashboards, Dashboards, Complex Data Analysis, Bayesian Statistics, Bash Scripting, Bash Script, Bash, Back-end, Applied Mathematics, Anomaly Detection, Data Analytics, Analytics, Clustering Algorithms, Algorithms, API Development, Unsupervised Learning, Supervised Learning, Requirements Analysis, Project Management, Oracle, SQL, Python 3, Python 2, Simulations, Graph Theory, Natural Language Processing (NLP), Cluster Analysis, Clustering, Numerical Simulations, Classification, Classification Algorithms, Decision Tree Classification, Decision Trees, Text Classification, Naive Bayes, Presentations, Reporting, Reports, Communication, Machine Learning, Data Scraping, Data Science, Python

### Data Scientist

2012 - 2014, Power Industry

- Developed long- and short-term power price/volume prediction models that are robust to outliers. As a result, prediction accuracy improved significantly (MAE was reduced by a factor of 2.5).
- Created a daily/weekly to hourly/daily conversion model. This approach gave the department more accurate high-granularity predictions for use in various reports and as inputs to downstream models.
- Developed scripts to automate several business processes (C#, C++, Python, Delphi, VBA, R). As a result, some processes ran four times faster.
- Built an anomaly detection model for price and volume data, with a visualization tool on top of it. The results were integrated into a dashboard for real-time early detection of emergencies.

Technologies: Unstructured Data Analysis, Statistics, Statistical Data Analysis, Statistical Analysis, Software Development, Signal Filtering, Signal Analysis, Scikit-learn, SciPy, Research, Requirements Analysis, Reports, Reporting, Random Forests, Probability Theory, Presentations, Predictive Analytics, PowerPoint Design, Pandas, Optimization, Numerical Simulations, NumPy, Naive Bayes, Modeling, Microsoft PowerPoint, Microsoft Excel, Matplotlib, Mathematics, Mathematical Models, Geometry, Machine Learning Automation, Linux, Linear Regression, Jupyter Notebook, Jupyter, Gradient Boosting, Gradient Boosted Trees, Feature Analysis, Digital Signal Processing, Decision Trees, Database Design, Data Reporting, Data Preprocessing, Data Modeling, Data Cleaning, Complex Data Analysis, Communication, Clustering Algorithms, Clustering, Cluster Analysis, C, Bayesian Statistics, Bash Scripting, Bash Script, Bash, Back-end, Applied Mathematics, Analytics, Unsupervised Learning, Supervised Learning, Data Mining, EDA, Data Analytics, Data Analysis, Exploratory Data Analysis, Data Collection, Data Preparation, Data Processing, Visual Basic for Applications (VBA), Excel VBA, Oracle, MySQL, Scripting, Decision Tree Classification, Classification Algorithms, Classification, Time Series, Dashboards, Data Visualization, Algorithms, Delphi, R, C++, C#, Graph Theory, Simulations, Regression Models, Python 2, Python 3, SQL, Predictive Modeling, Anomaly Detection, Time Series Analysis, Machine Learning, Data Science, Python

### Junior Data Scientist (ROI Modeling, Media/Marketing Mix)

2011 - 2012, BBDO Group

- Developed an anomaly detection function for an ROI predictive model. MAE was halved, which made the client's marketing budget much more efficient.
- Created an optimal marketing campaign budget estimation approach. It has been successfully used for both prior and posterior budget estimation/correction.
- Implemented a cluster-based approach for early detection of poor campaign performance, which helped clients correct their marketing strategies in advance.
- Built a CATI to CAWI (computer-assisted telephone interviewing to computer-assisted web interviewing) conversion model. The database was extended and aligned, and using this dataset significantly improved the models' validation quality.
- Developed a new approach to EDA that produced more analytical outputs. The results were added to weekly client reports (clients included MTS, one of the biggest mobile operators in Russia).
- Created an effective script for automatic media data collection and processing. The department's processes ran four times faster.

Technologies: Statistical Analysis, Research, Requirements Analysis, Reports, Regression Models, RStudio, Probability Theory, Oracle, Optimization, Naive Bayes, Modeling, Mathematics, Mathematical Models, Machine Learning, Linux, Feature Analysis, Database Design, Data Reporting, Data Cleaning, Classification Algorithms, Bayesian Statistics, Supervised Learning, Time Series Analysis, Unsupervised Learning, Statistical Data Analysis, Complex Data Analysis, Unstructured Data Analysis, Dashboards, Communication, Classification, Applied Mathematics, Analytics, Clustering Algorithms, Scripting, Data Processing, Data Preparation, Data Collection, Exploratory Data Analysis, PowerPoint Design, Presentations, Reporting, Data Analytics, Visual Basic for Applications (VBA), MySQL, Microsoft Access, Budget Modeling, Media Marketing, Marketing Mix, Data Preprocessing, Microsoft PowerPoint, Econometrics, EDA, Clustering, Marketing Strategy, Marketing Mix Modeling, Excel VBA, Microsoft Excel, Anomaly Detection, Linear Regression, Time Series, Cluster Analysis, Algorithms, SQL, ROI, Predictive Analytics, Predictive Modeling, Statistics, Data Modeling, Data Visualization, Data Mining, Data Analysis, Data Science, R