
Sergei Markochev
Verified Expert in Engineering
Data Scientist and Developer
Sergei is a lead data scientist with over 15 years of experience in applied modeling and in developing highly complex enterprise products. He has also successfully managed data science teams and contributed as a senior data scientist and solutions architect. Sergei has published six academic papers and holds one international patent in his field, and he recently won a data science competition.
Preferred Environment
Jupyter Notebook, Windows, Linux, Git, Python, Amazon Web Services (AWS)
The most amazing algorithm that I've developed was ranked #1 in an aircraft localization data science competition hosted by AIcrowd.
Work Experience
Data Science and Analytics Manager
CI&T
- Led data analytics projects with a CI&T international client.
- Enabled A/B testing on the client side by fixing code logic and investigating the tooling in depth.
- Investigated data quality and anomalies using SQL and Snowflake.
- Applied Amazon Personalize as a custom recommendation system for the client.
Senior Data Scientist
Kainos
- Developed a system that uses natural language processing (NLP) techniques to extract, manipulate, and search helpful information from employees' resumes.
- Led data investigation and prototype model development for the client (a construction company).
- Presented advanced topics on deploying applications to AWS at an internal deep-dive session.
Machine Learning (ML) Engineer
Bowen & Associates Ltd.
- Developed a state-of-the-art ML model to predict commercial property prices.
- Deployed the ML model on AWS to test its predictions.
- Advised the client on the advantages and limitations of the model, data quality, and deployment for testing.
Lead Data Scientist
GroupM
- Productionized three apps related to the investigation and optimization of global TV ad schedules.
- Developed a cross-media data fusion model with an external deduplication data set.
- Predicted digital behavior for target audiences defined by TV show viewership and vice versa using ML techniques.
- Created Looker dashboards to present POCs and data insights.
- Developed deep learning models of reach curves for individual TV channels and other combinations.
- Carried out a bespoke analysis for multibillion-dollar stakeholders.
- Communicated results to stakeholders and product managers. Managed and hired data scientists.
Data Scientist (Python)
Applied AI LLC
- Developed an ML model to classify the content of industry-specific PDF documents.
- Investigated different approaches (ML, NLP, statistical) to modeling document content.
- Advised the client on best practices and models throughout the project.
Battery Analytics Scientist
BBOXX LTD
- Invented and deployed a patented state-of-the-art algorithm for remote capacity estimation of lead-acid batteries from their telemetry.
- Produced insights on battery performance and customer usage patterns to reduce battery failure maintenance.
- Developed advanced alerting and anomaly detection systems to monitor the performance of over 100,000 solar panels (broken sensors, tampering, heavy usage, and so on).
- Developed a Bayesian survival model to predict future battery failure rates.
Assistant
Moscow Institute of Physics and Technology
- Supported and organized the educational process, conducted courses, and supervised bachelor degree routes.
- Organized and ran the department’s section at the annual university conference.
- Led laboratory courses and seminars on atomic physics and optics.
Senior Research Associate
Central Institute of Chemistry and Mechanics
- Led the experimental research on rare nuclear decays (published in five academic papers and presented at four international conferences).
- Developed a fully automated digital spectroscopic system for the investigation of rare nuclear decays (Ph.D. thesis).
- Carried out data analyses and Monte Carlo simulations.
Experience
Aircraft Localization Competition
https://github.com/smarkochev/Aircraft_localization_competition_round_2
The competition was organized by the Swiss Cyber-Defence Campus of Armasuisse Science and Technology. The data was collected by the OpenSky Network, a large-scale ADS-B sensor network for research.
• https://www.aicrowd.com/challenges/cyd-campus-aircraft-localization-competition/leaderboards
Prediction of Customer Spending
https://github.com/smarkochev/ds_notebooks/
Notebook:
• Prediction of customer spending.ipynb
Expedia Hotel Sales | Kaggle Competition
https://www.kaggle.com/c/hotelsales/
I ranked #1 among 19 teams by proposing a combination of machine learning models.
Rail-ticket Price Prediction
https://github.com/smarkochev/ds_notebooks
Notebooks:
• Rail_ticket_price_prediction_IDE.ipynb
• Rail_ticket_price_prediction_modelling.ipynb
Statoil Kaggle Competition
https://github.com/smarkochev/ds_notebooks
Notebooks:
• Statoil_Kaggle_competition_main.ipynb
• Statoil_Kaggle_competition_google_colab_notebook.ipynb
• Statoil_Kaggle_competition_DL_comparison.ipynb
Skills
Languages
SQL, Python, Octave, R, C++, Python 3, Snowflake
Libraries/APIs
Pandas, Scikit-learn, NumPy, SciPy, Keras, PyMC, PySpark, SpaCy
Paradigms
Data Science, Quantitative Research, Agile, Object-oriented Programming (OOP), ETL, Management
Platforms
Jupyter Notebook, Linux, Amazon Web Services (AWS), Azure
Storage
MySQL, PostgreSQL, Azure SQL
Other
Applied Mathematics, Data Analysis, Digital Signal Processing, Machine Learning, Data Cleaning, Nonlinear Optimization, University Teaching, Software Development, Clustering, Applied Physics, Mathematics, Scientific Data Analysis, Data Analytics, Data Visualization, Artificial Intelligence (AI), Monte Carlo Simulations, Deep Learning, Cython, Bayesian Inference & Modeling, Time Series, Predictive Modeling, Dashboard Development, Computer Vision, Multithreading, Unsupervised Learning, Statistics, Natural Language Processing (NLP), Generative Pre-trained Transformers (GPT), PDF Scraping, Dash, Software, Amplitude, mParticle, A/B Testing, Classification Algorithms, Regression Modeling
Tools
MATLAB, Looker, Git, LaTeX, LaunchDarkly, Jira
Frameworks
Spark
Education
Ph.D. in Nuclear Physics
Moscow Institute of Physics and Technology - Moscow, Russia
Master's Degree in Applied Mathematics and Physics
Moscow Institute of Physics and Technology - Moscow, Russia
Bachelor's Degree in Applied Mathematics and Physics
Moscow Institute of Physics and Technology - Moscow, Russia
Certifications
Probabilistic Graphical Models Specialization
Stanford University | via Coursera
Advanced Data Science with IBM Specialization
IBM | via Coursera