Sergei Markochev
Verified Expert in Engineering
Data Scientist and Developer
London, United Kingdom
Toptal member since May 24, 2021
Sergei is a lead data science and AI/ML developer with extensive experience—over 15 years' worth. He has led end-to-end project delivery and provided technical expertise for complex decision problems for FTSE 100 companies and SME businesses. Sergei possesses a PhD in Physics, has one patent and six academic papers, and recently won 1st place in an international data science competition.
Portfolio
Experience
Availability
Preferred Environment
Jupyter Notebook, Windows, Linux, Git, Python, Amazon Web Services (AWS), Visual Studio Code (VS Code)
The most amazing...
...algorithm I've developed was ranked number one at an aircraft localization data science competition hosted by AIcrowd.
Work Experience
Data Scientist/Data Analyst
Ultraspeed Digital Limited
- Developed software and algorithms to simulate sensors' response to a person's footsteps.
- Conducted research and carried out mathematical modeling of sprint athletes' kinematics.
- Performed experiments with sensor equipment and preprocessed and analyzed data.
Data Scientist
12435136 Canada Inc.
- Developed a revenue management system for predicting optimal prices and revenue for some real estate properties.
- Built a pricing engine for predicting the market price and elasticity of real estate properties.
- Created data processing pipelines to increase data quality.
- Implemented a chat interface using the ChatGPT model to allow users to retrieve modeling results.
Data Science and Analytics Manager
CI&T
- Developed a personalized upselling suggestion recommendation system for one of the top five worldwide fast food chain's mobile apps.
- Deployed the new recommendation system for upsells into the production environment using Terraform and AWS services.
- Planned and performed A/B testing of the new recommendation system for upsells, which showed over 50% improvement.
- Led data analytics and data quality projects for CI&T international clients.
- Championed data science in CI&T UK and developed PoC using generative AI models.
Software Developer
Tellusant
- Improved data quality and filled data gaps using machine learning and custom modeling.
- Reviewed code and helped to build an MVP. Implemented time-series prediction.
- Investigated opportunities to predict audiences for specific products through analysis of global data.
Senior Data Scientist
Kainos
- Developed a data-informed recommendation system for extraction, manipulation, and search of helpful information from employees' resumes. Used natural language processing (NLP) techniques.
- Led data investigation and prototype model development for the client (a construction company).
- Presented some advanced topics on application deployment on AWS for an internal deep dive session.
Machine Learning (ML) Engineer
Bowen & Associates Ltd.
- Developed a state-of-the-art ML model to predict commercial property prices.
- Deployed the ML model on AWS to test its predictions.
- Advised the client on advances and limitations of the model, data quality, and deployment for testing.
Lead Data Scientist
GroupM
- Productionized three apps related to investigating and optimizing global TV ad schedules.
- Developed a cross-media data fusion model with an external deduplication data set.
- Predicted digital behavior for target audiences defined by TV show viewership and vice versa using ML techniques.
- Created Looker dashboards to present POCs and data insights.
- Developed deep learning models of reach curves for individual TV channels and other combinations.
- Carried out a bespoke analysis for multibillion-dollar stakeholders.
- Communicated results to stakeholders and product managers. Managed and hired data scientists.
Data Scientist (Python)
Applied AI LLC
- Developed an ML model to classify the content of industry-specific PDF documents.
- Investigated different approaches (ML, NLP, and statistical) to the modeling of document content.
- Assisted the client on best practices and models during the project.
Battery Analytics Scientist
BBOXX LTD
- Invented and deployed a patented state-of-the-art algorithm for remote capacity estimation of lead-acid batteries by their telemetry.
- Produced insights on battery performance and customer usage patterns to reduce battery failure maintenance.
- Developed advanced alerting and anomaly detection systems to monitor over 100,000 solar panels’ performance (broken sensors, tampering, heavy usage, and so on).
- Developed a Bayesian survival model for the prediction of battery failure rate in the future.
Assistant
Moscow Institute of Physics and Technology
- Supported and organized the educational process, conducted courses, and supervised bachelor degree routes.
- Organized and provided the department’s section at the annual university conference.
- Led laboratory courses and seminars on atomic physics and optics.
Senior Research Associate
Central Institute of Chemistry and Mechanics
- Led the experimental research on rare nuclear decays (published in five academic papers and reported on in four international conferences).
- Developed a fully automated digital spectroscopic system for the investigation of rare nuclear decays (Ph.D. thesis).
- Carried out data analyses and Monte Carlo simulations.
Experience
Aircraft Localization Competition
https://github.com/smarkochev/Aircraft_localization_competition_round_2The competition was organized by the Swiss Cyber-Defence Campus of Armasuisse Science and Technology. The data was collected by the OpenSky Network, a large-scale ADS-B sensor network for research.
• https://www.aicrowd.com/challenges/cyd-campus-aircraft-localization-competition/leaderboards
Prediction of Customer Spending
https://github.com/smarkochev/ds_notebooks/Notebook:
• Prediction of customer spending.ipynb
Expedia Hotel Sales | Kaggle Competition
https://www.kaggle.com/c/hotelsales/I was ranked #1 among 19 teams proposing a combination of machine learning models.
Rail-ticket Price Prediction
https://github.com/smarkochev/ds_notebooksNotebooks:
• Rail_ticket_price_prediction_IDE.ipynb
• Rail_ticket_price_prediction_modelling.ipynb
Statoil Kaggle Competition
https://github.com/smarkochev/ds_notebooksNotebooks:
• Statoil_Kaggle_competition_main.ipynb
• Statoil_Kaggle_competition_google_colab_notebook.ipynb
• Statoil_Kaggle_competition_DL_comparison.ipynb
Education
Ph.D. in Nuclear Physics
Moscow Institute of Physics and Technology - Moscow, Russia
Master's Degree in Applied Mathematics and Physics
Moscow Institute of Physics and Technology - Moscow, Russia
Bachelor's Degree in Applied Mathematics and Physics
Moscow Institute of Physics and Technology - Moscow, Russia
Certifications
Probabilistic Graphical Models Specialization
Stanford University | via Coursera
Advanced Data Science with IBM Specialization
IBM | via Coursera
Skills
Libraries/APIs
Pandas, Scikit-learn, NumPy, SciPy, XGBoost, Matplotlib, Keras, PyMC, Spark ML, TensorFlow, PySpark, SpaCy
Tools
ARIMA, AWS Step Functions, Terraform, ChatGPT, GitLab CI/CD, MATLAB, Looker, Git, LaTeX, Microsoft Power BI, Spark SQL, Amazon SageMaker, LaunchDarkly, Jira
Languages
SQL, Python, Python 3, Snowflake, Octave, R, C++
Paradigms
Quantitative Research, Management, Agile, Object-oriented Programming (OOP), ETL
Platforms
Jupyter Notebook, Amazon Web Services (AWS), Visual Studio Code (VS Code), AWS IoT, Linux, Docker, AWS Lambda, Azure
Storage
MySQL, PostgreSQL, Azure SQL
Frameworks
AWS Serverless Application Model (SAM), Spark
Other
Applied Mathematics, Data Analysis, Digital Signal Processing, Machine Learning, Data Cleaning, Nonlinear Optimization, University Teaching, Software Development, Clustering, Applied Physics, Mathematics, Data Science, Data Analytics, Data Science, Data Visualization, Time Series, Artificial Intelligence (AI), Dash, Software, Classification Algorithms, Regression Modeling, Algorithms, Recommendation Systems, Revenue Management, Pricing, Big Data, Data Quality Analysis, Solution Architecture, Pricing Models, Dynamic Pricing, Forecasting, Large Language Models (LLMs), Mathematical Modeling, Data Preprocessing, Revenue Modeling, Modeling, Time Series Analysis, Random Number Generation, Data-informed Recommendations, Statistical Modeling, Monte Carlo Simulations, Deep Learning, Cython, Bayesian Inference & Modeling, Predictive Modeling, Dashboard Development, Computer Vision, Multithreading, Unsupervised Learning, Statistics, Natural Language Processing (NLP), A/B Testing, Generative Pre-trained Transformers (GPT), Data Scraping, CI/CD Pipelines, AI Agents, LangChain, Machine Learning Operations (MLOps), Transformer Models, Collaborative Filtering, AI Chatbots, Principal Component Analysis (PCA), Document Parsing, OpenAI GPT-3 API, OpenAI GPT-4 API, PDF Scraping, Amplitude, mParticle, FastAPI, Content-based Filtering, Sensor Data
How to Work with Toptal
Toptal matches you directly with global industry experts from our network in hours—not weeks or months.
Share your needs
Choose your talent
Start your risk-free talent trial
Top talent is in high demand.
Start hiring