

Yaroslav Kopotilov
Verified Expert in Engineering
Data Scientist and Developer
Belgrade, Serbia
Toptal member since April 9, 2020
Yaroslav is a senior data scientist with extensive experience in business analysis, predictive modeling, data visualization, data orchestration, and deployment. He has a proven track record of managing complex data science projects and leading small, agile developer teams.
Portfolio
Experience
- Python - 8 years
- Machine Learning - 8 years
- Time Series Analysis - 6 years
- Statistics - 5 years
- Data Engineering - 4 years
- SQL - 4 years
- Data Visualization - 3 years
- Stakeholder Engagement - 3 years
Preferred Environment
Git, Jupyter, Linux, Visual Studio Code (VS Code), SQL, Python, MacOS, NoSQL, Docker
The most amazing...
...thing I designed and built is an algorithmic strategy combining multiple data pipelines, capable of sub-second execution (Python, SQL, AMQP, Docker)
Work Experience
ML/AI Architect
Inteleos, Inc. - Main
- Architected AWS-based pipelines (S3, DynamoDB, Lambda, ECR, Snowflake) to train and deploy 20+ time series forecasting models.
- Created guidelines for machine learning (ML) research and deployment, helping to translate findings into scalable production systems.
- Mentored a junior data scientist, guiding code reviews, model evaluation, and best practices for reproducible research.
- Designed a proof-of-concept AI assistant for data search and summarization (AWS, Snowflake).
Founder | CEO | Speaker
Data Sanity
- Organized four international AI conferences in Serbia and the UK, attracting up to 200 participants each, plus numerous smaller events.
- Co-authored and lectured in the open course “Intro to AI Agents” at the University of Belgrade, attracting 100 students and professionals.
- Assembled and managed a team of 10+ contractors and volunteers.
Senior ML Engineer | Quant Researcher
US Fintech Startup
- Analyzed data from several vendors on US prediction markets and selected sources with the highest signal.
- Independently implemented and backtested an ML-driven trading strategy to verify the previous backtest results.
- Advised on the ML infrastructure and the company's long-term development.
Python and Machine Learning Developer
codeValet Inc.
- Contributed to the redesign of the ML architecture, drastically simplifying and improving the efficiency of an AI platform that integrates NLP and mathematical reasoning.
- Streamlined asynchronous graph and optimization computations using PyTorch, accelerating AI pipelines by over 50x.
- Developed a GraphRAG extension extracting relevant nodes from large codebases in under a second.
Prompt and Software Engineer (via Toptal)
Invisible Technologies Inc
- Developed a Python pipeline for large-scale prompt prototyping, reducing experimentation time severalfold.
- Architected and refined methods for evaluating large language model (LLM) responses based on correctness, safety, and relevance.
- Tested a variety of closed-source and open-source LLMs.
Senior Data Scientist | Python Developer
Bumbee Labs Ab
- Designed an improved version of a visit count algorithm that reduces the out-of-sample model error by 50%.
- Developed a framework for robust machine learning (ML) model prototyping and evaluation by the data science team.
- Accelerated historical sample data processing in Python by more than 50x through the use of more efficient functions and just-in-time compilation. This reduced the processing time for one day of sample data from one hour to one minute.
- Built a 24/7 data pipeline consuming wifi sample data from multiple sensor installations and saving them in an SQL database for historical data analysis and model evaluation.
Lead Data Scientist and Developer | CEO | Founder
YAFinData
- Designed and built a financial data and data analytics platform. The data is shipped in a unified, user-friendly format and can be accessed via a web app and REST API.
- Managed a remote team of up to five developers. Determined the overall direction of product development.
- Analyzed trading opportunities in the UK electricity markets. Backtested several short-term algorithmic strategies. Estimated PnL and risks, accounting for slippage and market impact.
- Created several 24/7 ETL pipelines that collect, clean, and save data for the UK electricity market. Implemented downstream features that are continuously computed from the data feeds in less than 10 ms.
- Developed CI/CD, a backup raw file storage, a parallel redundancy, and a monitoring system to ensure the data collection functions smoothly 24/7.
Systematic Trading - Data Scientist and Developer
TickUp AB
- Analyzed and unified multiple datasets for US equity markets.
- Developed an ML model and several data pipelines for an algorithmic trading strategy.
- Wrote and reviewed both research notebooks and production code.
- Organized a 7-day company meetup, which helped boost team productivity and collaboration.
Equity Trading - Quant Researcher
Independent Client
- Analyzed financial and fundamental data on publicly traded companies.
- Identified and improved a trading signal for a daily equity strategy.
- Presented strategy backtest results and handed off research for implementation.
Energy Trading - Data Scientist
Vitol
- Created market analysis tools and systematic strategies for coal, power, and crude desks. Covered all phases of a data science project, including project setup, data pipelines, modeling, and deployment.
- Analyzed the firm-wide trading market impact under different execution styles.
- Worked with both small (50 data points) and large (several terabytes) datasets.
- Contributed individually and in collaboration with the data science and IT teams.
- Assisted Vitol's employees in Python and machine learning training.
Model Validation, Commodities - Associate
JPMorgan
- Implemented a custom version of the extended Kalman filter from scratch to calibrate exotic option pricing models that outperformed the existing calibration methods.
- Reviewed ten pricing models' options and their implementations in commodities and credit.
- Measured and mitigated numerous model risks in collaboration with the desk and developers.
- Mentored junior employees during their review work.
Algorithmic Trading - Quant Researcher
Credit Suisse
- Designed and implemented two mid-frequency trading strategies for the commodity desk.
- Analyzed portfolio hedging strategies using risk factors for the equity desk.
- Implemented a data pipeline that cleaned and transformed tabular data for the equity desk.
ML Research (Intern)
Novosibirsk State University
- Wrote a research paper describing a metric that uses Fourier descriptors to compare shapes with internal gaps.
- Implemented a classification algorithm that achieved 98% accuracy on a dataset with 19 classes of images.
- Presented the results at the scientific conference MNSK 2015, Novosibirsk.
Experience
Cancer Treatment Research
https://www.milner.cam.ac.uk/machinelearning/OpenAI GPT Telegram Bot
I contributed as a data scientist (GPT model benchmarking, prompt engineering, and text embeddings), software developer (asynchronous Python code and the OpenAI API), project manager, and mentor to junior data scientists.
Stranger News
Top 1 in Time Series Forecast Competition on Kaggle
https://www.kaggle.com/myster/eda-prophet-winning-solution-3-0Exploring and visualizing the dataset was both fun and rewarding, as I uncovered interesting quirks in the data. Notably, I soon realized the dataset had been synthetically generated, which provided a crucial clue for solving the problem. In the end, my analysis paid off — my team secured first place!
Data Sanity Talks Website
https://datasanity.dev/Interactive Website
https://datascienceforhire.net/Data Science Examples
https://github.com/mysterious-ben/ds-examples/Python Data Pipelining Tools
https://github.com/mysterious-ben/apipePlease check out my GitHub page to see other data science and data engineering packages.
Education
Master's Degree in Financial Mathematics
Université Pierre et Marie Curie - Paris, France
Master's Degree in Applied Mathematics
École Polytechnique - Paris, France
Master's Degree in Mathematics and Computer Science
Novosibirsk State University - Novosibirsk, Russia
Bachelor's Degree in Probability and Statistics
Novosibirsk State University - Novosibirsk, Russia
Skills
Libraries/APIs
Scikit-learn, Pandas, NumPy, Matplotlib, XGBoost, OpenCV, REST APIs, SQLAlchemy, SciPy, Python Asyncio, Dask, PyTorch, TensorFlow, Asyncio, AMQP, SpaCy, OpenAI API
Tools
Jupyter, Git, StatsModels, PyCharm, ChatGPT, Algorithm Design, AI Prompts, Claude, Claude Code, Amazon Athena, ActiveBatch, MATLAB, Kibana, Plotly, Boto 3, Ansible, GitHub, Bitbucket, Grafana, GIS, Tableau, AWS Command Line Interface (CLI), AWS Deployment, Prefect, Seaborn
Languages
Python, SQL, R, C++, Java, HTML, CSS, XML, JavaScript, Snowflake, Rust
Storage
Data Pipelines, Oracle SQL, PostgreSQL, Amazon S3 (AWS S3), SQLite, MongoDB, PostGIS, NoSQL
Frameworks
LightGBM, Spark, Flask, LangGraph
Paradigms
Object-oriented Programming (OOP), Quantitative Research, Agile Software Development, Functional Analysis, STOMP, DevOps, Real-time Systems
Platforms
Amazon Web Services (AWS), Jupyter Notebook, AWS IoT, Docker, Linux, MacOS, Visual Studio Code (VS Code), NVIDIA CUDA, Heroku, Ubuntu
Industry Expertise
Project Management, Trading Systems, High-frequency Trading (HFT)
Other
Predictive Modeling, Forecasting, Artificial Intelligence (AI), Data Analysis, Predictive Analytics, Data Science, Statistics, Machine Learning, Supervised Learning, Algorithmic Trading, Regression, Data Analytics, Backtesting Trading Strategies, Minimum Viable Product (MVP), Feature Engineering, Data Cleansing, Statistical Modeling, Time Series, Web Dashboards, Machine Learning Operations (MLOps), Algorithms, Time Series Analysis, Mathematics, Data Visualization, Stakeholder Engagement, Data Engineering, Option Pricing, Unsupervised Learning, Finance, Trading, Financial Markets, Remote Team Leadership, Financial Data, Dashboards, Quantitative Analysis, Quantitative Modeling, Quantitative Finance, Quantitative Risk Analysis, Statistical Analysis, Natural Language Processing (NLP), Financial Modeling, Numba, Financial Software, OpenAI, Metrics, Prompt Engineering, Technical Leadership, Bayesian Statistics, Large Language Models (LLMs), Stock Market, Retrieval-augmented Generation (RAG), AI Consulting, Data Structures, AI Algorithms, Mathematical Modeling, Monte Carlo, Monte Carlo Simulations, Architecture, Finance APIs, Financial Market Data, Real-time Data, Solution Architecture, LangChain, Agentic AI, Random Forests, Optimization, RAG Systems, AI Agents, Risk Management, AI Architecture, AI Model Training, Training, Time Series Forecasting, Logistic Regression, Code Deployment, Futures & Options, Energy, Systematic Trading, Deep Learning, Probability Theory, Mathematical Analysis, Applied Mathematics, Derivative Pricing, Chemistry, Stochastic Modeling, Stochastic Differential Equations, Econometrics, Economics, Computer Vision, Software Development, Genetic Algorithms, Dash, Data Mining, Equity Market Data, Cloud Services, Technical Hiring, Code Review, IT Project Management, Team Leadership, Big Data, APIs, OpenAI GPT-3 API, OpenAI GPT-4 API, Telegram Bots, IT Product Management, Trade Finance, Audio Processing, Numerical Methods, Reports, Applied Physics, Mentorship, Leadership, Mentorship & Coaching, Bayesian Inference & Modeling, CTO, ChatGPT Prompts, Coaching, Workshops, WiFi, Market Risk, Software Engineering, Open-source LLMs, Biotechnology, Investment Banking, Integration, Vector Databases, Geospatial Data, API Integration, AI Chatbots, Chatbots, Product Management, Lean Project Management, Communication, Community, Public Speaking, Hugging Face, Cython, Gemini, Cloud, Front-end Development, AI-generated Code, GitOps, Containerization, Business Development, Website Design, Web Development, Vector Search, Graphs, Linear Regression, System Design, Software Architecture, Back-end Development, Equities, Data Architecture, Spatial Analysis, Combinatorial Optimization, CI/CD Pipelines, Infrastructure, Parquet, Hetzner, Amazon Bedrock, Probabilistic Modeling, Time Series Data
How to Work with Toptal
Toptal matches you directly with global industry experts from our network in hours—not weeks or months.
Share your needs
Choose your talent
Start your risk-free talent trial
Top talent is in high demand.
Start hiring