Andrew Chauzov
Verified Expert in Engineering
Machine Learning Engineer and Developer
Leveraging over a decade of expertise in data science, machine learning, and AI, Andrew has transformed data into actionable insights and strategic assets across various industries. This journey has led him to bring numerous projects from ideation to deployment, profoundly influencing product strategies and lineups.
Portfolio
Experience
Availability
Preferred Environment
PyCharm, GitHub, Google Cloud Platform (GCP), Amazon Web Services (AWS)
The most amazing...
...thing I've done was develop a player-team fit algorithm using football data that enhances talent scouting and reshapes team strategies.
Work Experience
Data Scientist | ML Engineer | Data-Driven Analytics
CRED Investments UK Ltd
- Brought over 25 projects from ideation to deployment, including several foundation models, delivering data-driven insights and diversifying the company’s product lineup.
- Developed LLM-based multimodal models for data enrichment and standardization, restoring 90% of missing data (around 450 million points), enhancing data quality, and directly boosting revenue growth in client projects.
- Collaborated on 10+ sports analytics models that professional scouts and clubs adopted. These models optimized scouting processes and costs and are featured in mobile apps and dashboards.
- Initiated data quality initiatives, including a game statistics pipeline that improved the precision of key predictive models, impacting the accuracy of player performance assessments.
- Built statistical models that introduced matching/recommendations between 100,000+ businesses and 400 million consumers, directly reducing customer attraction costs.
- Established a dual-layer regression model for football market value forecasts with <10% error, improving strategic decision-making for industry experts.
- Deployed an athlete retirement prediction model with a one-season margin of error, managing data irregularities and guiding strategic investments.
- Engineered a salary prediction pipeline with less than 10% error, providing insights that deepened client understanding of consumer demographics.
- Innovated algorithms for customer data refinement, enhancing segmentation and impacting ad campaigns, reducing marketing expenditures.
- Leveraged generative AI to analyze and process 10 million social media profiles, adding depth to our datasets.
DS and ML Consultant | Algorithm Design & NLP
Independent Consulting Services
- Collaborated with startups and consultancies to develop ML solutions (predictive analytics, forecasting, and NLP), achieving cost savings and revenue growth. The role required teamwork, self-management, and an understanding of client needs.
- Created an employee churn detection model with HR, providing actionable insights praised by stakeholders. Boosted retention and identified 10+ at-risk employees.
- Developed a PDF processing algorithm that extracts structured data from diverse financial documents dating back 10 years (totaling over 1,000 papers); this achieved an accuracy rate of over 95%, enabling custom search functionalities.
- Innovated a speech anomaly detection algorithm with 70-95% accuracy across more than 40 defect types. This solution, resulting from collaborative R&D efforts, implemented a core update to a healthcare mobile app.
- Devised a horse race betting system offering real-time, low-latency betting suggestions. This method doubled the performance of the previous system and resulted in an approximate 2% increase in revenue.
- Created a sales volume clustering algorithm, which led to a 15% improvement in sales planning and effectiveness.
Middle/Senior Data Analyst | Modeling Impact & Revenue Growth
Association 'Non-Profit Market Council'
- Played a key role in developing data-driven strategies and managing data processes and analytics. As senior analyst, led projects enhancing decision-making, yielding 2.5-10% revenue increases in the following year.
- Mentored and led three junior analysts, fostering a learning and professional growth culture. The guidance facilitated skill enhancement and resulted in two promotions within the year, demonstrating a commitment to team development and leadership.
- Revolutionized operational efficiency within the team, reducing task completion time from four days to eight hours through improving automation scripts, impacting the department's proficiency in delivering quick and accurate reports and insights.
- Applied time series analysis and data science techniques for anomaly detection, identifying around 50 critical periods annually. This led to a 5-25% reduction in forecast error rates for power price/volume predictions in targeted regions.
- Introduced a data enrichment algorithm, aggregating daily data into weekly and monthly summaries; this innovation improved analysis accuracy during volatile periods, contributing to more reliable forecasting models.
- Promoted to senior analyst in 2014 for exceptional predictive modeling expertise and productivity enhancements, having developed over 10 models that influenced strategic decisions and operational efficiency.
- Pitched and received approval for implementing five predictive models in over 20 stakeholder meetings by communicating their technical and business impact.
Junior Data Analyst | Data Processing & Analytical Modeling
BrandScience
- Facilitated execution of data-centric strategies, focusing on data processing and analytical modeling and driving insights influencing strategic decisions.
- Developed and implemented SQL and VBA-based aggregation logic, incorporating correlation analysis to expand data by 2.5x and enable multiple data sources for modeling, thereby enhancing reliability and supporting more informed decision-making.
- Streamlined media data collection (CATI/CAWI) and processing by introducing automated scripts, increasing departmental task efficiency by 300%, and reducing algorithm execution time from two hours to 30 minutes while making it fully automatic.
- Initiated cluster analysis to estimate early-stage campaign efficiency, enabling more strategic budget pre-allocation. This approach improved budget allocation effectiveness by an average of 20% across over 10 campaigns.
- Enhanced ROI models by integrating a VBA-based anomaly detection function, stabilizing predictions at the early forecasting stages, reducing expenditures by 50%, and boosting client marketing budget efficiency.
- Collaborated in refining the ROI prediction regression model using Excel/VBA, boosting campaign efficiency by 10% (improving brand knowledge from approximately 80% to 90%).
Experience
AutoML: Unsupervised Model Training with Optuna & SHAP Feature Selection
https://github.com/avchauzov/ml_training_optuna_shapAimed to build an AutoML platform, leveraging seasoned expertise and the best methodologies for optimal performance.
APPROACH
Focused on refining hyperparameter optimization for greater efficiency, innovating in feature selection for more profound results, and ensuring development integrity through rigorous testing, making the system robust and user-friendly.
RESULT
The platform now effectively supports essential model types like linear, gradient boosting, and Naive Bayes, with ongoing enhancements to broaden its modeling capabilities and application scope.
Web Scraping: Efficient Data Collection Using Selenium and Headless Chrome
https://github.com/avchauzov/teamform_web_scrapingTo create a web scraping tool capable of gathering league ranking data from TeamForm, utilizing a headless Chrome browser for efficient data collection.
APPROACH
Deployed Selenium for automated web navigation and scraping, with added functionality to manage data load for memory efficiency. The design also allows for future expansion to collect more detailed 'Club' and 'National' data.
RESULT
Successfully extracted league ranking data, offering valuable insights into team standings and performance, and set the groundwork for expanded data retrieval capabilities.
Algorithm: DTW-Based Hierarchical Clustering for FMCG Sales Time Series Analysis
https://github.com/avchauzov/time_series_clusteringTo unlock insights within consumer goods sales data through detailed analysis and clustering to highlight patterns and trends.
APPROACH
Conducted thorough time series analysis, including data cleaning for quality and employing Dynamic Time Warping (DTW) to pinpoint similarities, alongside developing a NumPy-based clustering algorithm for efficient data aggregation.
RESULT
Successfully clustered over 10,000 time series data points, revealing meaningful patterns and trends, significantly enhancing data understanding and strategic planning capabilities.
Skills
Languages
Python, SQL, R, C++
Libraries/APIs
NumPy, API Development, TensorFlow, PyTorch
Tools
PyCharm, Git, ChatGPT, GitHub
Paradigms
Data Science
Storage
MySQL, PL/SQL, Databases
Other
Data Visualization, Machine Learning, Predictive Modeling, Natural Language Processing (NLP), Data Analysis, Data Mining, Communication, Artificial Intelligence (AI), Data Modeling, Optimization, Analytical Thinking, Analytics, Statistics, Prompt Engineering, Cloud Computing, Teamwork, Presentations, Forecasting, Data Analytics, Statistical Data Analysis, Mathematics, Statistical Analysis, Programming, Deep Learning, Software Development, Large Language Models (LLMs), Big Data, Generative AI, English, Algorithms, Computer Vision, Computer Science
Platforms
Google Cloud Platform (GCP), Docker, Linux, Amazon Web Services (AWS)
Industry Expertise
Project Management
Education
Master's Degree in Applied Mathematics and Computer Science
Peoples' Friendship University of Russia - Moscow, Russia
How to Work with Toptal
Toptal matches you directly with global industry experts from our network in hours—not weeks or months.
Share your needs
Choose your talent
Start your risk-free talent trial
Top talent is in high demand.
Start hiring