Viktor Petukhov
Verified Expert in Engineering
Data Scientist and AI Developer
Tbilisi, Georgia
Toptal member since September 22, 2022
Viktor holds a PhD in biostatistics, during which he developed eight open-source packages used by thousands of researchers worldwide. He also worked as a data science team lead and an independent AI consultant, translating business needs into technical terms. Combining scientific and enterprise experience, Viktor can support companies on the full spectrum, from formulating a business problem to building a production-ready AI solution.
Portfolio
Experience
Availability
Preferred Environment
Linux, Jupyter, RStudio, Git, Visual Studio Code (VS Code), PyCharm, CLion
The most amazing...
...project I've developed is a pipeline for unsupervised segmentation of imaging-based transcriptomics data, actively used among the top five labs in the field.
Work Experience
CTO
Tentakel
- Developed an MVP of a product for a service that converts natural language queries to SQL and data visualization.
- Developed a working product for a retrieval-augmented generation (RAG) pipeline over gigabytes of documents.
- Hired and managed a team of three software developers.
Data Science Consultant
Self-employed
- Developed a novel algorithm for interpretable detection of truck failures for Viaduct.ai.
- Created a strategy for starting a computer vision department in a drone-producing startup. It helped to attract $300,000 in funding, and the department was successfully started.
- Analyzed thousands of job ads within the alternative protein industry, which significantly improved the HR agency's business strategy.
Data Science Team Lead
CleverBots
- Developed a match-making algorithm for networking and event recommendation deployed on a week-long educational event hosting over 1,000 participants.
- Built a market share prediction algorithm that achieved more than 95% accuracy.
- Managed developers working on a project for churn-rate prediction for an online educational platform.
Algorithm Developer
EPAM Systems
- Implemented an algorithm for DNA structural variations search using optical map sequencing.
- Re-implemented an algorithm for trend analysis of drug effects, porting it to a new experiment management system.
- Expanded a dose-response curve-fitting algorithm, adding additional curve parameters.
Experience
Bayesian Segmentation of Imaging-based Spatial Transcriptomics Data
https://github.com/kharchenkolab/BaysordropEst: Pipeline for Low-level Processing of scRNA-seq Data
https://github.com/kharchenkolab/dropEst• The first part performs data extraction and conversion from raw sequencing data into a format suitable for data analysis (gene expression matrices).
• The second part corrects sequencing errors and filters the noise in data using string algorithms, Bayesian statistics, and machine learning techniques.
ggrastr: An R Package for Improved Data Visualization
https://cran.r-project.org/web/packages/ggrastr/index.htmlEducation
PhD in Biostatistics
University of Copenhagen - Copenhagen, Denmark
Master's Degree in Informatics and Applied Mathematics
St. Petersburg Polytechnic University - St. Petersburg, Russia
Bachelor's Degree in Informatics and Applied Mathematics
South Ural State University - Chelyabinsk, Russia
Skills
Libraries/APIs
Natural Language Toolkit (NLTK), REST APIs, PyTorch
Tools
Jupyter, Git, PyCharm, CLion
Languages
R, Python, C++, Julia, C#, Java, Perl, SAS, SQL
Platforms
Linux, RStudio, Visual Studio Code (VS Code), Software Design Patterns
Industry Expertise
Bioinformatics
Frameworks
RStudio Shiny
Paradigms
Management
Other
Machine Learning, Computational Biology, Data Analysis, Research, Bayesian Statistics, Statistical Modeling, Network Analysis, Graph Theory, Data Visualization, Artificial Intelligence (AI), Data Science, Data Manipulation, Data Analytics, Large Data Sets, Statistics, Linear Algebra, Linear Optimization, Life Science, Probabilistic Graphical Models, Probability Theory, Data Scraping, Natural Language Processing (NLP), Algorithms, Open Source, Genomics, Data Extraction, Generative Pre-trained Transformers (GPT), Molecular Biology, Strategy, Computer Vision, Business Cases, Deep Learning, Language Models, Biology, Fundraising
How to Work with Toptal
Toptal matches you directly with global industry experts from our network in hours—not weeks or months.
Share your needs
Choose your talent
Start your risk-free talent trial
Top talent is in high demand.
Start hiring