Anzor Gozalishvili, Developer in Tbilisi, Georgia
Anzor is available for hire
Hire Anzor

Anzor Gozalishvili

Machine Learning Developer

Tbilisi, Georgia

Toptal member since November 17, 2021

Bio

Anzor is a Senior ML/MLOps Engineer bridging deep research and industrial AI. He specializes in architecting enterprise platforms (Dagster, SageMaker, and W&B) to move R&D from notebooks to production. His expertise spans agentic LLM frameworks (RAG, MCP), drug discovery, and financial modeling. A published researcher in Nature and Springer, Anzor delivers scalable, high-impact solutions in bioinformatics, NLP, and multi-target regression models that drive global revenue.

Portfolio

Self-employed
Dagster, Amazon SageMaker, Machine Learning Operations (MLOps), LangChain...
KYROS Insights
PyTorch Lightning, Regression, Clustering Algorithms, Profiling, MLflow...
Pfizer - Manufacturing Operations Solutions
Python, Natural Language Processing (NLP)...

Experience

  • Docker - 4 years
  • PyTorch - 3 years
  • SpaCy - 3 years
  • Amazon EC2 - 3 years
  • Amazon S3 (AWS S3) - 3 years
  • Machine Learning Operations (MLOps) - 2 years
  • Amazon SageMaker - 2 years
  • Amazon SageMaker Pipelines - 1 year

Preferred Environment

PyCharm, MacOS, Jupyter Notebook, Docker, PyTorch, Linux, Transformers, Amazon Web Services (AWS), Amazon SageMaker

The most amazing...

...thing I've developed was the core ML part of a shipping document analysis for key information extraction at Holocene GmbH, a German startup.

Work Experience

Senior MLOps Engineer

2024 - 2025
Self-employed
  • Architected an end-to-end platform using Dagster and W&B to manage 100s of drug-discovery training pipelines. Enabled scalable model orchestration and lineage tracking via S3, streamlining complex R&D workflows.
  • Built an LLM-powered agent with a chat interface, integrating vector search and custom MCP tools for automated workflows. Streamlined knowledge retrieval across internal systems via RAG and structured database access.
  • Architected a SageMaker MLOps platform to migrate HPC workflows. Automated custom image builds and pipeline orchestration, transitioning from notebooks to production AWS pipelines to significantly accelerate development.
  • Partnered with bioinformaticians to translate complex NGS and potency model requirements into technical specifications, aligning R&D drug-discovery objectives with engineering execution for specialized workflows.
Technologies: Dagster, Amazon SageMaker, Machine Learning Operations (MLOps), LangChain, LangGraph, AI Agents, Model Context Protocol (MCP), AI Tools, Claude Code, Large Language Models (LLMs), Large Language Model Operations (LLMOps), Python, Retrieval-augmented Generation (RAG), Vector Search, Claude, Amazon Elastic Container Registry (ECR), Amazon Elastic Container Service (ECS), FastAPI, Amazon Bedrock, Agentic AI, Bioinformatics, Generative Artificial Intelligence (GenAI), Artificial Intelligence (AI), RAG Pipelines, LLM Reasoning, AWS Lambda

Senior ML Engineer

2022 - 2024
KYROS Insights
  • Built multi-target regression models predicting user spend and credit redemption. Identified high-intent segments to optimize loyalty credit allocation, directly driving incremental revenue and reducing marketing waste.
  • Engineered scalable DL pipelines using Azure, Spark, and PyTorch Lightning for massive time-series data. Improved data throughput and training speed to enable frequent recalibration of risk models for enterprise clients.
  • Managed model lifecycle via MLflow for experiment tracking and custom metrics. Developed specialized visualizations to translate complex ML outputs into stakeholder-ready insights, streamlining analysis of program ROI.
  • Optimized multi-target regression by designing custom activation functions and modifying gradient calculations. This tailored approach improved convergence and precision across varied regression targets.
Technologies: PyTorch Lightning, Regression, Clustering Algorithms, Profiling, MLflow, Databricks, Spark SQL, Spark ML, Model Evaluation, Prediction Markets, Predictive Modeling, Azure, Python, Azure Databricks, Artificial Intelligence (AI), Plotly

NLP Expert | Data Scientist

2022 - 2022
Pfizer - Manufacturing Operations Solutions
  • Analyzed structures of SOPs from various sources and planned the data extraction strategies for paragraphs, tables, charts, formulas, etc.
  • Developed a table extraction pipeline using different OCR services and the custom logic on top of it.
  • Conducted research on various layout analysis approaches.
Technologies: Python, Natural Language Processing (NLP), Generative Pre-trained Transformers (GPT), Data Science, Amazon Web Services (AWS), Keras, PyTorch, Amazon SageMaker, Parsers, Optical Character Recognition (OCR), Amazon Textract, Tesseract, Bioinformatics, Artificial Intelligence (AI)

NLP Data Scientist

2022 - 2022
Holocene GmbH
  • Developed the document classifier based on textual and visual features of the document.
  • Built several ML pipelines: an entity extractor using the LayoutLMv2 model, an entity extractor using Amazon Textract Forms and the entity type classifier on top of it, and a rule-based entity extractor.
  • Researched barcode and signature detection and extraction.
Technologies: Data Science, Natural Language Processing (NLP), Generative Pre-trained Transformers (GPT), Python, REST APIs, Optical Character Recognition (OCR), Text Mining, Machine Learning, Authentication, APIs, Amazon Textract, Hugging Face Transformers, LayoutLMv2, Docker, Amazon S3 (AWS S3), Amazon SageMaker, Proof of Concept (POC), Artificial Intelligence (AI)

Senior Machine Learning Researcher

2021 - 2021
MaxinAI
  • Reproduced the paper of the previous state-of-the-art bitrate ladder prediction available at https://arxiv.org/pdf/2102.04550.pdf.
  • Generated new features that improved bitrate ladder prediction performance by 3% compared to the previous state-of-the-art.
  • Ran experiments using deep learning models to extract these new features directly from videos and reduce inference time to meet industry requirements.
Technologies: PyTorch, Computer Vision, Video Encoding, FFmpeg, Scikit-image, NumPy, Research, Machine Learning, Scikit-learn, Docker, Docker Compose, Jupyter Notebook, Python, Artificial Intelligence (AI), ETL, CI/CD Pipelines, Amazon Web Services (AWS), Neural Networks

Data Scientist

2021 - 2021
Delivery Hero (Outstaffed from MaxinAI)
  • Performed query optimization and migration from AWS Redshift to Google BigQuery that increased readability and established better query run time.
  • Incorporated new features into models, resulting in 2% performance improvements on average.
  • Moved machine learning pipelines to Airflow DAGs, resulting in a more automated workflow with less human interaction.
  • Ran small experiments on ML pipelines using Amazon SageMaker.
Technologies: Amazon Redshift, BigQuery, SQL, Machine Learning Operations (MLOps), TensorFlow, MLflow, Apache Airflow, Amazon Web Services (AWS), Scikit-learn, Data Science, Data Engineering, Agile, Amazon SageMaker, Artificial Intelligence (AI), Cloud Deployment, Big Data, ETL, CI/CD Pipelines, Containerization, AWS IAM, Data Pipelines

Lecturer | Teaching Assistant

2019 - 2020
MaxinAI
  • Lectured on the topic of introduction to machine learning, including workshops.
  • Taught natural language processing, including workshops.
  • Prepared workshop materials for other lecture sessions.
Technologies: Generative Pre-trained Transformers (GPT), Natural Language Processing (NLP), Docker, Scikit-learn, Jupyter Notebook, Machine Learning, Deep Learning, SpaCy, Pandas, NumPy, Workshop Facilitation, Lecturing, Statistics, Explainable Artificial Intelligence (XAI), Hugging Face, Generative Artificial Intelligence (GenAI), Artificial Intelligence (AI)

Lead Machine Learning and NLP Engineer

2018 - 2020
MaxinAI
  • Led a team of four machine learning engineers and successfully managed multiple projects from many clients.
  • Developed the NLP part of the project by building a Swiss food regulatory startup product that extracts the full nutritional information from food labels. It sped up employees’ work and decreased human errors.
  • Built an intelligent data crawling tool based on classical NLP algorithms that enabled the creation of a database of employees from US-based startups. It facilitated the easy workflow for job candidate recommendations and eliminated manual work.
  • Created an innovative, self-improving NLP tool based on the latest DL models and an OCR that extracts essential data points from formal documents. It helped property managers to get more done in less time.
  • Analyzed call center chat logs to evaluate operators' efficiency and look for potential issues. Applied named-entity recognition and keyword extraction techniques to analyze the problems related to products from the company's tech forum.
  • Developed deep learning NLP models to detect merchant names from bank transaction records. Used LSTM-CRF and CNN models and achieved a 97% F1 score. The US-based startup used the best model to analyze large-scale bank transaction records.
  • Experimented with trading bots for the crypto trading market using time series analysis and backtesting, and generating positive income over the day.
  • Developed a recommender system for a startup selling second-hand clothes online. Personalized recommendations increased sales and customer satisfaction.
  • Built a semantic search engine for retrieving relevant paragraphs from US law cases. It was a fully customizable search engine built on top of keyword extraction and fast vector search capabilities based on BERT embeddings.
Technologies: Docker, Flask, PyTorch, SpaCy, Pandas, Scikit-learn, Amazon EC2, Amazon S3 (AWS S3), RabbitMQ, ABBYY, Amazon Rekognition, Optical Character Recognition (OCR), Docker Compose, Gensim, TensorFlow, Machine Learning Operations (MLOps), Matplotlib, Seaborn, Scikit-image, SciPy, NumPy, PyCharm, Deep Learning, Natural Language Toolkit (NLTK), Datadog, Flask-RESTful, fastText, OpenCV, PIL, SQL, Jupyter Notebook, Word2Vec, BERT, LSTM, Sentiment Analysis, Explainable Artificial Intelligence (XAI), MongoDB, Elasticsearch, Data Science, Data Engineering, Agile, Statistics, Bayesian Statistics, Artificial Intelligence (AI), Cloud Deployment, Google Vision API, Entity Extraction, Data Analysis, Algorithms, GitHub, Amazon Web Services (AWS), XGBoost, Classification, Regression, AutoML, Containerization, AWS IAM, Terraform, Tesseract, REST APIs, Elastic, ELK (Elastic Stack), Statistical Data Analysis, Time Series Analysis, Time Series, Neural Networks, Recurrent Neural Networks (RNNs), Data Pipelines, PostgreSQL, Data Analytics, Algorithmic Trading, Recommendation Systems, Information Retrieval, PyMongo, Text Detection, Keras, Active Learning, Variational Autoencoders (VAEs), Hugging Face, Generative Artificial Intelligence (GenAI), MLflow, Neo4j, Graph Databases, Knowledge Graphs

Experience

ExtractHD Data Extraction Service

A data extraction system from formal documents based on deep learning NLP models. The system can read and analyze several categories of documents, extract meaningful information, and provide them in a structured way. The system can also self-improve and correct past mistakes based on historical data of user actions.

School of AI

https://github.com/MaxinAI/school-of-ai
School of AI was a one-year's initiative from the AI team at MaxinAI for free. Any students with technical background were welcome to take that course.

We started from the basics of ML and maths essentials to advanced techniques. The course consisted of lectures and workshops to help people apply their knowledge in practice while also learning the best practices from industry workers.

Attendees successfully got hired by various tech companies in machine learning positions.

SGS Digicomply LabelWise | Food Label Data Extraction Service

https://www.digicomply.com/label-content-management
A large machine learning pipeline for extracting data from food labels like nutritional information, ingredients list, name, manufacturer, and more. It's based on deep computer vision and natural language processing models to apply segmentation, classification, and text content extraction.

I managed the NLP models and also evaluated the entire system.

Eye Color Prediction

The project aimed to validate and improve the IrisPlex (current SOTA) system for eye color prediction in the Italian population.

I optimized the eye color labeling process using clustering approaches to achieve higher classification accuracy. I also examined DNA methylation values at single-nucleotide polymorphisms (SNPs) to understand the gene expression mechanism.

Published the scientific paper.

Georgian LLM Corpus (ACL Datasets)

https://github.com/AnzorGozalishvili/AnzorGozalishvili.github.io/blob/master/resources/creating_corpus_for_georgian_language_modeling_ACL_ARR_2024_Feb.pdf
I developed a large-scale Georgian corpus for LLM training, filling a critical resource gap for low-resource languages. I managed complex data collection and cleaning pipelines and reviewed the ACL ARR 2024 dataset chapter, but it was unpublished due to compliance with local regulations on digital data ownership.

Syntactic Annotation of Georgian in the UD Schemes (Springer Nature)

I implemented UD scheme annotations for the Georgian language to enable advanced NLP tasks. I engineered syntactic parsing workflows and data validation tools, with results presented at TLT 2024 (ACL) and published in a specialized Springer Nature chapter.

Comparative Analysis of Genetic Perturbation Models

Genetic ML Research (KAUST): I benchmarked foundational DL Models (scGPT, CPA, etc) against classical and linear models for genetic perturbation prediction. I also evaluated model generalization on public datasets to quantify the performance-to-complexity ratio of SOTA techniques.

Education

2018 - 2020

Master's Degree in Computer Science

Tbilisi State University - Tbilisi, Georgia

2017 - 2018

Erasmus Exchange and Scholarship in Computer Science

Universitat Politecnica de Valencia (UPV) - Valencia, Span

2014 - 2018

Bachelor's Degree in Computer Science

Tbilisi State University - Tbilisi, Georgia

Certifications

JUNE 2026 - PRESENT

Getting Started with AWS Generative AI for Developers

Coursera

JULY 2021 - PRESENT

Machine Learning Modeling Pipelines in Production

Coursera

JUNE 2021 - PRESENT

Introduction to Machine Learning in Production

Coursera

JUNE 2021 - PRESENT

Machine Learning Data Lifecycle in Production

Coursera

MARCH 2018 - PRESENT

Deep Learning Specialization

Coursera

MARCH 2018 - PRESENT

Sequence Models

Coursera

DECEMBER 2017 - PRESENT

Convolutional Neural Networks

Coursera

DECEMBER 2017 - PRESENT

Deep Learning for Business

Coursera

OCTOBER 2017 - PRESENT

Structuring Machine Learning Projects

Coursera

OCTOBER 2017 - PRESENT

Improving Deep Neural Networks: Hyperparameter Tuning, Regularization, and Optimization

Coursera

SEPTEMBER 2017 - PRESENT

Neural Networks and Deep Learning

Coursera

DECEMBER 2016 - PRESENT

Machine Learning

Coursera

Skills

Libraries/APIs

Pandas, Scikit-learn, NumPy, PyTorch, SpaCy, Matplotlib, Natural Language Toolkit (NLTK), TensorFlow, OpenCV, SciPy, FFmpeg, PIL, Keras, Flask-RESTful, Amazon Rekognition, PyMongo, LSTM, Google Vision API, XGBoost, REST APIs, Hugging Face Transformers, PyTorch Lightning, Spark ML, Stanford NLP

Tools

PyCharm, Docker Compose, Gensim, Amazon SageMaker, AutoML, RabbitMQ, BigQuery, Apache Airflow, Scikit-image, Seaborn, ABBYY, Amazon Textract, Pytest, GitHub, AWS IAM, Terraform, Elastic, ELK (Elastic Stack), Spark SQL, Claude Code, Claude, Amazon Elastic Container Registry (ECR), Amazon Elastic Container Service (ECS), Plotly

Languages

Python, SQL, Markdown

Platforms

Jupyter Notebook, Docker, Linux, Amazon EC2, Amazon Web Services (AWS), Databricks, Azure, AWS Lambda

Frameworks

Flask, LangGraph, Selenium, Scrapy

Paradigms

Agile, Testing, ETL, Model Context Protocol (MCP)

Storage

Amazon S3 (AWS S3), Datadog, Elasticsearch, MongoDB, Cloud Deployment, Data Pipelines, PostgreSQL, Neo4j, Graph Databases

Industry Expertise

Bioinformatics

Other

Natural Language Processing (NLP), Machine Learning, Word2Vec, Data Science, Generative Pre-trained Transformers (GPT), MLflow, Deep Learning, Feature Engineering, Machine Learning Operations (MLOps), Sentiment Analysis, Explainable Artificial Intelligence (XAI), Optical Character Recognition (OCR), Text Detection, Statistics, Artificial Intelligence (AI), Classification, Regression, Amazon SageMaker Pipelines, Generative Artificial Intelligence (GenAI), Amazon Redshift, Pipelines, fastText, Recommendation Systems, Information Retrieval, Active Learning, Variational Autoencoders (VAEs), Time Series Analysis, Principal Component Analysis (PCA), Lecturing, Workshop Facilitation, Computer Vision, Video Encoding, Research, BERT, Data Engineering, Bayesian Statistics, Evaluation, Text Classification, Entity Extraction, Data Analysis, Algorithms, Big Data, CI/CD Pipelines, Containerization, Tesseract, Statistical Data Analysis, Time Series, Neural Networks, Recurrent Neural Networks (RNNs), Data Analytics, Algorithmic Trading, Clustering, Experimental Design, Optimization, Genomics, Biology, Clustering Algorithms, Data Visualization, Hugging Face, Transformers, Text Mining, Authentication, APIs, LayoutLMv2, Parsers, Proof of Concept (POC), Profiling, Model Evaluation, Dagster, LangChain, AI Agents, DataTrove, Web Crawlers, Open-source LLMs, Tokenization, Computational Linguistics, Conference Speaking, Data Labeling, Scanpy, Prediction Markets, Predictive Modeling, AI Tools, Large Language Models (LLMs), Large Language Model Operations (LLMOps), Retrieval-augmented Generation (RAG), Vector Search, FastAPI, Amazon Bedrock, Agentic AI, Azure Databricks, RAG Pipelines, LLM Reasoning, Knowledge Graphs, Data Transformation, AWS Bedrock AgentCore

Collaboration That Works

How to Work with Toptal

Toptal matches you directly with global industry experts from our network in hours—not weeks or months.

1

Share your needs

Discuss your requirements and refine your scope in a call with a Toptal domain expert.
2

Choose your talent

Get a short list of expertly matched talent within 24 hours to review, interview, and choose from.
3

Start your risk-free talent trial

Work with your chosen talent on a trial basis for up to two weeks. Pay only if you decide to hire them.

Top talent is in high demand.

Start hiring