Sandro Barnabishvili
Verified Expert in Engineering
Machine Learning Engineer and Developer
Tbilisi, Georgia
Toptal member since September 24, 2018
Sandro holds a BSc and MSc in computer science and has over a decade of software engineering experience. His main fields of expertise include machine learning R&D (NLP/CV/DL), MLOps, big data, and algorithm design. Sandro has worked with big companies like Microsoft and Logitech, as well as small startups.
Portfolio
Experience
- Python - 10 years
- NumPy - 8 years
- Machine Learning - 8 years
- Natural Language Processing (NLP) - 8 years
- PyTorch - 8 years
- Machine Learning Operations (MLOps) - 6 years
- Apache Spark - 6 years
- Big Data Architecture - 6 years
Availability
Preferred Environment
Jupyter Notebook, PyCharm, macOS
The most amazing...
...accomplishment of mine has been leading the development of a critical component for a client's workflow, which positively affected millions of users.
Work Experience
Senior ML Engineer
Lexitas
- Created a scalable RAG framework for semantic search and long-document summarization using an AWS-based microservice architecture, specifically SageMaker, Lambda, SQS, SNS, OpenSearch, CloudFormation, Docker, etc.
- Worked on multistage prompt engineering for multiple document understanding tasks using recent LLMs, both open source (LLaMA, Mistral, MPT, and Falcon) and OpenAI's GPT.
- Contributed to automatic model deployment (AWS CloudFormation, Docker, Bash), CI/CD, testing, and evaluation.
Applied Scientist
Microsoft
- Focused on prompt-based NLP, question answering and generation, summarization, and information extraction.
- Performed in-depth qualitative and quantitative evaluation of Bing's latest GPT-based ML production pipelines, including building the supporting codebase.
- Planned and executed research agendas, conducted ML experiments, participated in brainstorming, and tracked recent papers.
AI Invited Lecturer
Free University of Tbilisi
- Taught the class "NLP with Deep Learning" to CS students.
- Supervised multiple bachelor's thesis projects in NLP and recommendation systems.
- Created the course syllabus, prepared practical materials, and planned exam sessions with TAs.
Lead Machine Learning Engineer
Kyros
- Optimized large-scale ETL and ML Spark workflows, yielding substantial cost savings and cutting run times from hours to minutes.
- Implemented CI/CD, large-scale automated tests, model and data versioning (MLflow), and automatic deployment (Airflow, Docker, Azure).
- Led the development of an AutoML platform for speeding up DL model prototyping, fine-tuning, and deployment. Features include a user-friendly API, multi-node distributed training, Bayesian hyperparameter search, advanced logging, and analytics.
- Planned new ML-based product features with the CEO, wrote new user stories, and performed code reviews.
AI Consultant
MaxinAI
- Acted as the lead researcher on a project estimating video scene motion and texture complexity for optimal transcoding, achieving SOTA results relative to the reference paper.
- Helped the team develop multiple algorithms for extracting nutritional information from food product images, using a combination of OCR, language modeling, object detection, clustering, and graph algorithms.
- Implemented a few computer vision POCs for company clients, including real-time object detection and tracking, image quality enhancement, semantic search, etc.
- Added a PostgreSQL C++ extension to support high-dimensional vector searches.
Machine Learning Engineer
Logitech
- Oversaw R&D in end-to-end dialog modeling, long-term memory in neural networks, text representation, and intent understanding.
- Designed and built a software framework for creating dialog assistants, utilizing the latest DL techniques.
- Created a synthetic dialog data generation tool for mitigating data deficiency problems with end-to-end dialog learning.
Software Engineer
AlphaCredit
- Designed and implemented a raw-text document store and a full-text search engine with a REST API.
- Created a client-server web application for searching text fragments efficiently.
- Integrated the new solution into the client's legacy software by accessing the API from an MS SQL database.
- Managed the full development cycle, helped the client refine their needs, and delegated tasks to a junior developer.
Full-stack Software Engineer
Alta Software
- Implemented front-end and core business logic components of enterprise billing software for SOCAR, one of the biggest oil companies in the region.
- Optimized database access via the ORM framework and SQL queries to improve software performance.
- Implemented a BI tool with a financial data dashboard, user management, and data reporting functionality.
Full-stack Web Developer
Happy Group
- Created multiple websites with a content management system from scratch using pure PHP.
- Built a WordPress-like content management system framework for building small and medium-sized websites quickly.
- Implemented a few JavaScript plug-ins for existing websites.
Experience
Large-scale Machine Learning for Citation Prediction
Technologies: Python, Apache Spark, HDFS, NLP, graph algorithms, linear models.
Language IDE
Technologies: C# with WPF
GPU-accelerated Image Patching App
AutoML Framework
Some of the features include:
- Rich high-level API, enabling fast prototyping and deployment.
- Multi-node multi-GPU distributed training in Azure Databricks infrastructure.
- Automatic hyperparameter search.
- Multi-task learning.
- Advanced logging and monitoring.
- Automatic deployment in the client's production pipeline.
Tech Stack: Azure Databricks, Apache Spark, Apache Petastorm, PyTorch, MLflow.
Education
Master's Degree in Computer Science
Swiss Federal Institute of Technology in Lausanne (EPFL) - Lausanne, Switzerland
Bachelor's Degree (with Honors) in Computer Science
St. Andrew the First-Called Georgian University - Tbilisi, Georgia
Certifications
AWS Certified Solutions Architect - Associate
Amazon Web Services
Skills
Libraries/APIs
PyTorch, NumPy, Pandas
Tools
Git, Amazon SageMaker, AWS CloudFormation, Amazon OpenSearch
Languages
Python, SQL, C++
Paradigms
Agile Software Development, Object-oriented Programming (OOP), Microservices Architecture, Testing
Frameworks
Apache Spark
Platforms
AWS Lambda, Docker
Storage
Elasticsearch
Other
Machine Learning, Natural Language Processing (NLP), Machine Learning Operations (MLOps), Computer Vision, Big Data Architecture, Azure Databricks, Prompt Engineering, Large Language Models (LLMs), Retrieval-augmented Generation (RAG), Recursion Testing