Stefan Mićić
Verified Expert in Engineering
Python Data Engineer and Developer
Novi Sad, Vojvodina, Serbia
Toptal member since July 20, 2022
Stefan is an experienced machine learning and machine learning operations (MLOps) engineer with hands-on experience in big data systems. His almost a decade of expertise is supplemented by a master's degree in artificial intelligence. Stefan has worked on problems such as object detection, classification, sentiment analysis, named-entity recognition (NER), and recommendation systems. He is always looking forward to being involved in end-to-end machine learning projects.
Portfolio
Experience
Availability
Preferred Environment
PyCharm, Python 3, Python, GitHub, Amazon S3 (AWS S3), JSON, Distributed Systems
The most amazing...
...end-to-end machine learning solution I've created optimized the cost of the machine learning pipelines numerous times with state-of-the-art results.
Work Experience
Machine Learning Engineer
RhythmScience Inc.
- Deidentified the database and various types of files (HL7, XML, and PDF) by HIPAA standards and dockerized and automated the whole pipeline.
- Developed ML algorithms to generate text and classify PDF reports.
- Designed, implemented, and deployed the solution.
Senior MLOps Engineer
PlusPower
- Developed big ML pipelines using Sagemaker including preprocessing, training, evaluation and deployment.
- Developed pipeline that was able to generate airflow pipelines based on configs and automated deployment of DAGs.
- Increased test coverage from 15% to 80% and added integration tests so that we can test sagemaker pipelines locally.
AI Lead (via Toptal)
Cumulus Technologies LLC
- Created the whole CI/CD pipeline on AWS. Everything from data ingestion, processing, and model training to model deployment was automated.
- Designed and led the implementation of the whole ML pipeline using various AWS services such as Lambda, Polly, and SageMaker.
- Utilized AWS for development to meet high security requirements (AWS Cloud9, AWS CodeCommit, and AWS CodePipeline).
- Used prompt engineering to force the model to find answers from different sources.
MLOps Engineer
NewsCorp
- Performed the deployment of different LLM and Stable Diffusion models.
- Worked on latency and cost optimizations of LLMs. Successfully reduced latency by five times using different deployment techniques.
- Took responsibility for the complete deployment process of the whole ML part and documentation maintenance.
- Used prompt engineering to make LLM execute the NER tasks.
- Used RAG approach in couple of chatbots.
MLOps Engineer
PepsiCo Global - DPS
- Implemented an end-to-end machine learning pipeline using PySpark.
- Set up CI/CD workflows with unit and integration tests using GitHub Actions.
- Developed Spark and scikit-learn/Pandas ETL jobs to process large data volumes (150 TB).
Tech Lead Data Engineer
Motius
- Led a small team in implementing an ELT pipeline to get data from a GraphQL database and put it into Azure SQL. Everything was Dockerized and pushed to Azure Image Registry.
- Implemented KPI calculations using PySpark, which was communicating with Snowflake. Defined table schema for Snowflake and created migration scripts.
- Followed the Scrum methodology, including daily scrums, retro, and planning, and used Jira.
- Led a small team in implementing ETL Spark jobs with Apache Airflow as an orchestrator, AWS as infra and Snowflake as a data warehouse.
MLOps Engineer
Lifebit
- Carried out deep learning model optimizations using quantization, ONNX Runtime, and pruning, among others.
- Monitored model performance, including memory, latency, and CPU usage.
- Used Valohai to automate the CI/CD process and GitHub Actions to automate some parts of the MLOps lifecycle.
- Created automated experiment tracking using Amazon CloudWatch, Valohai, Python, GitHub Actions, and Kubernetes.
Machine Learning Engineer
HTEC Group
- Optimized a machine learning compiler already on a trained network without re-training using Open Neural Network Exchange (ONNX) and implemented custom operators using PyTorch and C++.
- Worked on an Android machine learning solution and mentored a less experienced developer to train and prepare an object detector and classifier to run smoothly on an Android device.
- Enhanced a project that aimed to upscale images to be as perfect as possible toward 4K resolution.
- Involved in SDP of ship routing problem. Implemented an algorithm from scratch that will guide the ships. Fuel consumption and ETA were used for calculations.
- Worked on open source ONNX Runtime in order to add support for the MIGraphX library.
Machine Learning Engineer
SmartCat
- Contributed to complete MLOps lifecycles using MLflow for model versioning, LakeFS for data versioning, AWS S3 for data storage, and TensorFlow serving in Docker.
- Functioned as a data engineer using Apache Spark for ETL jobs with Prefect and Apache Airflow for scheduling.
- Trained several different architectures for object detection and classification.
Machine Learning Engineer
Freelance
- Scraped product information from various websites, then analyzed and prepared the scraped data for web shops using natural language processing—long short-term memory (LSTM), Word2Vec, and transformers—and added NER since the data was in Serbian.
- Used Amazon SageMaker to automate the machine learning pipeline—data preprocessing, model training, and deployment. Executed automated retraining and deployment of the model, completing the machine learning process before the client updated new data.
- Worked on big data projects using Apache Spark, Kafka, Hadoop, and MongoDB.
- Worked as a data engineer using Spark to create optimized ETL pipelines. Translated the client's needs into SQL.
Experience
Automated End-to-end (E2E) Computer Vision Solution
• Detecting objects in the room
• Classifying person poses
• Automated re-training (active learning)
• Model and data versioning
• Dockerized pipeline
Using those models and predictions, we created a post-processing pipeline for creating reports or key performance indicators (KPIs) for clients.
Android COVID-19 Test Classification
I led a team of two people on this project. We used MobileNet due to size, and all business-relevant metrics were great. We used many optimization techniques to deploy the model to Android, such as quantization, pruning, and knowledge distillation.
MLOps Engineer
Image Super Resolution
ETL Jobs
• Optimized solution to reduce cost and calculation time.
• Scheduled jobs via Airflow and Prefect.
The tech stack was: Spark, Scala, AWS S3, Kafka, Apache Airflow, and Prefect.
NLP Articles Processing
1. Find all relevant tags (events, locations, names, etc.) in the article.
2. Find pairs of tags that are somehow related.
Hugging Face transformers were mainly used to tackle this problem (BERT-based models). Overall metrics were above 95%.
Data Ingestion
Tech Leadership for the DE project
Education
Master's Degree in Artificial Intelligence
University of Novi Sad - Novi Sad, Serbia
Certifications
AWS Certified Machine Learning - Specialty
Amazon Web Services
Skills
Libraries/APIs
PyTorch, Keras, NumPy, Scikit-learn, REST APIs, TensorFlow, Pandas, PySpark, Terragrunt
Tools
PyCharm, Amazon SageMaker, GitHub, Apache Airflow, Pytest, Open Neural Network Exchange (ONNX), Codeship, Prefect, ChatGPT, AWS Glue, Bitbucket, Grafana, Terraform, Celery, Intelligent Content Processing (ICP), AI Prompts
Languages
Python 3, Python, SQL, Scala, Java, Snowflake, GraphQL, C++
Frameworks
Spark, Apache Spark, Streamlit
Paradigms
ETL, Unit Testing, DevOps
Platforms
Amazon Web Services (AWS), Jupyter Notebook, Visual Studio Code (VS Code), Docker, Kubernetes, Amazon EC2, Valohai, Apache Kafka, Azure, Databricks, Google Cloud Platform (GCP), Hyperledger Fabric, Kubeflow
Storage
Amazon S3 (AWS S3), JSON, Databases, PostgreSQL, NoSQL, MongoDB, Data Pipelines, Database Migration, Data Integration, Azure SQL, Datadog, Google Cloud
Industry Expertise
Trading Systems, Project Management
Other
Deep Learning, Machine Learning, Data Science, Artificial Intelligence (AI), Data Engineering, Computer Vision, Natural Language Processing (NLP), Natural Language Understanding (NLU), Convolutional Neural Networks (CNN), Recurrent Neural Networks (RNNs), Machine Learning Operations (MLOps), Neural Networks, AI Design, Deep Neural Networks, Software Engineering, Technical Hiring, Source Code Review, Code Review, Task Analysis, Interviewing, APIs, Generative Pre-trained Transformers (GPT), Large Language Models (LLMs), Models, Data Processing, English, Generative Artificial Intelligence (GenAI), Language Models, MLflow, OpenAI, Recommendation Systems, Lens Studio, Optimization, Team Leadership, Time Series, Data Modeling, Data Mining, Monitoring, Big Data, Image Processing, Transformers, Cloud, Object Detection, Computer Vision Algorithms, Object Tracking, Web Development, Speech Recognition, Voice Recognition, Cloud Services, ETL Tools, Distributed Systems, Data Analysis, CI/CD Pipelines, Query Optimization, Research, Stock Trading, Algorithmic Trading, Finance, Financial Software, Prompt Engineering, Retrieval-augmented Generation (RAG), OpenAI GPT-3 API, OpenAI GPT-4 API, Data Analytics, ELT, System Architecture, Infrastructure, DataOps, Hugging Face, BERT, Back-end, Software Architecture, DocumentDB, Ray.io, AI Translation, FastAPI
How to Work with Toptal
Toptal matches you directly with global industry experts from our network in hours—not weeks or months.
Share your needs
Choose your talent
Start your risk-free talent trial
Top talent is in high demand.
Start hiring