Dawid Smoleń
Verified Expert in Engineering
Machine Learning Engineer and Software Developer
New York, NY, United States
Toptal member since September 15, 2021
Dawid has delivered more than 30 successful projects in data science and machine learning. He has worked with both classical and deep learning solutions in a few industries. Dawid is focused on creating systems that follow MLOps best practices and design patterns. Having experience with many cloud providers, he is able to automate the whole ML process, from data gathering to automated deployments and continuous training.
Portfolio
Experience
Availability
Preferred Environment
Ubuntu, Azure, Amazon Web Services (AWS), Google Cloud, Python, Scikit-learn, PyTorch, TensorFlow
The most amazing...
...things I've created are reliable systems not only in terms of data modeling methodology but also in implementing the best MLOps design patterns.
Work Experience
ML Consultant | MLOps Engineer
Freelance
- Acted as a data science trainer for two training companies and conducted training for around seven teams from various enterprises.
- Deployed modeling services to Kubernetes clusters, Amazon EKS and Google Kubernetes Engine (GKE).
- Introduced tracking servers to the existing projects to improve the observability of a model and understanding of a problem.
- Developed an end-to-end solution from data investigation to a deployed model that monitors daily statistics and business metrics regarding user experience in eCommerce.
- Consulted an ECG-related company from Latin America. Helped with the design and implementation of crucial Holter analysis steps.
- Prepared NFT market analysis tools based on machine learning traits valuation.
- Prepared deduplication service for real-estate website scraper.
MLOps Engineer
Sinch
- Managed thousands of models in production. Maintained them and also significantly optimized the costs and speed.
- Added a lot of observability tools on many levels.
- Worked with the hottest tech, including GitOps and event-based architecture.
- MIgrated massive projects between popular cloud providers.
Machine Learning Engineer
Grape Up
- Developed an end-to-end deep learning automotive project together with full automation (CI, CD, and CT) and infrastructure.
- Created POCs and demos in machine learning and data science areas together with simple UI demos and API first approach.
- Contributed to the company's entry into the AI market, working on papers, blog posts, offers, and creating POCs.
- Worked on machine learning best practices using modern tools and solutions.
Deep Learning Engineer
Lekta
- Created a library for users' intent classification that employs industry best practices to make predictions millions of times a month in a real-time, demanding environment.
- Developed a novel speech recognition system based on state-of-the-art papers that beat the current market in some areas in terms of accuracy or performance.
- Researched numerous topics in the areas of speech recognition, voice-based gender recognition, intent classification, sentence representation, and text representation.
- Developed machine learning algorithms for both voice bots and chatbots.
Machine Learning Engineer
Aspel SA
- Created a brand new QRS detector tested on many benchmarks and real-world monitoring tests.
- Developed clustering algorithms that can efficiently cluster long Holter monitor tests, focusing on user experience.
- Developed embedded resampling algorithms for ECG devices.
- Contributed to QRS morphology classifiers that highly improved the work of doctors and met AMA standards.
- Helped develop user experience-related algorithms that simplify the work of the doctors and technicians.
NLP Engineer
WitKom – Virtual Translator of Sign Communication
- Developed the first Polish to Polish Sign Language translation system on the language level.
- Built the first Polish Sign Language to Polish translation system on the language level using Seq2Seq models.
- Created huge artificial datasets for sign languages based on heuristics, rules, and DL technology.
Experience
Gomrade — Play Go Against AI on a Real, Physical Board
https://github.com/smolendawid/GomradeSpeech Representation and Exploration Notebook
https://www.kaggle.com/davids1992/speech-representation-and-data-explorationThe Simplest Python Cache for Data Scientists
https://github.com/smolendawid/cachaContrary to many other tools, cacha boasts the following features:
• It is used at the function call, not the definition. Many packages implement the @cache decorator that has to be used before the definition of a function that is not easy enough to use.
• It stores the cache on disk, which means you can use the cache between runs. This is convenient in data science work.
Drifting – The Most Flexible Drift Detection Server
https://github.com/sign-ai/driftingPYTHON-FIRST
Communicate with the Drift Detection server using a super simple Python client. No additional management needed!
EASY INTEGRATIONS
Using drifting is simple thanks to standardized, ML server-based integrations like Kafka, OpenAPI, and gRPC.
FLEXIBLE
One server for managing many models, projects, versions, and features without any further tools.
STATE-OF-THE-ART
An open-source project built upon the top-tier libraries—Alibi Detect, ML server, and more!
Education
PhD in Electrical and Electronics Engineering
AGH University of Science and Technology - Cracow, Poland
Master's Degree in Acoustical Engineering
AGH University of Science and Technology - Cracow, Poland
Certifications
ML Practitioner
Dataiku
Core Designer
Dataiku
Machine Learning
Coursera
Neural Networks and Deep Learning
Coursera
Structuring Machine Learning Projects
Coursera
Improving Deep Neural Networks: Hyperparameter Tuning, Regularization and Optimization
Coursera
Skills
Libraries/APIs
Scikit-learn, PyTorch, TensorFlow, REST APIs, Node.js, React, OpenCV, Keras, SciPy, Natural Language Toolkit (NLTK)
Tools
Google Kubernetes Engine (GKE), MATLAB, Helm
Languages
Python, C++
Paradigms
Continuous Integration (CI), DevOps
Platforms
Azure, Kubeflow, Jupyter Notebook, Ubuntu, Dataiku, Docker, Amazon Web Services (AWS), Kubernetes
Storage
Google Cloud
Frameworks
Metaflow
Other
Machine Learning Automation, Audio Processing, Natural Language Processing (NLP), Deep Neural Networks (DNNs), Deep Learning, Machine Learning, Sequence Models, Machine Learning Operations (MLOps), ECG, Data Science, Artificial Intelligence (AI), CI/CD Pipelines, Generative Pre-trained Transformers (GPT), Data Scraping, Data Engineering, Data Analysis, Acoustics, Digital Signal Processing, Speech Recognition, Convolutional Neural Networks (CNNs), Training, Chatbots, Predictive Modeling, Regression Modeling, Classification Algorithms, GitOps, Large-scale Projects, Front-end, Non-fungible Tokens (NFT), Acoustical Engineering
How to Work with Toptal
Toptal matches you directly with global industry experts from our network in hours—not weeks or months.
Share your needs
Choose your talent
Start your risk-free talent trial
Top talent is in high demand.
Start hiring