Karol Kulasiński
Verified Expert in Engineering
Data Science Developer
Warsaw, Poland
Toptal member since October 22, 2021
Karol is a highly experienced senior data scientist with a strong focus on NLP and a wide range of AI applications. He has a unique academic background in physics and large-scale models, in addition to relevant experience in the customer-facing data science industry. He enjoys working with data and leading and implementing R&D projects. With his PhD in physics and a recent MBA degree, Karol easily combines technology and business.
Preferred Environment
Python, Docker, SQL, Generative Pre-trained Transformers (GPT), Natural Language Processing (NLP), Machine Learning, Data Science, Artificial Intelligence (AI), Azure, Generative Artificial Intelligence (GenAI)
The most amazing...
...project I led was the creation of an autonomous AI-agents-as-a-service platform that allowed the client to use their data securely with LLMs.
Work Experience
Co-founder
Fractile
- Designed and co-implemented a platform to create, customize, and apply AI agents. The agents had an additional layer of security, thanks to an anonymization service that protects 100% of the data from the client's server.
- Applied AI agents with the ability to learn from previous tasks and be spawned via Chat, API, or Jira.
- Deployed the solution to two medium-sized companies.
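To illustrate the anonymization idea mentioned above, here is a minimal sketch, not the platform's actual implementation. It assumes the only sensitive values are email addresses matched by a simple regular expression; the names `anonymize` and `deanonymize` and the `<EMAIL_n>` placeholder format are hypothetical.

```python
import re

# Hypothetical, simplified pattern; a real service would cover many more entity types.
EMAIL_RE = re.compile(r"[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}")


def anonymize(text):
    """Replace each distinct email with a placeholder; return the text and the mapping."""
    mapping = {}  # placeholder -> original value

    def _sub(match):
        value = match.group(0)
        # Reuse the placeholder if the same value was already seen.
        for placeholder, original in mapping.items():
            if original == value:
                return placeholder
        placeholder = f"<EMAIL_{len(mapping) + 1}>"
        mapping[placeholder] = value
        return placeholder

    return EMAIL_RE.sub(_sub, text), mapping


def deanonymize(text, mapping):
    """Put the original values back into text that contains placeholders."""
    for placeholder, original in mapping.items():
        text = text.replace(placeholder, original)
    return text


original_text = "Contact anna@example.com or jan@example.org; cc anna@example.com."
masked, mapping = anonymize(original_text)
print(masked)                        # only placeholders would leave the client's server
print(deanonymize(masked, mapping))  # originals restored locally
```

In this sketch the mapping never leaves the caller, so an external LLM would only see placeholders.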
Machine Learning Expert and Engineering Manager
Warsaw Stock Exchange
- Led the development process of a scalable exchange platform for personalized ads on Polish TV.
- Designed logo placement detection AI in live-streamed TV broadcasts.
- Led the development team of nine, implementing a supply-side platform, end-to-end, from conceptualization and MVP to a scalable production stage.
- Created an end-to-end pipeline to train the behavioral models based on the data from TV.
Lead Data Scientist
Sweetgreen Inc - Main
- Designed and led the implementation of the salad recommendation engine at the production level.
- Created a BI tool with live-updated sales data and ML forecasting for the CxOs.
- Built a PoC sales forecasting model based on historical sales and weather forecast data.
- Improved the legacy ML production models for supply chain forecasting, reducing the processing time by an order of magnitude.
Senior Data Scientist
Meloncast
- Created a complete training and deployment pipeline for NLP models (BERT) to classify marketing texts by target audience.
- Trained ML models to identify the most similar pictures, in terms of content and color scheme, among images the client provided.
- Designed and deployed a production-level API for containerized Docker services.
Lead Data Scientist
Physica Solutions
- Built an NLP ecosystem for using ChatGPT on the company's private data.
- Created subMIND, a tool for extracting subconscious information from a large body of text that uses state-of-the-art techniques for entity recognition, graph relations, and visualizations.
- Built Microsoft Power BI reports for a private Polish university, working directly with the business.
- Designed an architecture for classifying fake news in social media for the most prominent Polish university, including NLP (BERT) classification, data collection, and overall flow.
Lead Data Scientist
Yieldbird
- Optimized pricing models for online ad auctions using ML tools.
- Created an entire ML pipeline, including data ingestion, testing, prototyping, error handling, monitoring, and evaluation.
- Directed the process of product development from the R&D side, including hypothesis testing and handling client feedback.
Data Scientist
DS Stream
- Created Tableau reports identifying fraudulent behavior of employees.
- Built a fully automated quality assurance system for data ingestion.
- Designed a Twitter fake news detector front end for data visualization.
Postdoctoral Researcher
Lawrence Berkeley National Lab
- Carried out state-of-the-art research using molecular dynamics and Monte Carlo simulations on nanoscopic materials.
- Published three technical papers in a highly respected scientific journal.
- Created, ran, and interpreted numerical simulations with over 10^7 degrees of freedom.
Doctoral Researcher
ETH Zurich
- Carried out numerical simulations that resulted in models further used by other team members.
- Published nine technical papers in top-ranked journals as the first author.
- Contributed to the physical chemistry field by explaining the water adsorption-related phenomena in cellulose.
Intern
Texas A&M University
- Created a numerical model of the secondary loop of the BWR nuclear reactor under the direction of Professor J. Ragusa.
- Applied the Monte Carlo method for sensitivity analysis of numerical coefficients in different equations of state.
- Expanded the lab's Python library for carrying out finite element method simulations.
Experience
Autonomous AI Agents
https://fractile.io
subMIND
Hot Topics Classifier
I predicted the topics using the LDA method and ran the collected texts through BERT to determine which target audience each text pertained to. Texts were scraped from LinkedIn and online newspapers.
PriceGenius | Ad Price Optimization
https://yieldbird.com/price-genius
The ads are first-price auctions, and we used ML techniques to find their optimum price and predict the price that would allow the end customer to maximize their revenue. Thanks to our ML models, the revenue boost was up to 10%.
I created an entire ML pipeline, including data ingestion, testing, prototyping, error handling, monitoring, and evaluation. I also directed the product development process from the R&D side, including hypothesis testing and handling client feedback.
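As a rough illustration of what "finding an optimum price" can mean, the sketch below picks the asking price that maximizes expected revenue over a sample of buyer valuations. It is a deliberately simplified stand-in, not the PriceGenius model: it assumes a buyer pays exactly the asking price whenever their valuation is at least that price, and the function name `best_price` and the sample numbers are made up for the example.

```python
def best_price(valuations, candidates):
    """Return (price, expected_revenue) maximizing price * share of valuations >= price."""
    n = len(valuations)
    best = (None, 0.0)
    for price in candidates:
        # Share of buyers who would still buy at this price.
        accept_rate = sum(v >= price for v in valuations) / n
        revenue = price * accept_rate
        if revenue > best[1]:
            best = (price, revenue)
    return best


# Made-up sample of buyer valuations, for illustration only.
valuations = [1.0, 2.0, 2.0, 3.0, 5.0]
price, revenue = best_price(valuations, candidates=[1.0, 2.0, 3.0, 5.0])
print(price, revenue)
```

A production system would replace the fixed sample with a model that predicts the distribution of bids per ad slot.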
Education
Professional Degree in Business Administration (MBA)
Kozminski University - Warsaw, Poland
Postdoc in Computational Physics
UC Berkeley - Berkeley, CA
PhD in Physics
ETH Zurich - Zurich, Switzerland
Master's Degree in Nuclear Engineering
University Paris 11 - Paris, France
Bachelor's Degree in Physics
Warsaw University of Technology - Warsaw, Poland
Certifications
Microsoft Data Science Certificate
Microsoft
Skills
Libraries/APIs
Pandas, XGBoost
Tools
Jupyter, Microsoft Power BI, ChatGPT, Tableau, MATLAB, AWS CLI, Azure OpenAI Service, Amazon SageMaker
Languages
Python, SQL, HTML, JavaScript, Snowflake, C++
Paradigms
REST, Management, Parallel Computing
Platforms
Jupyter Notebook, Linux, Amazon Web Services (AWS), Docker, Google Cloud Platform (GCP), Azure, Azure Functions, Kubernetes
Frameworks
Flask, Spark, Django
Storage
NoSQL, PostgreSQL, Datadog, Databases
Other
Simulations, Mathematical Analysis, Applied Physics, Natural Language Processing (NLP), Machine Learning, Data Science, Generative Pre-trained Transformers (GPT), Artificial Intelligence (AI), Data Engineering, Data Analysis, Applied Mathematics, Hypothesis Testing, Statistics, Publication, Conference Speaking, Recommendation Systems, APIs, Ads, Pipelines, Product Roadmaps, Image Processing, BERT, Big Data, Web Scraping, Numerical Methods, Data Analytics, Data Visualization, Finance, Strategic Planning, Human Resources (HR), Accounts, Negotiation, Generative Artificial Intelligence (GenAI), Visualization, Large Language Models (LLMs), LangChain, FastAPI, Transformer Models, Machine Learning Operations (MLOps), Time Series Analysis, Time Series