Pawel Kaplanski
Verified Expert in Engineering
Data Science Developer
Sydney, New South Wales, Australia
Toptal member since April 3, 2019
Pawel is an experienced data-scientists and machine learning professional. He has worked for Fortune 100 companies, and he has an academic background in the field. Before moving to data science, he was a former lead architect in Samsung R&D Center. Pawel holds a Ph.D. in knowledge representation and reasoning as well as a master's degree and a bachelor of science degree in computer science.
Portfolio
Experience
- Data Science - 10 years
- Machine Learning - 7 years
- Generative Pre-trained Transformers (GPT) - 6 years
- Natural Language Processing (NLP) - 6 years
- Python - 5 years
- SQL - 5 years
- Natural Language Toolkit (NLTK) - 3 years
- TensorFlow - 2 years
Availability
Preferred Environment
Python
The most amazing...
...thing I've coded is a Clinical Decisions Support System implementing ESMO guideline for cancer treatment.
Work Experience
Senior Machine Learning Engineer
Undisclosed
- Recommended systems, image processing, NLP, and deep learning to the production.
Data Scientist
Cognitum
- Created machine-learning models using Sklearn and Tensorflow for Fortune 100 customer in the area of trade promotion optimization.
- Created a cognitive programming language that makes AI programming easy allowing mixing reasoning with machine learning, used in a fraud detection system for a public institution.
- Designed and implemented controlled natural language for formalizing the knowledge around lung cancer, used by the oncologist to formalize ESMO guidelines.
- Created affective-computing AI models that are combining both expert knowledge and their intuitions, to calculate the quality score of complex decisions.
- Created the novel, automated user interface synthesis algorithm in which a set of requirements is automatically translated into a working application, currently used by 30+ clinical centers and biggest telecon in Australia.
- Created an NLP classification algorithm for legal documents corpora based on the NLTK library, constructed using mixed feature-extraction techniques: POS-Tagging, noun-phrase extraction, collocations and NER (named entity recognition), followed by Tf/Idf, feature reduction and finally the classification with Passive-Aggressive, scalable classifier.
- Created a critical part of a tax-fraud detection system was based on natural language rules enabling decision makers and specialists to manage a tax fraud knowledge base. The stream-based reasoner allows discovering fraudulent activities in the stream of 5 million invoices per day.
Assistant Professor
Gdansk University Of Technology, Department of Applied Informatics in Management
- Reviewed “Government Information Quarterly, An International Journal of Information Technology Management, Policies, and Practices," IF=2.515, 5Y IF=3.161.
- Acted as an academic visitor at the University of Newcastle, Australia.
- Participated as a member of the EU Maria-Courie research project "Smart multipurpose knowledge administration environment for intelligent decision support systems development."
- Reviewed and contributed to the “18th International Conference on Knowledge-Based and Intelligent Information & Engineering Systems."
- Served as a member of the international BRIDGE project: "CDSS for Oncology."
- Taught the following classes: R Programming, Introduction to DataScience, Business Intelligence and BigData Processing, Software Development Process Methodology and Tools.
Lead Architect
Samsung
- Led design and implementation of an industrial software stack for digital television receivers.
- Led design and implementation of a set-top-box device emulator for efficient application level testing purposes.
- Designed and implemented automated smoked test system with ASP.Net, MSMQ, image recognition, and remote controller emulation.
- Technically managed a team of 30+ programmers.
- Conducted training for newcomers about advanced multithreaded design patterns in C++.
Experience
CDSS - Clinical Decision Supporting System
We organized available data into the knowledge of the diagnostic process, based on many sources like studies, publications, recommendations, so it supports doctors decisions. We also developed a central registry for collecting patient’s clinical data from over 70 oncological institutions in Poland. In production since 2016.
The results were published in Expert Systems With Applications that is currently ranked number 1 in the Google Scholar h-index listed under the top publications of artificial intelligence.
Trade Promotion Optimization
- Can we lower overall costs by optimizing products volume sales and its promotion strategy by anticipating a promotion calendar for a given period?
- Can we predict using key indicators when and which sales pattern is the most effective and can be used to increase volume sales?
- Can we set up a useful promotion calendar for “slow-moving products”?
- Can we optimize budget KPIs when planning the next sales period?
In our case, the mis-forecasting (avg. the error was around 20%) led to budget reduction (across multiple stages within a whole supply chain). To solve the problem, we combined business knowledge of subject matter experts with historical sales data that we received. We also took into account their anomalies and outliers.
The solution allowed the company to increase its accuracy in prediction by up to 10% of volume planning.
Tax-fraud Detection on VAT
Automated Decision Making System
Abusive-clause detector
Cyber Assessment
The tool is allowing customers to perform guided cyber-security health check, and after the health-check is completed, the detailed report (diagnosis) is generated allowing the customer to understand the current state of the company’s cybersecurity maturity level and understand the weak points. The estimation of the potential cost of the Problem is also provided.
Education
Ph.D. in Computer Science
Gdansk University of Technology - Gdańsk, Poland
Master of Engineering Degree in Computer Science
Wroclaw University of Technology - Wrocław, Poland
Bachelor of Engineering Degree in Computer Science
Wroclaw University of Technology - Wrocław, Poland
Certifications
Sequence Models
Coursera
Deep Learning Specialization
Coursera
Convolutional Neural Networks
Coursera
Structuring Machine Learning Projects
Coursera
Improving Deep Neural Networks: Hyperparameter tuning, Regularization and Optimization
Coursera
Neural Networks and Deep Learning
Coursera
Oracle Certified Professional, Java SE 5 Programmer
Oracle
Skills
Libraries/APIs
Natural Language Toolkit (NLTK), OWL API, TensorFlow, Scikit-learn, Keras, NumPy, Pandas, PyTorch, PySpark, SymPy, SciPy
Tools
Protégé, SikuliX, Microsoft Visual Studio, Git, Jira, OpenLink Virtuoso, Apache Solr
Languages
OWL, RDF, SPARQL, R, SQL, C++, Java, C#, Python, Semantic Web Rule Language (SWRL), JavaScript, T-SQL (Transact-SQL), UML, XML
Frameworks
Apache Jena, Ontology Framework, TinkerPop, .NET
Paradigms
Anomaly Detection, BPMN, Scrum
Platforms
Azure, Amazon EC2, Jupyter Notebook, Amazon Web Services (AWS), RStudio, Azure AI Studio
Storage
Cassandra, Titan Graph, Oracle SQL, MySQL
Other
Data Science, WordNet, Genetic Algorithms, Natural Language Processing (NLP), Machine Learning, Generative Pre-trained Transformers (GPT), Minimum Viable Product (MVP), Team Leadership, Artificial Intelligence (AI), Large Language Models (LLMs), Generative Artificial Intelligence (GenAI), Recurrent Neural Networks (RNNs), Deep Learning, Classification Algorithms, Regression Modeling, Clustering Algorithms, Bayesian Inference & Modeling, Logistic Regression, Decision Trees, Random Forests, Markov Model, Ensemble Methods, Evolutionary Algorithms, Sesame, Data Visualization, Scalable Architecture, Time Series Analysis, Principal Component Analysis (PCA), Simple Knowledge Organization System (SKOS), Embedded Systems, Schema.org
How to Work with Toptal
Toptal matches you directly with global industry experts from our network in hours—not weeks or months.
Share your needs
Choose your talent
Start your risk-free talent trial
Top talent is in high demand.
Start hiring