Aleksandar Pantovic, Developer in Belgrade, Serbia
Aleksandar is available for hire
Hire Aleksandar

Aleksandar Pantovic

Data Scientist and Developer

Belgrade, Serbia

Toptal member since July 25, 2022

Bio

Aleksandar is a highly accomplished data scientist who has successfully built and implemented machine learning algorithms, predictive models, and generative AI applications in various industries. He's passionate about machine learning and AI and their applications in real-world industry problems. Aleksandar is an expert in Python, R, SQL, PySpark, Databricks, TensorFlow, PyTorch, SAS, Power BI, Git, and others, and is experienced in the top 3 cloud vendors (AWS, GCP, and Azure)

Portfolio

Independent Contractor
Python, R, Azure Machine Learning, SQL, Docker, Dask, PySpark, Databricks...
Treat Technologies, Inc
Google BigQuery, Python, Data Science, Data Build Tool (dbt)...
Banca Intesa Serbia
SAS, R, SQL, Microsoft Power BI, Machine Learning, Predictive Modeling...

Experience

  • Data Science - 7 years
  • Artificial Intelligence (AI) - 5 years
  • R - 5 years
  • Machine Learning - 5 years
  • SQL - 5 years
  • Python 3 - 3 years
  • Docker - 2 years
  • Natural Language Processing (NLP) - 1 year

Preferred Environment

Windows, Linux, MacOS, Python 3, SQL

The most amazing...

...things I've built are a fully automated, end-to-end demand forecasting pipeline and a document processing and automation service.

Work Experience

Senior Data Scientist

2020 - PRESENT
Independent Contractor
  • Built various microservices to orchestrate the workload and perform OCR, extract information from documents (tables, checks, mentions, etc.), chunk the document, store it into a vector database, and process it through relevant controls assessments.
  • Developed a fully automated demand forecasting pipeline based on statistical forecasts and machine learning models. Its workflow allows users to tweak parameters, run and compare models interactively, and analyze error metrics.
  • Created a data processing pipeline in Spark to work with several million data points of hourly data.
  • Optimized multiple R scripts that do weekly processing, thus cutting the processing time by half, and created the distribution planning logic to optimize warehouse inventory levels and re-route existing inventory if possible.
  • Built a claim processing flow by utilizing an LLM to analyze contracts and business logic to determine rules for estimating allowable payments.
Technologies: Python, R, Azure Machine Learning, SQL, Docker, Dask, PySpark, Databricks, Machine Learning, Scikit-learn, Pandas, Predictive Modeling, Data Science, Predictive Analytics, Data Analysis, Data Visualization, Data Analytics, Artificial Intelligence (AI), Azure, Neural Networks, Generative Pre-trained Transformers (GPT), Natural Language Processing (NLP), JSON, Apache Spark, Jupyter Notebook, Data Reporting, Data Mining, Web Scraping, Amazon Web Services (AWS), Big Data, TensorFlow, REST APIs, Keras, Google Cloud Platform (GCP), Statistics, Data Pipelines, Three.js, BigQuery, ETL, Data Engineering, Looker, Project Management, PyTorch, Google BigQuery, XGBoost, Random Forest Regression, Linear Regression, Random Forests, Supply Chain, Supply Chain Management (SCM), Supply Chain Optimization, Machine Learning Operations (MLOps), Exploratory Data Analysis, Matplotlib, Seaborn, MySQL, Amazon SageMaker, Spark, SAS, Apache Airflow, Recommendation Systems, eCommerce, Statistical Analysis, Statistical Modeling, LSTM, Dashboards, Excel 365, Pricing, Reporting, Agentic RAG Systems, LangGraph, Large Language Models (LLMs), LangChain, FastAPI, Feature Engineering, Agentic AI, Anthropic, Prompt Engineering, Retrieval-augmented Generation (RAG), Vector Databases, Statistical Methods, APIs, AI Agents, API Integration, Decision Modeling, RAG Systems, Claude Code, Pgvector, Supabase, AI Agent Orchestration, OpenAI API, Inventory Management, PostgreSQL, AI Architecture, Optical Character Recognition (OCR)

Data Scientist

2022 - 2023
Treat Technologies, Inc
  • Developed a classification model for each client to determine whether the client belongs to a high, low, or mid-value bucket.
  • Managed customer lifetime value estimation and estimated expenditures for the next 12 months.
  • Implemented all models and documented code and process.
Technologies: Google BigQuery, Python, Data Science, Data Build Tool (dbt), Google Cloud Platform (GCP), SQL, eCommerce, Statistical Analysis, Statistical Modeling, Excel 365, Feature Engineering

Senior Risk Modeling Associate

2018 - 2021
Banca Intesa Serbia
  • Developed predictive models for the probability of default (PD) and exposure at default (EAD) estimation for retail and corporate clients utilizing logistic regression, decision trees, and ensemble models. Implemented them into IT infrastructure.
  • Calculated IFRS 9 migration matrices, estimated TTC PDs, and automated the whole process flow, migrating it from complex Excel sheets to R.
  • Monitored models' performance monthly, reported to internal and external stakeholders, and optimized processes by automating rating calculations and data quality controls. Migrated the entire process from Excel workbooks to Power BI.
  • Participated in recruitment and training of new interns and supervised projects assigned to them.
Technologies: SAS, R, SQL, Microsoft Power BI, Machine Learning, Predictive Modeling, Data Science, Predictive Analytics, Data Analysis, Data Visualization, Reports, Data Analytics, Artificial Intelligence (AI), JSON, Data Reporting, Data Mining, Web Scraping, Big Data, Statistics, Data Pipelines, ETL, Data Engineering, Quantitative Finance, Finance, XGBoost, Random Forest Regression, Linear Regression, Random Forests, Financial Modeling, Exploratory Data Analysis, Financial Data, Tableau, Statistical Analysis, Statistical Modeling, Dashboards, Excel 365, Pricing, Reporting, Feature Engineering

Risk Data Scientist

2017 - 2018
Addiko Bank AG
  • Built predictive models for loan approvals and capital allocation, implemented these models, and performed monthly performance monitoring. Monitored the performance of retail credit portfolios of all AG SEE banks.
  • Created a completely automated model performance report for all member banks of AG SEE banks.
  • Developed a predictive model for the HR department to help them in their selection process.
Technologies: SAS, SQL, R, Python, Excel 2013, Machine Learning, Predictive Modeling, Data Science, Predictive Analytics, Data Analysis, Data Visualization, Reports, Data Analytics, Artificial Intelligence (AI), Jupyter Notebook, Data Reporting, Data Mining, Big Data, Statistics, Quantitative Finance, Finance, Random Forest Regression, Linear Regression, Random Forests, Financial Modeling, Exploratory Data Analysis, Matplotlib, Seaborn, Financial Data, Statistical Analysis, Statistical Modeling, Dashboards, Excel 365, Reporting, Feature Engineering

Reporting Anlayst

2015 - 2016
Nielsen
  • Learned to manipulate large datasets and derive insights from data that gave the sales team a better negotiating position and the ability to seal better deals for the company.
  • Automated most reporting processes, cutting report generation time by 40 to 50%.
  • Transitioned old reports into new templates that were easier to maintain and looked more professional.
Technologies: SQL, Excel VBA, Predictive Analytics, Data Analysis, Data Visualization, Reports, Data Analytics, Data Reporting, Data Mining, Exploratory Data Analysis, Dashboards, Excel 365, Pricing, Reporting

Experience

Automated Scorecard Builder

A fully-automated scorecard builder that can be used in PD modeling or any other binary classification task. The app goes through the standard workflow to create such models. Users can upload desired data, choose target variables, explore data, and perform automated coarse classing and variable selection, model fitting and testing, rating scale development, etc.

Real Estate Appraisal App

A real estate appraisal tool for the Belgrade RE market. By filling out all the
required data, the model estimates the price for an apartment and also makes comparisons with average values for that municipality. Also, a geolocation service is included in the app. The data was scraped from local real estate listing websites.

Time Series Forecasting in Spark

https://github.com/apantovic/spark_ts
A template for creating a pipeline in PySpark for time series forecasting. This template can be used for modeling hundreds or thousands time series at the same time.

Pipeline can be also adjusted to accommodate different types of regression/classification problems.

Education

2015 - 2020

Master's Degree in Statistics

University of Belgrade - Belgrade, Serbia

Skills

Libraries/APIs

Scikit-learn, Pandas, TensorFlow, Keras, XGBoost, Matplotlib, LSTM, OpenAI API, PySpark, Dask, REST APIs, PyTorch, Three.js

Tools

Microsoft Power BI, Seaborn, BigQuery, Apache Airflow, Claude Code, Azure Machine Learning, Excel 2013, Looker, Amazon SageMaker, Tableau

Languages

R, SQL, Python 3, SAS, Python, Excel VBA

Platforms

Databricks, Jupyter Notebook, Azure, Docker, Amazon Web Services (AWS), Google Cloud Platform (GCP)

Storage

JSON, MySQL, PostgreSQL, Data Pipelines

Frameworks

Apache Spark, Spark, LangGraph

Paradigms

ETL

Industry Expertise

Project Management

Other

Time Series, Time Series Analysis, Forecasting, Machine Learning, Data Science, Predictive Modeling, Predictive Analytics, Data Analysis, Data Visualization, Data Analytics, Data Reporting, Data Mining, Big Data, Statistics, Quantitative Finance, Finance, Random Forest Regression, Linear Regression, Random Forests, Supply Chain, Supply Chain Management (SCM), Supply Chain Optimization, Exploratory Data Analysis, Financial Data, eCommerce, Statistical Analysis, Statistical Modeling, Dashboards, Excel 365, Agentic RAG Systems, Large Language Models (LLMs), Feature Engineering, Statistical Methods, APIs, RAG Systems, Pgvector, Inventory Management, Quantitative Analysis, Reports, Artificial Intelligence (AI), Neural Networks, Natural Language Processing (NLP), Web Scraping, Data Engineering, Google BigQuery, Financial Modeling, Machine Learning Operations (MLOps), Risk Analysis, Generative Pre-trained Transformers (GPT), Recommendation Systems, Applied Mathematics, Pricing, Reporting, LangChain, FastAPI, Agentic AI, Anthropic, Prompt Engineering, Retrieval-augmented Generation (RAG), Vector Databases, AI Agents, API Integration, Decision Modeling, Supabase, AI Agent Orchestration, AI Architecture, Optical Character Recognition (OCR), User Behavior, Gradient Boosting, Data Build Tool (dbt), Vehicle Routing, Geographic Information Systems

Collaboration That Works

How to Work with Toptal

Toptal matches you directly with global industry experts from our network in hours—not weeks or months.

1

Share your needs

Discuss your requirements and refine your scope in a call with a Toptal domain expert.
2

Choose your talent

Get a short list of expertly matched talent within 24 hours to review, interview, and choose from.
3

Start your risk-free talent trial

Work with your chosen talent on a trial basis for up to two weeks. Pay only if you decide to hire them.

Top talent is in high demand.

Start hiring