Francesco Fontan, Developer in Berlin, Germany
Francesco is available for hire
Hire Francesco

Francesco Fontan

Verified Expert  in Engineering

Data Scientist and Developer

Location
Berlin, Germany
Toptal Member Since
January 30, 2023

Francesco is a seasoned data scientist with robust analytical and technical capabilities. His area of interest is tackling business problems using traditional machine learning and deep learning for computer vision or natural language processing (NLP) tasks. Fascinated by operational research, optimization, and GPU-accelerated computing, Francesco has a strong machine learning and cloud engineering background that helps him drive conversations and coordinate heterogeneous teams.

Portfolio

Levi Strauss & Co.
Data Science, Optimization, Pricing, Promotion, Artificial Intelligence (AI)...
Delivery Hero
Python, Optimization, Data Science, Artificial Intelligence (AI), Docker...
Machine Learning Reply
Python, PyTorch, Deep Learning, Machine Learning, Forecasting, Computer Vision...

Experience

Availability

Part-time

Preferred Environment

Linux, Python 3, Pandas, Scikit-learn, TensorFlow, PyTorch, Hugging Face

The most amazing...

...thing I've developed is a scheduling tool for a cruise company using forecasting and optimization, saving $2 million per year.

Work Experience

Data Scientist

2022 - PRESENT
Levi Strauss & Co.
  • Implemented a system that proposes the most effective promotions for all US stores, such as buy one get one or X% off. This system optimized key business KPIs (margin increased by at least 2 – 3%) and provided valuable insights.
  • Led the technical global promotion team, providing recommendations for five global markets for retail stores, outlets, and eCommerce, creating value of around $20 million in additional revenue annually.
  • Redesigned the pricing recommendation tool, increasing speed by six times using BigQuery and Apache Airflow.
  • Supported the migration from AWS to Google Cloud Platform (GCP), coordinating the work between data engineers and machine learning (ML) engineers.
  • Developed a GenAI product description generator for 2,000+ items and 20+ languages with PaLM2 and Imagen.
Technologies: Data Science, Optimization, Pricing, Promotion, Artificial Intelligence (AI), ChatGPT, Docker, CI/CD Pipelines, SQL, Forecasting, Data Analysis, Data Analytics, Language Models, Software Architecture, Containerization, FastAPI, Apache Airflow, Big Data, Time Series, Chatbot, Looker, OpenAI

Data Scientist

2022 - 2022
Delivery Hero
  • Designed the first version of the picker scheduling tool that optimized the shifts of the people working in dark stores leveraging Python and mixed-integer programming (MIP), reducing the costs by more than $1 million per year.
  • Prototyped the first version of a smart location-based inventory that suggests where to place items optimally to minimize the picking time and other operational activities inside a warehouse.
  • Created automated pipelines for autoformatting using Python and SQL codes based on custom rules, helping data scientists to speed up deploys to two hours per person per sprint and bringing uniformity across different teams.
Technologies: Python, Optimization, Data Science, Artificial Intelligence (AI), Docker, CI/CD Pipelines, SQL, Data Modeling, Forecasting, Data Analysis, Data Analytics, Software Architecture, Containerization, Apache Airflow, Big Data, Time Series, Open-source LLMs, Looker

Data Scientist

2017 - 2021
Machine Learning Reply
  • Created text classification to analyze emails automatically, speeding up the entire business process by ten times.
  • Designed an optimization tool for a cruise company that handles embarking and disembarking for more than 50,000 people, resulting in estimated average savings of $1 million annually.
  • Conducted ten lectures for the course "AI and ML: Platforms and vendor solutions" to graduate students enrolled in the second-level master studies in artificial intelligence and cloud at the Polytechnic University of Turin.
  • Worked on a recommender system for Reply's internal social network using traditional collaborative filtering methods, item-based models, and NLP techniques. The algorithms handled over 10 thousand active daily users and increased engagement by 10%.
  • Redesigned an ML model for swaption prices, achieving better performance by decreasing the mean squared error (MSE) by 10% compared to the previous implementation and lowering the RAM required by 30%, with an increased speed by 1.5 times.
  • Built a system to detect and classify various road defects, as predictive maintenance applied to highway asphalt is crucial to cut costs. This object detection model was based on YOLOv5.
  • Organized and delivered more than 20 Nvidia courses on the latest in machine learning and deep learning as an experienced instructor.
Technologies: Python, PyTorch, Deep Learning, Machine Learning, Forecasting, Computer Vision, Natural Language Processing (NLP), GPT, Generative Pre-trained Transformers (GPT), Categorization, Regression, Google Cloud Platform (GCP), Unsupervised Learning, Keras, Computer Vision Algorithms, Bots, Email, Docker, CI/CD Pipelines, Amazon Web Services (AWS), Data Science, SQL, Databricks, Financial Analysis, Azure, Data Analysis, Data Analytics, Language Models, Software Architecture, Containerization, FastAPI, Apache Airflow, Big Data, Time Series, Chatbot, Open-source LLMs, Looker, Azure OpenAI Service

Data Scientist

2015 - 2015
Evo Pricing
  • Onboarded two major customers with revenues exceeding $1 billion, showcasing my ability to build strong relationships and deliver value to high-profile clients.
  • Implemented automation for initial analysis during customer onboarding, reducing development and analysis time from 25 days to just three days, enabling swift and accurate evaluation of elasticities, sales trends, product segmentation, and inventory.
  • Architected a computer vision system that effectively recognized similar competitor products using images. This implementation enhanced the accuracy of our competitor data, leading to improved forecasting algorithms (-5% MSE).
Technologies: R, Dynamic Pricing, Optimization, Data Science, SQL, Pricing, Data Analysis, Data Analytics, Big Data, Time Series

NLP Ticketing System

I led a team of three people to design a machine learning tool on Microsoft Azure to improve the ticketing management of a telco customer.

This system analyzed emails received from customer support and logs from the network infrastructure, extracting useful information using NLP techniques and LLMs to open, assign, and finally close tickets fully automatically.

This automation increased the speed by ten times, predicting more than ten ticket fields with an average accuracy of 90%. Finally, the pipeline could scale: training was performed on half a million records, while the inference module handled more than 20 tickets per hour.

CV for Predictive Maintenance

Implementing predictive maintenance for highway asphalt is a pivotal approach to cost reduction and enhancing the overall service quality for motorists. Within this project, I engineered a sophisticated system to identify and categorize diverse road pavement defects effectively.

This solution harnesses an object detection model, utilizing Yolo v5, in conjunction with external data points pertinent to the road section under scrutiny, such as traffic patterns and weather conditions.

Notably, the recall rate surpassed 80%, and the tool achieved significant cost savings of 600,000 euros in its inaugural year of operation.

Chatbot QA

I created a versatile chat and voice bot for Question and Answer (QA) and Requesting Assistance and Guidance (RAG) using Llama 2, LangChain, and Weaviate. This system seamlessly integrates chat and voice features, including phone call interactions using Twilio, speech-to-text (Whisper AI), and text-to-speech (Polly) models. The bot's Natural Language to SQL module enhances information retrieval and user interactions, effectively managing approximately 1,000 daily calls with an 85% success rate. Implemented in Python with FastAPI, the system offers robust performance.

Smart Planning for a Cruise Line

The main goal was to use artificial intelligence to enhance the planning phase of crew members by optimizing the global fleet planning by minimizing the money spent on flight tickets without lowering the expected quality.

I designed a Python tool based on Google OR-Tools, an optimization suite by Google AI, able to solve MIP problems that involved more than 30,000 embarks every year, obtaining an estimated average saving of 10%, corresponding to $1 million per year.

NVIDIA Instructor

In my role at Machine Learning Reply, I have been instrumental in offering a range of training courses tailored to meet the unique needs of our clientele. My primary focus involves delivering workshops that delve into the intricate realms of deep learning and accelerated computing.

Courses Delivered:

Fundamentals of Deep Learning
Building Transformer-Based Natural Language Processing Applications
Building Intelligent Recommender Systems
Fundamentals of Accelerated Data Science with RAPIDS
Fundamentals of Deep Learning for Multiple Data Types
Fundamentals of Accelerated Computing with CUDA Python and C/C++

Optimizing Budget Allocation

As the technical team leader for this project at an airline company, I spearheaded the development of a tool to optimize budget allocation. The objective was to recommend the most effective distribution of the marketing budget between campaigns to enhance traffic volume and conversion rates.

To achieve this, we crafted forecasting models that meticulously analyzed ticket sales over time, factoring in external variables such as weather, competitor actions, and specific events. Additionally, we engineered a system capable of predicting future visitor numbers based on a set of marketing campaigns.

Through these efforts, we estimated an annual cost savings of approximately $300,000.

Swaption

I led the overhaul of a critical machine learning model and its associated data pipeline, which played a pivotal role in Swaption price prediction by applying the Black-Scholes theorem. My efforts yielded significant improvements, most notably a remarkable 10% reduction in Mean Squared Error (MSE) compared to the previous implementation.

Additionally, I achieved resource optimization by slashing RAM requirements by an impressive 30% while accelerating model processing by a factor of 1.5x. To fortify the model's reliability, I strategically employed Ensemble methods, harnessing the strengths of XGBoost, Random Forest, and Neural Network techniques.

Rec Sys for Internal Social Network

I developed a recommender system for Reply's internal social network using traditional collaborative filtering methods, item-based models, but also NLP techniques to extrapolate information from posts and user profiles. These algorithms were entirely on the AWS cloud and handled more than 10k
active daily users.

Optimization of Budget Allocation

For an airline company, two metrics are the key to success: conversion rate and traffic volume.

I was the technical team leader in this project that aimed to define a tool capable of suggesting how to distribute, in the most effective way, the budget between marketing campaigns to increase volumes and more aggressive pricing policies to increase the conversion rate.

We developed forecasting models to analyze the tickets sold over time and consider external factors such as weather, competitors, and particular events. But also a system able to predict the number of visitors in the future given a set of marketing campaigns. Finally, we estimated a cost saving of around $300,000 per year.

NLP for Voicebot

A versatile chatbot and voicebot that I designed for Q&A and requesting assistance and guidance. This system seamlessly integrates chat and voice features, including phone call interactions through technologies like Twilio, speech-to-text, and text-to-speech models. The bot's natural language to SQL module enhances information retrieval and user interactions, effectively managing approximately 1,000 daily calls with an 85% success rate. Implemented in Python with FastAPI, the system offers robust performance.

Generative AI for Product Descriptions in eCommerce

As the project leader, I spearheaded the automation of more than 5,000 product description generations in over 20 languages using Generative AI. Within a team of five, we successfully developed a promising prototype and are currently working on scaling the solution for multiple markets. Our methodology involves leveraging product attributes, images, and customer reviews to create HTML descriptions for the website, employing a pipeline with three language model modules and a focus on prompt engineering and few-shot learning using ImageGen and Palm2, with potential plans for fine-tuning in the future.

Dynamic Pricing in Retail

I spearheaded a dynamic pricing optimization initiative at Levi's, overseeing a team of four specialists across data science and engineering domains. Our project targeted promotional periods across Europe, the US, and Mexico, covering mainline, outlet, and eCommerce channels. Through demand forecasting utilizing sktime, statsforecast, and XGBoost, coupled with optimization via mixed integer programming, we maximized quantity sold, revenue, and margin while adhering to inventory constraints and business rules. Deployment on GCP with Kubeflow and Airflow ensured scalability, supported by CI/CD with Jenkins. This implementation contributed over $40 million in annual revenue, affirming the significant impact of our data-driven strategies on financial outcomes.

Bundle Promotion Optimization

Led a bundle promotion optimization project aimed at enhancing markdown and pricing strategies by incorporating bundle offers such as "Buy one, get another item at 50%." Utilizing demand forecasting techniques with sktime, statsforecast, and XGBoost, we integrated mixed integer programming optimization to maximize quantity sold, revenue, and margin while considering inventory constraints and business rules. Deployment in US outlets led to a remarkable revenue increase of over 10% based on recent A/B tests facilitated by Kubeflow for scalability and effectiveness.

Languages

Python 3, Python, R, C++, SQL

Libraries/APIs

Pandas, Scikit-learn, TensorFlow, PyTorch, Keras, SpaCy, RAPIDS, XGBoost, Dask

Tools

ChatGPT, Apache Airflow, Looker, Azure OpenAI Service, MATLAB, You Only Look Once (YOLO), Whisper

Paradigms

Data Science, DevOps

Platforms

Google Cloud Platform (GCP), Linux, Docker, Amazon Web Services (AWS), Databricks, Azure, Twilio, NVIDIA CUDA, Kubeflow

Other

Statistics, Probability Theory, Deep Learning, Machine Learning, Optimization, Natural Language Processing (NLP), Computer Vision, Machine Learning Operations (MLOps), Pricing, Artificial Intelligence (AI), GPT, Generative Pre-trained Transformers (GPT), Data Modeling, Data Analysis, Language Models, Big Data, Time Series, Chatbot, Large Language Models (LLMs), Open-source LLMs, OpenAI, Hugging Face, Network Science, Forecasting, Bots, Email, CI/CD Pipelines, Data Analytics, Marketing Analytics, Software Architecture, Containerization, Promotion, Cloud, Data Engineering, GPU Computing, Text Classification, Reinforcement Learning, Categorization, Regression, Unsupervised Learning, Dynamic Pricing, Computer Vision Algorithms, BERT, Neural Networks, Torch, Object Detection, Predictive Modeling, FastAPI, Voice Chat, Chatbots, Recommendation Systems, Financial Engineering, Collaborative Filtering, Financial Analysis, LangChain, Sales Forecasting, Llama 2

2015 - 2017

Master's Degree in Mathematical Engineering

Polytechnic University of Turin - Turin, Italy

2012 - 2015

Bachelor's Degree in Mathematics and Computer Science

Polytechnic University of Turin - Turin, Italy

JUNE 2022 - PRESENT

TensorFlow Developer Certificate

TensorFlow

JANUARY 2022 - PRESENT

Professional Machine Learning Engineer

Google Cloud

OCTOBER 2019 - PRESENT

Professional Data Engineer

Google Cloud

FEBRUARY 2019 - PRESENT

Deep Learning Institute Certified Instructor

NVIDIA

JULY 2018 - PRESENT

Microsoft DAT257x: Reinforcement Learning Explained

edX

MARCH 2018 - PRESENT

Deep Learning

DeepLearning.AI

Collaboration That Works

How to Work with Toptal

Toptal matches you directly with global industry experts from our network in hours—not weeks or months.

1

Share your needs

Discuss your requirements and refine your scope in a call with a Toptal domain expert.
2

Choose your talent

Get a short list of expertly matched talent within 24 hours to review, interview, and choose from.
3

Start your risk-free talent trial

Work with your chosen talent on a trial basis for up to two weeks. Pay only if you decide to hire them.

Top talent is in high demand.

Start hiring