Luca Puggini, Developer in Rome, Metropolitan City of Rome, Italy
Luca is available for hire
Hire Luca

Luca Puggini

Verified Expert  in Engineering

Data Scientist and Developer

Location
Rome, Metropolitan City of Rome, Italy
Toptal Member Since
December 4, 2020

Luca is a senior data scientist with a Master's degree in mathematics, a Ph.D. in machine learning, and several years of industry experience. He worked at Intel and the University of Bergen, and developed tools for network performance monitoring in high-frequency trading, data-driven algorithms for production optimization in semiconductor manufacturing, and anomaly detection applied to network security. Luca excels with complex AI projects from the research phase to deployment and production.

Portfolio

TechnipEnergies
Python 3, ChatGPT, Large Language Models (LLMs), Artificial Intelligence (AI)...
Freelance
Python 3, Amazon Web Services (AWS), Data Engineering, Statistical Analysis...
Pico
Python, JavaScript, Data Analytics, Data Engineering, Statistical Analysis...

Experience

Availability

Part-time

Preferred Environment

Scikit-learn, SQL, Pandas, NumPy, Artificial Intelligence (AI), Machine Learning, Python

The most amazing...

...software I've developed recognizes users from their network traffic and contained 30,000+ lines of Python code.

Work Experience

Digitail Innovation Coordinator

2023 - PRESENT
TechnipEnergies
  • Developed a tool that automatically completes engineering 3D models.
  • Created a tool using Large Language Models that finds the best matches between documents.
  • Managed a team of seven data scientists and software engineers.
Technologies: Python 3, ChatGPT, Large Language Models (LLMs), Artificial Intelligence (AI), Mentorship & Coaching, Time Series, Forecasting, Technical Leadership, Leadership, OpenAI GPT-3 API, Minimum Viable Product (MVP), Architecture, Software Architecture, Back-end Development, Machine Learning Operations (MLOps), Transformer Models

Data Scientist | Consultant

2020 - PRESENT
Freelance
  • Developed A/B testing and other statistics for an eCommerce plugin vendor to optimize conversion rate.
  • Helped a consulting company to enter the data-science industry.
  • Estimated the effect of water filters on the population's health in African villages.
  • Developed a chatbot for eCommerce using ChatGPT and LLM.
Technologies: Python 3, Amazon Web Services (AWS), Data Engineering, Statistical Analysis, Databases, JSON, ETL, Data Mining, Data Reporting, Neural Networks, Real-time Data, Technical Hiring, Source Code Review, Code Review, Task Analysis, Interviewing, APIs, Team Management, Jupyter Notebook, Dashboards, Data Lakes, Data Analysis, Model Development, Classification Algorithms, PostgreSQL, Pricing, Pricing Strategy, Data Wrangling, Jupiter, Data Collection, Computer Vision, Datasets, Financial Forecasting, Data Pipelines, Predictive Modeling, Statistical Modeling, Team Leadership, API Integration, Natural Language Processing (NLP), Generative Pre-trained Transformers (GPT), Amazon, OpenAI GPT-4 API, Large Language Models (LLMs), ChatGPT, OpenAI, Consulting, Time Series, Forecasting, OpenAI GPT-3 API, Prompt Engineering, Minimum Viable Product (MVP), Architecture, Software Architecture, Bots, Back-end Development, Machine Learning Operations (MLOps), Transformer Models, Chatbots, Retrieval-augmented Generation (RAG)

Vice President Data Scientist | Technical Lead

2022 - 2023
Pico
  • Served as the technical referent for data science for the whole company.
  • Developed a database containing market tick data (700 tera) and used it for computing trading metrics.
  • Oversaw and participated in the development of large software projects mainly focused on big data and data science.
Technologies: Python, JavaScript, Data Analytics, Data Engineering, Statistical Analysis, Data Visualization, Multivariate Statistical Modeling, Data Lake Design, Time Series Analysis, Algorithmic Trading Analysis, Databases, JSON, ETL, Data Mining, Data Reporting, Neural Networks, Real-time Data, Technical Hiring, Source Code Review, Code Review, Task Analysis, Interviewing, APIs, Team Management, Jupyter Notebook, Dashboards, Docker, Data Lakes, Data Analysis, Model Development, Classification Algorithms, PostgreSQL, Data Wrangling, Jupiter, Data Collection, Datasets, Financial Forecasting, Data Pipelines, Product Analytics, Product Development, Predictive Modeling, Statistical Modeling, Team Leadership, API Integration, Mentorship & Coaching, Time Series, Forecasting, Technical Leadership, Leadership, Minimum Viable Product (MVP), Architecture, Software Architecture, High-Frequency Trading (HFT), Back-end Development, Machine Learning Operations (MLOps)

Assistant Vice President Data Scientist

2019 - 2022
Pico
  • Developed tools for network performance monitoring in the high-frequency trading domain.
  • Built a REST API using Flask, enabling users to interact with product systems and to consume the generated data.
  • Ensured that all the shipped software was bug-free and able to scale as required.
Technologies: REST, Bash, SQL, Python, Data Engineering, Statistical Analysis, Data Modeling, Time Series Analysis, Databases, JSON, ETL, Data Mining, Data Reporting, Neural Networks, Real-time Data, Technical Hiring, Source Code Review, Code Review, Task Analysis, Interviewing, APIs, Jupyter Notebook, Dashboards, Docker, Data Analysis, Model Development, Deep Learning, Classification Algorithms, PostgreSQL, Data Wrangling, Jupiter, Data Collection, Datasets, Financial Forecasting, Data Pipelines, Product Analytics, Product Development, Predictive Modeling, Statistical Modeling, API Integration, High-frequency Trading (HFT), Finance, Time Series, Forecasting, Technical Leadership, Leadership, Architecture, Software Architecture, High-Frequency Trading (HFT), Back-end Development, Machine Learning Operations (MLOps)

Data Scientist

2016 - 2019
Corvil
  • Developed software to recognize users from their network traffic. Created 30,000+ lines of Python code containing advanced machine learning algorithms and highly optimized data pipelines.
  • Developed several data-based products mainly focused on anomaly detection applied to network security.
  • Tested the developed products, ensuring both statistical accuracy and scalability under heavy loads.
Technologies: Artificial Intelligence (AI), Machine Learning, Data Science, SQL, Python, Data Engineering, Statistical Analysis, Data Modeling, Time Series Analysis, Databases, JSON, ETL, Data Mining, Data Reporting, Neural Networks, Real-time Data, Technical Hiring, Source Code Review, Code Review, Task Analysis, Interviewing, APIs, Jupyter Notebook, Dashboards, Docker, Data Analysis, Model Development, Classification Algorithms, PostgreSQL, Data Wrangling, Jupiter, Data Collection, Datasets, Financial Forecasting, Data Pipelines, Product Analytics, Product Development, Predictive Modeling, Statistical Modeling, API Integration, Natural Language Processing (NLP), Generative Pre-trained Transformers (GPT), Time Series, Architecture, Software Architecture, High-Frequency Trading (HFT), Back-end Development, Machine Learning Operations (MLOps)

Demonstrator

2015 - 2016
Maynooth University
  • Demonstrated and tutored students of the C++ course for the Electronic Engineering Department of Maynooth University.
  • Evaluated students' skills in embedded C++ development.
  • Helped students to improve their C++ software development skills.
Technologies: C++, Arduino, Jupiter

Medical Statistician (Contract 20%)

2013 - 2016
University of Bergen
  • Investigated the relationship between menopause and asthma.
  • Analyzed data using classical statistical inference and epidemiological techniques.
  • Created visualizations to make results consumable to non-data-savvy users.
Technologies: Bayesian Statistics, Statistics, STATA, R, Statistical Analysis, Data Reporting, Task Analysis, Data Analysis, Model Development, Classification Algorithms, Data Wrangling, Predictive Modeling, Statistical Modeling

Visiting Researcher

2013 - 2016
Intel
  • Researched and developed new data-driven algorithms for production optimization in semiconductor manufacturing.
  • Developed an anomaly detection algorithm for optical emission spectroscopy high dimensional data collected during plasma etching.
  • Developed a supervised and unsupervised variable selection algorithm to reduce the cost of data collection.
Technologies: Statistics, Machine Learning, Python, Sensor Data, Model Development, Predictive Modeling, Statistical Modeling

User Recognition

The user-recognition project had the aim to recognize users by their activity on the network. It was a large software project composed of more than 30,000 lines of Python code containing highly optimized data pipelines and advanced machine-learning algorithms. The project lasted more than one year, and I have been responsible for the development from the initial statistical research to the full implementation of the product.

Anomaly Detection for Large Scale Network Data

I developed an anomaly detection system from scratch that can detect faults and anomalies in a network system. The software was able to analyze more than 100,000 series in real time and is currently used by many trading companies to monitor the efficiency of their networks. I handled both the model research phase and the actual implementation of the system.

A/B Testing for eCommerce SaaS

Worldz is a SaaS plugin that eCommerce businesses can add to their site to improve conversion rate and revenue. I developed the AI portion of the software, designing and implementing A/B tests and optimizing the tool.
2013 - 2016

Ph.D. in Data Science

Maynooth University - Maynooth, Ireland

2011 - 2013

Master's Degree in Mathematics and Computer Science

Tor Vergata University - Rome, Italy

2008 - 2011

Bachelor's Degree in Mathematics

Tor Vergata University - Rome, Italy

Libraries/APIs

NumPy, Pandas, Scikit-learn, XGBoost

Tools

ChatGPT, STATA

Languages

Python, Python 3, SQL, R, Bash, Bash Script, JavaScript, C++

Paradigms

Data Science, ETL, REST, Anomaly Detection

Platforms

Jupyter Notebook, Docker, Amazon, Amazon Web Services (AWS), Arduino

Storage

Data Pipelines, Data Lakes, Databases, JSON, Data Lake Design, PostgreSQL

Industry Expertise

High-frequency Trading (HFT)

Other

Machine Learning, Artificial Intelligence (AI), Mathematics, Statistics, APIs, Data Engineering, Data Analytics, Statistical Analysis, Data Modeling, Data Visualization, Multivariate Statistical Modeling, Time Series Analysis, Data Analysis, Model Development, Classification Algorithms, Data Mining, Data Reporting, Real-time Data, Technical Hiring, Source Code Review, Code Review, Task Analysis, Interviewing, Dashboards, Data Wrangling, Jupiter, Data Collection, Datasets, Product Analytics, Product Development, Predictive Modeling, Statistical Modeling, API Integration, Natural Language Processing (NLP), OpenAI GPT-4 API, Large Language Models (LLMs), OpenAI, Mentorship & Coaching, Consulting, Time Series, Forecasting, Technical Leadership, Leadership, Prompt Engineering, Minimum Viable Product (MVP), Architecture, Software Architecture, Bots, Back-end Development, Machine Learning Operations (MLOps), Retrieval-augmented Generation (RAG), Probability Theory, Neural Networks, Team Management, A/B Testing, Pricing, Pricing Strategy, Team Leadership, Finance, OpenAI GPT-3 API, High-Frequency Trading (HFT), Transformer Models, Chatbots, Numerical Methods, Bayesian Statistics, Sensor Data, Algorithmic Trading Analysis, Deep Learning, Computer Vision, Financial Forecasting, Generative Pre-trained Transformers (GPT)

Collaboration That Works

How to Work with Toptal

Toptal matches you directly with global industry experts from our network in hours—not weeks or months.

1

Share your needs

Discuss your requirements and refine your scope in a call with a Toptal domain expert.
2

Choose your talent

Get a short list of expertly matched talent within 24 hours to review, interview, and choose from.
3

Start your risk-free talent trial

Work with your chosen talent on a trial basis for up to two weeks. Pay only if you decide to hire them.

Top talent is in high demand.

Start hiring