Marco Pietrosanto, Developer in Rome, Metropolitan City of Rome Capital, Italy

Marco Pietrosanto

Verified Expert in Engineering

Data Engineer and Software Developer

Rome, Metropolitan City of Rome Capital, Italy

Toptal member since September 13, 2022

Bio

Marco has always been eager to help people understand the concepts behind technicalities and the reasons behind people's behaviors. He is a generalist who has worked as a physicist, a bioinformatics researcher, and a data scientist. Marco likes to delve into problems and understand the subtle connections between different fields, and he is eager to share his skills with teams working on challenging projects.

Portfolio

Omninext
AWS Lambda, Code Review, Communication, Data Science, Docker, Hiring...
Omninext
Python, Scikit-learn, Computer Vision, Keras, TensorFlow, Pandas, Visualization...
Self-employed
Unix, Teaching, Research, Problem Management, Data Science...

Experience

  • Scikit-learn - 6 years
  • Storytelling - 6 years
  • Pandas - 6 years
  • Research - 6 years
  • Python - 6 years
  • Code Review - 6 years
  • Remote Team Leadership - 2 years

Availability

Part-time

Preferred Environment

Python, Jira, Keras, Pandas, Scikit-learn

The most amazing...

...project I've developed and managed is a system worth €2 million that integrates geo, weather, and social data into risk indexes for territorial monitoring.

Work Experience

Project Manager

2021 - PRESENT
Omninext
  • Took over a project halfway through, worked with halved deadlines, and delivered three funded projects.
  • Coordinated the projects end-to-end (design, back-end, machine learning, front-end, marketing) while working as a data scientist with my team.
  • Became the go-to person when new opportunities arose and someone needed to speak with potential project partners or clients.
Technologies: AWS Lambda, Code Review, Communication, Data Science, Docker, Hiring, Computer Vision, Machine Learning, Jira, E-learning

Lead Data Scientist

2020 - PRESENT
Omninext
  • Developed the machine learning (ML) core of a service monitoring environmental risks and integrating social posts, environmental rasters, vector images, weather data, and satellite measurements.
  • Developed an ensemble model to forecast wind (eolic) energy production, designed for energy dispatchers handling non-programmable energy sources.
  • Built proofs of concept (POCs) for automated invoice reading, a delivery service checkpoint system, and tracking of unknown elements in videos through re-identification.
  • Coordinated project planning meetings and spoke on a regional radio station and at national and international tech fairs.
  • Taught data science approaches and methodologies to a non-technical Montenegrin audience in an ICT Cortex course to facilitate communication between managers, commercial and HR departments, and data scientists.
Technologies: Python, Scikit-learn, Computer Vision, Keras, TensorFlow, Pandas, Visualization, Storytelling, Communication, Code Review, Remote Team Leadership, QGIS, Data Science, Machine Learning, Hiring, Software Development, Problem Management

Consultant | Freelance

2022 - 2022
Self-employed
  • Worked as an online consultant on a bioinformatics strategy for analyzing the results of an RNA-seq experiment. I built the pipeline and taught a researcher how to code and reproduce it.
  • Developed a system that proposes a set of incentives that the Italian state could implement to reduce CO2 emissions by 2050. The problem could not be framed as a classical minimization, so we chose Monte Carlo simulation as the solution.
  • Translated board games for an Italian publisher, which allowed me to transfer my skills from one field to another.
Technologies: Unix, Teaching, Research, Problem Management, Data Science, Software Development, Communication

Data Scientist | Postdoctoral Researcher

2016 - 2020
Helmer-Citterich Laboratory
  • Taught coding to first-year bioinformatics master of science (MSc) students, including how to approach it from a non-programmer point of view and how to understand and implement an algorithm.
  • Taught data science to final-year bioinformatics MSc students, including pitfalls and strategies for using Python effectively to rule over data.
  • Oversaw PhD and MSc students and supported them in their research and thesis writing.
Technologies: Python, University Teaching, Research, Code Review, Data Science, Machine Learning, Hiring, Teaching, Unix, Statistics, Software Development, Unix Shell Scripting, Scientific Computing, Problem Management, Communication, Visualization, Pandas, Keras

Experience

Omnienergy

https://www.omni-energy.it/
A system that predicts the energy that renewable plants will produce in the short term, i.e., between 1 and 12 hours ahead. I communicated with partners and external collaborators as the project and technical lead. I also directed and actively developed the ML core.

The system uses an ensemble model that directs the contributions of its sub-models, following the hypothesis that wind (eolic) energy production does not follow one specific regime; instead, each sub-model's contribution changes dynamically over time. I came up with the concept of an orchestra conductor for the storytelling, so the specific models were named after particular music compositions, like Minuet for the wind energy model and Bolero for its photovoltaic iteration.
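
To illustrate the general idea of sub-models contributing dynamically, here is a minimal sketch, not the project's actual code. It assumes one simple weighting rule (inverse recent mean absolute error over a sliding window); the class name `DynamicEnsemble` and its parameters are hypothetical, and the real system may well combine its sub-models differently.

```python
from collections import deque


class DynamicEnsemble:
    """Blend sub-model forecasts with weights based on recent errors.

    Illustrative assumption: a sub-model that was more accurate over the
    last `window` observations gets a larger share of the next forecast.
    """

    def __init__(self, n_models, window=3, eps=1e-9):
        # One sliding window of absolute errors per sub-model.
        self.errors = [deque(maxlen=window) for _ in range(n_models)]
        self.eps = eps  # avoids division by zero for a perfect model

    def weights(self):
        # Mean absolute error per sub-model over its window; a sub-model
        # with no history yet gets a neutral error of 1.0.
        maes = [sum(e) / len(e) if e else 1.0 for e in self.errors]
        inverse = [1.0 / (m + self.eps) for m in maes]
        total = sum(inverse)
        return [v / total for v in inverse]

    def predict(self, forecasts):
        # Weighted average of the sub-model forecasts.
        return sum(w * f for w, f in zip(self.weights(), forecasts))

    def update(self, forecasts, actual):
        # Record each sub-model's absolute error once the true value is known.
        for errs, f in zip(self.errors, forecasts):
            errs.append(abs(f - actual))


ensemble = DynamicEnsemble(n_models=2)
blended = ensemble.predict([10.0, 20.0])  # equal weights before any feedback
ensemble.update([10.0, 20.0], actual=10.0)  # first sub-model was exact
shifted = ensemble.predict([10.0, 20.0])  # now leans toward the first sub-model
```

The sliding window is what makes the weighting dynamic: as conditions change, older errors drop out and a different sub-model can take the lead.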

SeVaRA

https://www.sevara.it/
A system that predicts short-term environmental risks. It integrates social posts, weather data, satellite images, and population information to train a multi-XGBoost model to predict risk values for people, buildings, and the environment.

After being put in charge of the project halfway through, i.e., for its last 1.5 years, I had to rebuild it entirely while respecting contracts and grants of around €2 million. Additionally, I had to act as a scientific validator, understand the needs and capabilities of all external collaborators, and communicate with all the stakeholders to help them understand what the project was really about. Finally, I discovered that one of the collaborators had not delivered the ML core of the project, even though everyone before me thought he had, so I built it myself.

Datada

https://datada.it/
A B2B service for the HoReCa field that supports analyzing and forecasting product demand. The system can cluster providers, analyze product time series, and suggest which products are sold together.
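
As a rough illustration of the "sold together" suggestions, here is a minimal sketch that assumes plain pair co-occurrence counting over orders. The function names and the sample orders are hypothetical, and the actual service may use a different technique.

```python
from collections import Counter
from itertools import combinations


def co_purchase_counts(orders):
    """Count how often each pair of products appears in the same order."""
    counts = Counter()
    for order in orders:
        # Deduplicate and sort so (a, b) and (b, a) map to the same pair.
        for pair in combinations(sorted(set(order)), 2):
            counts[pair] += 1
    return counts


def suggest_with(product, orders, top_n=3):
    """Return the products most often bought together with `product`."""
    scores = Counter()
    for (a, b), n in co_purchase_counts(orders).items():
        if a == product:
            scores[b] += n
        elif b == product:
            scores[a] += n
    return [item for item, _ in scores.most_common(top_n)]


# Made-up sample orders, for illustration only.
sample_orders = [
    ["flour", "yeast"],
    ["flour", "yeast", "salt"],
    ["salt", "oil"],
]
suggestions = suggest_with("flour", sample_orders)
```

Raw counts favor popular products; a production system would likely normalize them (for example, by how often each product is sold overall) before ranking.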

Education

2013 - 2016

PhD in Bioinformatics

University of Rome "Tor Vergata" - Rome, Italy

2008 - 2013

Master's Degree in Physics

Sapienza University of Rome - Rome, Italy

Certifications

APRIL 2020 - PRESENT

The Carpentries Instructor

The Carpentries

Skills

Libraries/APIs

Scikit-learn, Keras, Pandas, TensorFlow

Tools

Jira

Languages

Python

Industry Expertise

Teaching, E-learning

Platforms

Unix, Docker, AWS Lambda

Storage

Amazon DynamoDB

Other

Storytelling, Communication, University Teaching, Problem Management, Code Review, Data Science, Visualization, Research, Mathematics, IT Project Management, Hiring, Machine Learning, Computer Vision, Scientific Computing, Unix Shell Scripting, Software Development, Statistics, Remote Team Leadership, QGIS, Data Analysis, Data Science Product Manager
