Drappeau Samia, Developer in Toulouse, France
Drappeau is available for hire
Hire Drappeau

Drappeau Samia

Verified Expert  in Engineering

Data Science Developer

Location
Toulouse, France
Toptal Member Since
May 7, 2020

Samia is an accomplished astrophysicist turned full-stack data scientist. She has a PhD in astronomy/astrophysics and leverages her critical and creative thinking with industry-oriented problem-solving skills. She offers innovative approaches to solving data strategy and data-driven problems using custom-built machine learning and deep learning approaches. Samia has published a dozen peer-reviewed articles and a book and developed specialized apps and predictive prototypes for clients.

Portfolio

BNP Paribas
Kubernetes, Python, Docker, Argo Workflows, Keycloak, GitLab CI/CD, LDAP, Vault...
Preventive and Digital Archaeology
Agile, Python, SQL, PostgreSQL, Hasura, Amazon Web Services (AWS)...
Yara
Agile, Data

Experience

Availability

Part-time

Preferred Environment

Amazon Web Services (AWS), Docker, Git, Python, Linux, OS X

The most amazing...

...project I've contributed to provides a fleet of hundreds of connected cars with personalized data services such as road hazard warning and overtake assistance.

Work Experience

Senior DataOps Engineer

2023 - PRESENT
BNP Paribas
  • Contributed to the development of a CI/CD workflow enabling the lifecycle of the client's 10+ data platforms.
  • Troubleshot advanced incidents in zero-trust environments.
  • Contributed to the development of data science environments in Docker.
  • Improved functional monitoring over time, such as usage tracking or audit requests.
  • Provided training on best practices and user support to a community of 400+ data scientists and data engineers.
Technologies: Kubernetes, Python, Docker, Argo Workflows, Keycloak, GitLab CI/CD, LDAP, Vault, IBM Cloud

Senior Data Consultant

2020 - PRESENT
Preventive and Digital Archaeology
  • Designed and developed a comprehensive data platform for archaeologists.
  • Integrated multiple technical tools to streamline the data management process.
  • Developed a full-stack data application that reduced data quality check time for archaeologists by 70%.
  • Applied Agile methods and processes to promote a disciplined and transparent project management process.
Technologies: Agile, Python, SQL, PostgreSQL, Hasura, Amazon Web Services (AWS), Data Visualization, Data Engineering

Data Product Owner

2021 - 2021
Yara
  • Collaborated with stakeholders and development teams to successfully deliver the first iteration of the end-to-end backbone of the data platform within three months.
  • Facilitated communication and collaboration between the development team and stakeholders to ensure project requirements were met.
  • Coordinated with cross-functional teams to ensure project goals and timelines were achieved.
  • Employed Agile methodologies to prioritize tasks and manage project timelines.
  • Demonstrated expertise in data product management to ensure the platform's successful launch.
Technologies: Agile, Data

AWS Data Developer

2021 - 2021
Freelance
  • Significantly increased the UAT team's velocity on root-cause analysis and bug-fixing of SQL and Athena data pipelines with Python scripts.
  • Resolved a persistent bug affecting multiple KPIs by identifying the root cause and contributing to code change within two weeks of joining the UAT team.
  • Helped set up a reproducible bug-root-cause analysis environment, enabling capitalization of knowledge despite a high turnover team.
Technologies: SQL, Python, Amazon Athena, Agile, Exploratory Data Analysis, Pandas, Matplotlib, Data Analysis

Senior Full-stack Data Scientist

2019 - 2021
Freelance
  • Contributed to developing the big data platform for the HR division of a telecommunications company through both technical expertise and data expert leadership.
  • Created a Qlik Sense app that helps HR managers tremendously in their daily work.
  • Assisted in developing a prototype application to set end-to-end monitoring on the IT production platform through data expertise.
  • Developed a Python application that uses machine learning to predict incident probability.
Technologies: Machine Learning, Data Science, Python, Heroku, Swagger, Docker, Spark, Hadoop, Cloudera, Qlik Sense, Exploratory Data Analysis, Pandas, Seaborn, Streamlit, Scikit-learn, Matplotlib, Data Analysis, Data Visualization

Full-stack Data Scientist and Scrum Master

2017 - 2019
Continental
  • Collaborated in developing new services for connected vehicles by providing scrum teams with expertise in data science.
  • Developed a personalized, most probable path service for connected vehicles, using geospatial time-series data and engineered business knowledge.
  • Assisted teams in embodying Agile values and principles and supported them in applying Scrum or Kanban frameworks.
Technologies: Amazon Web Services (AWS), Machine Learning, Data Science, Agile, Kubernetes, Docker, Apache Kafka, Go, Python, Exploratory Data Analysis, Pandas, Seaborn, Deep Learning, Microservices, PostgreSQL, TensorFlow, NumPy, Microservices Architecture, Streamlit, RESTful Microservices, Scikit-learn, Matplotlib, Data Analysis

Astrophysicist

2008 - 2016
The University of Amsterdam and The University of Toulouse III
  • Developed several spectral and timing models of multi-wavelength data observations that help researchers better understand the ins and outs of emissions on accreting black holes.
  • Translated a legacy Fortran code into independent and modular C++ programs, resulting in a drastic gain in computation time.
  • Published a book and a dozen peer-reviewed articles.
  • Provided over 300 hours of teaching time at Bachelor's and Master's levels.
Technologies: Git, IDL, Fortran, C++, Python, Exploratory Data Analysis, Pandas, Seaborn, NumPy, Matplotlib, Data Analysis, Data Visualization

Road Weather App for Connected Vehicles

A data app for an on-the-road, real-time weather forecast powered by machine learning.

I was the lead data scientist and was in charge of labeling the raw geolocalized time-series data and training a model. I worked with the data engineer to integrate the model into Kafka Streams architectures and helped the second data scientist develop, in Python Bokeh, the front end to display the predictions in the driver dashboard.

Data Exchange Platform for Agriculture

An open-data exchange platform for agriculture actors.

I was the product owner and liaised with the stakeholders and the development teams to deliver the first version of the end-to-end backbone of the target data platform in under three months.

A Comprehensive Data Platform for Archaeologists

As a data team of one, I designed and developed a comprehensive data platform for archaeologists. It utilized Esri and Dropbox as the data source layer, n8n workflows and custom Python scripts for the data ingestion layer, MinIO and PostgreSQL for the data storage layer, Dagster for the data orchestration layer, dbt for the data transformation layer, and Metabase as the data visualization layer. In addition, I implemented the Hugging Face framework for building and deploying machine learning models to provide advanced data analysis capabilities. Finally, it leverages GitOps as the operational framework to automate code management and deployment, increasing efficiency and productivity.

Archaeologists have widely adopted the platform, saving significant time and resources in data management and analysis.

Libraries/APIs

Matplotlib, NumPy, Pandas, Scikit-learn, TensorFlow, Dropbox API

Paradigms

Data Science, Agile, Microservices Architecture, Microservices

Other

Data Analysis, Exploratory Data Analysis, Machine Learning, Data Engineering, Data Visualization, Deep Learning, RESTful Microservices, Data, IT Project Management, Dagster, Data Build Tool (dbt), MinIO, Metabase, GitOps, Argo Workflows, LDAP, IBM Cloud, Research, Scientific Data Analysis, Mathematics, Advanced Physics

Languages

SQL, Python, C++, Fortran, IDL, Go

Tools

Git, Qlik Sense, Seaborn, Cloudera, Amazon Athena, Plotly, Kafka Streams, Esri, N8n, Keycloak, GitLab CI/CD, Vault

Platforms

OS X, Docker, Amazon Web Services (AWS), Linux, Apache Kafka, Heroku, Kubernetes

Frameworks

Streamlit, Swagger, Spark, Hadoop

Storage

PostgreSQL, Hasura

2008 - 2012

PhD in Astronomy and Astrophysics

University of Amsterdam - Amsterdam, Netherlands

2007 - 2008

Master's Degree in Theoretical and Mathematical Physics

Université de la Méditerranée - Marseille, France

2005 - 2007

Master's Degree in Subatomic Physics

Université Claude Bernard Lyon 1 - Lyon, France

MAY 2020 - PRESENT

Data Engineer with Python

DataCamp

NOVEMBER 2019 - PRESENT

QlikSense Data Architect

Udemy

JANUARY 2017 - PRESENT

Deep Learning

Udacity

Collaboration That Works

How to Work with Toptal

Toptal matches you directly with global industry experts from our network in hours—not weeks or months.

1

Share your needs

Discuss your requirements and refine your scope in a call with a Toptal domain expert.
2

Choose your talent

Get a short list of expertly matched talent within 24 hours to review, interview, and choose from.
3

Start your risk-free talent trial

Work with your chosen talent on a trial basis for up to two weeks. Pay only if you decide to hire them.

Top talent is in high demand.

Start hiring