
Drappeau Samia
Verified Expert in Engineering
Data Science Developer
Toulouse, France
Toptal member since May 7, 2020
Samia is an accomplished astrophysicist turned full-stack data scientist. She has a PhD in astronomy/astrophysics and leverages her critical and creative thinking with industry-oriented problem-solving skills. She offers innovative approaches to solving data strategy and data-driven problems using custom-built machine learning and deep learning approaches. Samia has published a dozen peer-reviewed articles and a book and developed specialized apps and predictive prototypes for clients.
Portfolio
Experience
- Data Visualization - 12 years
- Data Analysis - 12 years
- Python - 12 years
- Data Science - 12 years
- Machine Learning - 5 years
- SQL - 4 years
- Docker - 4 years
- Amazon Web Services (AWS) - 4 years
Availability
Preferred Environment
Amazon Web Services (AWS), Docker, Git, Python, Linux, OS X
The most amazing...
...project I've contributed to provides a fleet of hundreds of connected cars with personalized data services such as road hazard warning and overtake assistance.
Work Experience
Senior DataOps Engineer
BNP Paribas
- Contributed to the development of a CI/CD workflow enabling the lifecycle of the client's 10+ data platforms.
- Troubleshot advanced incidents in zero-trust environments.
- Contributed to the development of data science environments in Docker.
- Improved functional monitoring over time, such as usage tracking or audit requests.
- Provided training on best practices and user support to a community of 400+ data scientists and data engineers.
Senior Data Consultant
Preventive and Digital Archaeology
- Designed and developed a comprehensive data platform for archaeologists.
- Integrated multiple technical tools to streamline the data management process.
- Developed a full-stack data application that reduced data quality check time for archaeologists by 70%.
- Applied Agile methods and processes to promote a disciplined and transparent project management process.
Data Product Owner
Yara
- Collaborated with stakeholders and development teams to successfully deliver the first iteration of the end-to-end backbone of the data platform within three months.
- Facilitated communication and collaboration between the development team and stakeholders to ensure project requirements were met.
- Coordinated with cross-functional teams to ensure project goals and timelines were achieved.
- Employed Agile methodologies to prioritize tasks and manage project timelines.
- Demonstrated expertise in data product management to ensure the platform's successful launch.
AWS Data Developer
Freelance
- Significantly increased the UAT team's velocity on root-cause analysis and bug-fixing of SQL and Athena data pipelines with Python scripts.
- Resolved a persistent bug affecting multiple KPIs by identifying the root cause and contributing to code change within two weeks of joining the UAT team.
- Helped set up a reproducible bug-root-cause analysis environment, enabling capitalization of knowledge despite a high turnover team.
Senior Full-stack Data Scientist
Freelance
- Contributed to developing the big data platform for the HR division of a telecommunications company through both technical expertise and data expert leadership.
- Created a Qlik Sense app that helps HR managers tremendously in their daily work.
- Assisted in developing a prototype application to set end-to-end monitoring on the IT production platform through data expertise.
- Developed a Python application that uses machine learning to predict incident probability.
Full-stack Data Scientist and Scrum Master
Continental
- Collaborated in developing new services for connected vehicles by providing scrum teams with expertise in data science.
- Developed a personalized, most probable path service for connected vehicles, using geospatial time-series data and engineered business knowledge.
- Assisted teams in embodying Agile values and principles and supported them in applying Scrum or Kanban frameworks.
Astrophysicist
The University of Amsterdam and The University of Toulouse III
- Developed several spectral and timing models of multi-wavelength data observations that help researchers better understand the ins and outs of emissions on accreting black holes.
- Translated a legacy Fortran code into independent and modular C++ programs, resulting in a drastic gain in computation time.
- Published a book and a dozen peer-reviewed articles.
- Provided over 300 hours of teaching time at Bachelor's and Master's levels.
Experience
Road Weather App for Connected Vehicles
I was the lead data scientist and was in charge of labeling the raw geolocalized time-series data and training a model. I worked with the data engineer to integrate the model into Kafka Streams architectures and helped the second data scientist develop, in Python Bokeh, the front end to display the predictions in the driver dashboard.
Data Exchange Platform for Agriculture
I was the product owner and liaised with the stakeholders and the development teams to deliver the first version of the end-to-end backbone of the target data platform in under three months.
A Comprehensive Data Platform for Archaeologists
Archaeologists have widely adopted the platform, saving significant time and resources in data management and analysis.
Education
PhD in Astronomy and Astrophysics
University of Amsterdam - Amsterdam, Netherlands
Master's Degree in Theoretical and Mathematical Physics
Université de la Méditerranée - Marseille, France
Master's Degree in Subatomic Physics
Université Claude Bernard Lyon 1 - Lyon, France
Certifications
Data Engineer with Python
DataCamp
QlikSense Data Architect
Udemy
Deep Learning
Udacity
Skills
Libraries/APIs
Matplotlib, NumPy, Pandas, Scikit-learn, TensorFlow, Dropbox API
Tools
Git, Qlik Sense, Seaborn, Cloudera, Amazon Athena, Plotly, Kafka Streams, Esri, n8n, Keycloak, GitLab CI/CD, Vault
Languages
SQL, Python, C++, Fortran, IDL, Go
Paradigms
Agile, Microservices Architecture, Microservices
Platforms
OS X, Docker, Amazon Web Services (AWS), Linux, Apache Kafka, Heroku, Kubernetes
Frameworks
Streamlit, Swagger, Spark, Hadoop
Storage
PostgreSQL, Hasura
Other
Data Science, Data Analysis, Exploratory Data Analysis, Machine Learning, Data Engineering, Data Visualization, Deep Learning, RESTful Microservices, Data, IT Project Management, Dagster, Data Build Tool (dbt), MinIO, Metabase, GitOps, Argo Workflows, LDAP, IBM Cloud, Research, Scientific Data Analysis, Mathematics, Advanced Physics
How to Work with Toptal
Toptal matches you directly with global industry experts from our network in hours—not weeks or months.
Share your needs
Choose your talent
Start your risk-free talent trial
Top talent is in high demand.
Start hiring