Mouad Khouy, Developer in Colomiers, France
Mouad is available for hire
Hire Mouad

Mouad Khouy

Verified Expert  in Engineering

Bio

Mouad is a senior data engineer with 7+ years of experience delivering end-to-end big data use cases for Fortune 500 companies. He specializes in the Palantir Foundry platform and PySpark and skillfully designs, builds, and maintains data pipelines. He achieved a 90% performance boost by auditing and performance-tuning a complex PySpark data pipeline. Mouad enables businesses to make data-driven decisions and mentors engineers to apply best practices and maintain high-level performance.

Portfolio

Capgemini
Foundry, PySpark, SQL, Python, TypeScript, PostgreSQL, Big Data...
Stellantis
Foundry, PySpark, SQL, PostgreSQL, REST APIs, Data Integration, Data Pipelines...
Airbus
Foundry, PySpark, Big Data, ETL, SQL, Data Governance, Data Quality Management...

Experience

  • Foundry - 7 years
  • PostgreSQL - 7 years
  • PySpark - 7 years
  • Python - 7 years
  • Data Pipelines - 7 years
  • ETL - 7 years
  • Data Engineering - 7 years
  • Ontologies - 5 years

Availability

Part-time

Preferred Environment

Foundry, PySpark, ETL, Data Engineering, Ontologies, Python, TypeScript, PostgreSQL, Databricks, Data Integration

The most amazing...

...thing I've done is audit and performance-tune a complex PySpark data pipeline, remarkably improving performance by around 90%.

Work Experience

Data Engineer | Senior Consultant

2022 - PRESENT
Capgemini
  • Collaborated with clients and local teams to deliver modern data products and build relationships.
  • Analyzed current business practices, processes, and procedures and identified future opportunities for leveraging Foundry services and implementing effective metrics and monitoring processes.
  • Translated business problems into Foundry operational improvements and end user solutions in collaboration with internal and external stakeholders.
  • Capitalized on Palantir Foundry data engineering best practices and created hands-on shares with Palantir Foundry data engineers.
  • Animated an internal Palantir Foundry community of more than 225 members and demonstrated new Palantir Foundry releases.
Technologies: Foundry, PySpark, SQL, Python, TypeScript, PostgreSQL, Big Data, Data Engineering, ETL, Data Integration, Ontologies, Databricks, Spark, Data Modeling, Technical Leadership

Senior Data Engineer | Technical Leader

2022 - 2024
Stellantis
  • Designed and implemented an end-to-end Palantir Foundry solution for car claims management.
  • Collected and integrated cars and claims data into Foundry using data connections from disparate and various sources such as SQL databases, cloud storage, and REST APIs.
  • Implemented data transformations in PySpark and Pipeline Builder to derive new datasets and create ontology objects.
  • Maintained a qualified data pipeline by integrating data expectations and data health checks.
  • Created Palantir Object View and Workshop applications used by more than 450 users worldwide to take actions and interact with the ontology.
  • Integrated the automatic classification of claims based on user rules defined using the Foundry Rules tool.
  • Established notifications and emails sent to users and car dealers based on object changes and users' actions.
  • Managed and mentored a team of seven data engineers and developers.
Technologies: Foundry, PySpark, SQL, PostgreSQL, REST APIs, Data Integration, Data Pipelines, ETL, Big Data, Data Engineering, Databricks, Python, TypeScript, Ontologies, Data Quality Management, Data Governance, Spark, Data Modeling, Technical Leadership

Data Engineer | Technical Leader

2019 - 2022
Airbus
  • Developed and maintained a data hub that contains all data related to the bill of materials of more than 11,400 aircraft and their exchange using the Palantir Foundry data connection and repository and PySpark.
  • Integrated data into Palantir Foundry, combining manufacturing, sales, engineering, maintenance, inflight, suppliers, and client data sources while ensuring compliance with the company data governance standards.
  • Cleaned, transformed, and connected integrated datasets to generate a trusted, harmonized, and healthy data catalog to be consumed by different use cases.
  • Reviewed code repository pull requests to guarantee quality code delivery and challenged six data engineers to apply best practices and maintain high-level performance.
Technologies: Foundry, PySpark, Big Data, ETL, SQL, Data Governance, Data Quality Management, Data Pipelines, Data Integration, PostgreSQL, REST APIs, Python, Data Engineering, Spark, Data Modeling, Technical Leadership

Data Engineer

2017 - 2019
Airbus
  • Designed and developed data pipeline transformations to track parts locations and maintenance contracts using the Palantir Foundry repository and PySpark.
  • Produced target datasets that are high-quality, relevant, and frequently updated to feed dashboards.
  • Built a tailored dashboard using target datasets in Palantir Foundry Slate.
  • Collaborated with functional teams to deliver and demonstrate user story development on a 3-week sprint interval.
Technologies: Foundry, PySpark, Python, SQL, PostgreSQL, REST APIs, Big Data, ETL, Data Pipelines, JavaScript, CSS, Markdown, Continuous Integration (CI), Continuous Deployment, Agile Sprints, Data Integration, Code Review, Spark, Data Modeling

Experience

Contextual Chatbot

A chatbot that showcases how generative AI can enhance the effectiveness of electronic product certification.

Before commencing production, the certifier must ensure that the product complies with regulations. By integrating retrieval-augmented generation, LangChain, Chroma DB, Streamlit, and ChatGPT 4, I developed a chatbot with exceptional abilities to comprehend conversations and provide relevant responses based on extensive regulatory articles.

Education

2015 - 2017

Master's Degree in Mathematics and Computer Science

Université de Lorraine - Metz, France

Certifications

NOVEMBER 2023 - PRESENT

Palantir Foundry Data Engineer Professional

Palantir

Skills

Libraries/APIs

PySpark, REST APIs

Tools

ChatGPT

Languages

Python, SQL, TypeScript, JavaScript, CSS, Markdown

Frameworks

Spark, Streamlit

Paradigms

ETL, Business Intelligence (BI), Continuous Integration (CI), Continuous Deployment

Storage

PostgreSQL, Data Pipelines, Data Integration

Platforms

Databricks

Other

Foundry, Data Engineering, Ontologies, Technical Leadership, Big Data, Data Warehousing, Data Modeling, Algorithms, Mathematics, Artificial Intelligence (AI), Agile Sprints, Code Review, Data Governance, Data Quality Management, Retrieval-augmented Generation (RAG), Generative Artificial Intelligence (GenAI), OpenAI, LangChain, ChromaDB

Collaboration That Works

How to Work with Toptal

Toptal matches you directly with global industry experts from our network in hours—not weeks or months.

1

Share your needs

Discuss your requirements and refine your scope in a call with a Toptal domain expert.
2

Choose your talent

Get a short list of expertly matched talent within 24 hours to review, interview, and choose from.
3

Start your risk-free talent trial

Work with your chosen talent on a trial basis for up to two weeks. Pay only if you decide to hire them.

Top talent is in high demand.

Start hiring