
Mouad Khouy
Verified Expert in Engineering
Data Engineer and Developer
Colomiers, France
Toptal member since May 28, 2024
Mouad is a senior data engineer with 7+ years of experience delivering end-to-end big data use cases for Fortune 500 companies. He specializes in the Palantir Foundry platform and PySpark and skillfully designs, builds, and maintains data pipelines. He achieved a 90% performance boost by auditing and performance-tuning a complex PySpark data pipeline. Mouad enables businesses to make data-driven decisions and mentors engineers to apply best practices and maintain high-level performance.
Portfolio
Experience
- Foundry - 7 years
- PostgreSQL - 7 years
- PySpark - 7 years
- Python - 7 years
- Data Pipelines - 7 years
- ETL - 7 years
- Data Engineering - 7 years
- Ontologies - 5 years
Availability
Preferred Environment
Foundry, PySpark, ETL, Data Engineering, Ontologies, Python, TypeScript, PostgreSQL, Databricks, Data Integration
The most amazing...
...thing I've done is audit and performance-tune a complex PySpark data pipeline, remarkably improving performance by around 90%.
Work Experience
Data Engineer | Senior Consultant
Capgemini
- Collaborated with clients and local teams to deliver modern data products and build relationships.
- Analyzed current business practices, processes, and procedures and identified future opportunities for leveraging Foundry services and implementing effective metrics and monitoring processes.
- Translated business problems into Foundry operational improvements and end user solutions in collaboration with internal and external stakeholders.
- Capitalized on Palantir Foundry data engineering best practices and created hands-on shares with Palantir Foundry data engineers.
- Animated an internal Palantir Foundry community of more than 225 members and demonstrated new Palantir Foundry releases.
Senior Data Engineer | Technical Leader
Stellantis
- Designed and implemented an end-to-end Palantir Foundry solution for car claims management.
- Collected and integrated cars and claims data into Foundry using data connections from disparate and various sources such as SQL databases, cloud storage, and REST APIs.
- Implemented data transformations in PySpark and Pipeline Builder to derive new datasets and create ontology objects.
- Maintained a qualified data pipeline by integrating data expectations and data health checks.
- Created Palantir Object View and Workshop applications used by more than 450 users worldwide to take actions and interact with the ontology.
- Integrated the automatic classification of claims based on user rules defined using the Foundry Rules tool.
- Established notifications and emails sent to users and car dealers based on object changes and users' actions.
- Managed and mentored a team of seven data engineers and developers.
Data Engineer | Technical Leader
Airbus
- Developed and maintained a data hub that contains all data related to the bill of materials of more than 11,400 aircraft and their exchange using the Palantir Foundry data connection and repository and PySpark.
- Integrated data into Palantir Foundry, combining manufacturing, sales, engineering, maintenance, inflight, suppliers, and client data sources while ensuring compliance with the company data governance standards.
- Cleaned, transformed, and connected integrated datasets to generate a trusted, harmonized, and healthy data catalog to be consumed by different use cases.
- Reviewed code repository pull requests to guarantee quality code delivery and challenged six data engineers to apply best practices and maintain high-level performance.
Data Engineer
Airbus
- Designed and developed data pipeline transformations to track parts locations and maintenance contracts using the Palantir Foundry repository and PySpark.
- Produced target datasets that are high-quality, relevant, and frequently updated to feed dashboards.
- Built a tailored dashboard using target datasets in Palantir Foundry Slate.
- Collaborated with functional teams to deliver and demonstrate user story development on a 3-week sprint interval.
Experience
Contextual Chatbot
Before commencing production, the certifier must ensure that the product complies with regulations. By integrating retrieval-augmented generation, LangChain, Chroma DB, Streamlit, and ChatGPT 4, I developed a chatbot with exceptional abilities to comprehend conversations and provide relevant responses based on extensive regulatory articles.
Education
Master's Degree in Mathematics and Computer Science
Université de Lorraine - Metz, France
Certifications
Palantir Foundry Data Engineer Professional
Palantir
Skills
Libraries/APIs
PySpark, REST APIs
Tools
ChatGPT
Languages
Python, SQL, TypeScript, JavaScript, CSS, Markdown
Frameworks
Spark, Streamlit
Paradigms
ETL, Business Intelligence (BI), Continuous Integration (CI), Continuous Deployment
Storage
PostgreSQL, Data Pipelines, Data Integration
Platforms
Databricks
Other
Foundry, Data Engineering, Ontologies, Technical Leadership, Big Data, Data Warehousing, Data Modeling, Algorithms, Mathematics, Artificial Intelligence (AI), Agile Sprints, Code Review, Data Governance, Data Quality Management, Retrieval-augmented Generation (RAG), Generative Artificial Intelligence (GenAI), OpenAI, LangChain, ChromaDB
How to Work with Toptal
Toptal matches you directly with global industry experts from our network in hours—not weeks or months.
Share your needs
Choose your talent
Start your risk-free talent trial
Top talent is in high demand.
Start hiring