Nizar Malkiya
Verified Expert in Engineering
Data Scientist and Developer
Paris, France
Toptal member since May 3, 2021
Nizar is an engineer with 10+ years of experience in research, design, and implementation of data solutions. He is passionate about crafting quality software to transform large amounts of data into easily understood and useful insights. Nizar excels in all phases of this quest: data collection, visualization, and modeling; system architecture; algorithm design; and software deployment. Clients include healthcare, research, insurance, telecom, aerospace, advertising, consulting, and education.
Portfolio
Experience
Availability
Preferred Environment
Jupyter Notebook, Visual Studio Code (VS Code), Python, MacOS
The most amazing...
...project I've done was helping an advertising company exploit big data and machine learning techniques to optimize segmentation of prospecting campaigns.
Work Experience
Data Science Teacher
Le Wagon
- Presented lectures on statistics, Python, machine learning, and deep learning.
- Supported students during their daily exercises throughout boot camp sessions.
- Guided students during their two-week final projects, from the inception to the deployment of a machine learning model (NLP, recommendation systems, time series predictions, generative models, computer vision, etc.).
- Participated in the training of more than 400 students and mentored more than 50 final projects.
Data Science Tutor
University of London
- Provided online support to students of several Data Science MSc program modules, such as Big Data Analysis, Statistics and Statistical Data Mining, and Artificial Intelligence.
- Participated in webinars to answer questions from students of the Data Science MSc program modules.
- Graded the students' coursework and exams on the Data Science MSc modules.
Data Scientist
Pfizer - Manufacturing Operations Solutions
- Developed a time-series-based machine learning model for predictive maintenance of industrial equipment using Python and TensorFlow for a COVID-19 project from a global pharmaceutical company.
- Built several web apps to analyze and visualize the available sensor data with Python and Streamlit.
- Created preprocessing routines to collect, clean, and prepare the raw data using Python.
Data Scientist
Datafolio
- Collected and integrated different types of data related to traffic and weather-related accidents using SQL, MongoDB, Python, and Dataiku.
- Transformed geographical time series data to project them on the same referential model using Python, Spark, and Dataiku.
- Developed a road-risk model based on environmental conditions.
Senior Data Specialist
Roland Berger
- Developed data analysis routines using Jupyter, Python, scikit-learn, and Keras. These included machine learning for explanation, prediction, and clustering, NLP for sentiment and topic extraction, and geographical data analysis.
- Developed data visualization applications using JavaScript, Leaflet, and Vue.
- Wrote Python scripts for scraping data from the web.
- Led data training sessions for consultants, covering Dataiku, scraping, and SQL.
Software Engineer and Data Scientist
Ve Global UK
- Built pipelines to store real-time data using Java, Spark, and HBase.
- Created tools to perform user segmentation using Jupyter, Python, and Spark.
- Developed APIs in C# to collect data generated by user interactions on the website.
Software Engineer
Be-Mobile
- Developed metrics in C# to evaluate the quality of the traffic data produced.
- Designed data visualization tools for the quality metrics, using JavaScript and D3.js.
- Evaluated the performance of an alternative data storage solution in Cassandra.
Software Engineer
Institut d’Astrophysique de Paris
- Developed tools in Python to check the data model for the science ground segment of the ESA Euclid mission.
- Built tools in JavaScript and D3.js to visualize the data model.
- Developed a testbed in C to assess and compare the performance of Berkley DB with Oracle and HBase storage solutions.
Software Engineer
Sisteer
- Co-developed Bus, a Java middleware that acted as a service bus for Sisteer's platform.
- Developed new components for Bus based on different protocols and technologies: HTTP, FTP, XML, web services, and SQL.
- Created graphical user interfaces for end users, using Java and Swing.
R&D Engineer
Sorbonne
- Designed a search engine in Java, using latent semantic analysis (LSA) techniques.
- Performed statistical analysis on a collection of one year of bank frauds, using MATLAB.
- Designed algorithms to perform clustering on a large dataset of bank frauds, using MATLAB and Java.
Integration Engineer
Wyde
- Integrated and deployed numerous new software releases.
- Identified and corrected bugs on current releases.
- Maintained the mapping between the current software releases and the Oracle databases.
Research Engineer
ENSTA Paris
- Measured the transmission coefficients of the channel with a VNA.
- Implemented signal processing and transformation using MATLAB.
- Performed statistical modeling—using MATLAB—of the channel's time response as a function of different parameters, such as antenna types and positions.
R&D Intern
GE Healthcare
- Designed algorithms in C for the extraction of blood vessels from high-contrast fluoroscopic images.
- Designed algorithms in C to register fluoroscopic images that have different contrast levels.
- Created a POC to showcase the enhancement of fluoroscopic images in angioplasty procedures.
Experience
Vélib Hourly Visualization
https://nidata.io/vizy/velibI developed a simple web page that provides an interactive map showing the localization of bikes (Vélibs) throughout the day. This data was collected from the official Vélib web page.
Education
Master's Degree in Telecommunications
Télécom ParisTech - Paris, France
Master's Degree in Information and Communication Technologies
Politecnico di Torino - Turin, Italy
Bachelor's Degree in Information and Communication Technologies
Università degli Studi di Perugia - Perugia, Italy
Certifications
Deep Learning Specialization
Coursera
Skills
Libraries/APIs
Scikit-Learn, Pandas, D3.js, Keras, Leaflet, Vue, TensorFlow, REST APIs
Tools
MATLAB, DataViz, Jupyter, Amazon SageMaker
Languages
Python, SQL, Java, JavaScript, C#, C, Batch, XSD, Snowflake
Frameworks
Spark, Hadoop, Streamlit
Platforms
Jupyter Notebook, Dataiku, Visual Studio Code (VS Code), Oracle, Docker, MacOS, Amazon Web Services (AWS), Google Cloud Platform (GCP)
Storage
Databases, HBase, Berkeley DB, Cassandra, MongoDB, NoSQL
Industry Expertise
Telecommunications
Paradigms
Real-time Systems
Other
Computer Science, Machine Learning, Natural Language Processing (NLP), Scraping, Web Scraping, Data Science, Generative Pre-trained Transformers (GPT), Mathematics, Signal Processing, Image Processing, Deep Learning, Physics, Electronics, Automatics, Robotics, Networks, Digital Communication, Coding, RF Electronics, Security, Satellite Images, Medical Applications, Software Integration, Medical Imaging, Statistics, Antenna Design, FTP, HTTP, Big Data, APIs, Image Recognition, Data Analysis, Data Scraping, SOAP, Predictive Modeling, Data Visualization, Time Series, Statistical Analysis, Data Engineering, Artificial Intelligence (AI)
How to Work with Toptal
Toptal matches you directly with global industry experts from our network in hours—not weeks or months.
Share your needs
Choose your talent
Start your risk-free talent trial
Top talent is in high demand.
Start hiring