
Badr Jaidi
Verified Expert in Engineering
Artificial Intelligence Developer
Badr is a data scientist specializing in natural language processing. He speaks four languages fluently and has been in the technology field for nearly a decade, where he has worked on a broad range of projects going from hardware to front-end development.
Portfolio
Experience
Availability
Preferred Environment
Visual Studio Code (VS Code), Unix
The most amazing...
...product I've developed is a topic modelling pipeline that handles bilingual text and brings insights to clients on hundred of thousands of documents.
Work Experience
Data Scientist
Plutoshift, Inc.
- Trained high accuracy time series classification models for a Fortune 500 company.
- Preprocessed huge amounts of complex time series hardware signals into an interpretable and trainable format.
- Used various complex time series transformations and machine learning techniques to do time series classification.
NLP Expert
Bhavik Muni
- Implemented an NLP solution that extracts actionable insights from YouTube-related text data.
- Efficiently extracted large amounts of text data from YouTube's platform.
- Designed an architecture that combines the NLP analytics and then extracted data to display insights live to the client.
Data Scientist
Ai Outcome
- Researched and developed a topic extraction model that led to the creation and sales of a new product.
- Developed an optimized and efficient bilingual French and English text processing pipeline.
- Used time series forecasting to help clients manage their resources more efficiently.
- Set up and managed a database server infrastructure to host hundreds of gigabytes of raw data.
Teacher Assistant
École de Technologie Supérieure
- Assisted researchers by conducting experiments in a THz research lab.
- Helped students learn to program in C by assisting them through the whole learning cycle, from the basics to making their first project.
- Helped students learn to use Linux platforms, install, manage, configure their networks, and use a bash script to automate them.
Software Developer
iBwave Solutions
- Developed features for an application used by hundreds of clients worldwide.
- Tracked and resolved bugs using Jira as a reporting tool.
- Contributed to quality insurance by testing the application thoroughly.
Associate Developer
Carrotsoftware.co.ltd
- Developed parts of web and iOS applications and set up databases for a diverse range of customers.
- Created a map application that smoothly displayed live data from millions of database rows.
- Maintained the company's internal chatbot regularly.
Experience
Fake News Classification
https://github.com/LiamNiisan/fake_news_detectionTwo models were trained and compared. A linear model with FastText and a neural model with TensorFlow. With the training data, the neural model gave the best results, but on manually annotated data from online news, FastText performed much better.
This showed that linear models are very good at generalizing, and neural models need to be trained on lots of good data to perform well.
Legal Corpus NER
https://github.com/LiamNiisan/legal-corpus-NERThe data was scrapped from BC's Court of Appeal and Supreme Court and was annotated using label-studio.
BERT for Hate Speech Detection
https://github.com/LiamNiisan/BERT-Fine-Tuning-Hate-Speech-DetectionBGC NASA Landslide Detection
https://github.com/LiamNiisan/BGC-NASA-landslide-detectionThe project consists of two parts:
1. News articles are extracted from online sources and then passed to a model that extracts the landslide's event properties.
2. The model extracts information from the articles: time, location, casualties, landslide category, and landslide trigger.
Skills
Languages
Python, C, SQL, C#, Swift, JavaScript, Java, PHP, HTML, Bash Script
Libraries/APIs
Natural Language Toolkit (NLTK), SpaCy, PyTorch, TensorFlow, jQuery
Other
Topic Modeling, Natural Language Processing (NLP), GPT, Generative Pre-trained Transformers (GPT), Computational Linguistics, BERT, Long Short-term Memory (LSTM), Machine Learning, Artificial Intelligence (AI), Deep Learning, Artificial Neural Networks (ANN), Full-stack, Electrical Engineering, Text Classification, QA Testing, fastText, Deep Neural Networks, FastAPI, Scraping, Annotations, Custom BERT, Waveforms, Cisco, GPU Computing, Time Series, Signal Processing, Information Retrieval
Tools
Gensim, MATLAB, Geocoder
Paradigms
Data Science, Agile
Platforms
Unix, Docker, Visual Studio Code (VS Code)
Frameworks
Bootstrap
Education
Master's Degree in Data Science
University of British Columbia - Vancouver, BC, Canada
Bachelor's Degree in Engineering
École de Technologie Supérieure - Montreal, QC, Canada