
Benjamin Breton
Verified Expert in Engineering
Data Scientist and Developer
Benjamin is passionate about data science and enjoys operating in different sectors. His mission is to identify business needs, design an adapting solution, and create value from data. Benjamin has prolific professional experience and has collaborated with 25 startups and large companies during 35 missions.
Portfolio
Experience
Availability
Preferred Environment
Python 3, TensorFlow, Scikit-learn, Pandas, Flask
The most amazing...
...thing I've achieved is the state-of-the-art result on an OCR document, reducing response time by 75%.
Work Experience
Senior Data Scientist
Mindee
- Directed the continuous improvement of a receipt-processing API that extracts essential information from images. Reduced response time and memory footprint by 75% and improved accuracy.
- Developed deep-learning computer vision algorithms for document processing in TensorFlow, such as OCR, segmentation, and classification.
- Designed synthetic data generators to train these models without manually labeled data.
- Created a cleaning tool to improve data quality automatically.
Data Scientist
Orange Bank
- Managed a team of two data scientists for a fraud detection task. Tested, supervised (XGBoost), and unsupervised (auto-encoders) algorithms with financial analysts and achieved a recall of 85%.
- Developed NLP algorithms to improve conversational frameworks like Rasa and Watson, including sentiment analysis, entity extraction, and intent classification.
- Aggregated and cleaned online posts from various sources, such as Twitter, Facebook, app stores, and blogs, to prepare training corpora adapted to the mobile banking industry.
- Designed a social media post analysis tool for the marketing team.
Data Scientist
Clustaar
- Developed an NLP platform in French and English using Python and Scala.
- Built an entity and intents extractor to populate chatbot conversations automatically and reduce the bot design time.
- Installed and optimized a parallel calculus framework, Spark, to achieve the NLP tools' scalability.
IT Consultant
Mazars USA
- Developed a fraud-detection system using machine learning.
- Completed IT general-control audits, including security review, risk assessment, and automation of these processes.
- Performed consulting technology missions, such as data mining and penetration testing in the energy and financial sectors.
Experience
Twitter Dashboard | French 2017 Elections
https://bbreton3.github.io/big-bang-data/Discrete Simulation Monte Carlo
Skills
Languages
Python 3, Scala, Excel VBA, Fortran, SQL, Python
Libraries/APIs
TensorFlow, Scikit-learn, Pandas, SciPy, NumPy, Rasa NLU
Other
Machine Learning, Data Analysis, Natural Language Processing (NLP), Computer Vision, GPT, Generative Pre-trained Transformers (GPT), Statistics, Time Series, Fraud Audits, Know Your Customer (KYC), Numerical Methods, Simulations, Stochastic Modeling, Mechanical Engineering, Fluid Mechanics, Vibration Analysis, Physics, Mathematics, Calculus, Engineering, Chemistry, Linear Algebra, Advanced Physics, Analytics, API Integration
Frameworks
Flask, Spark
Paradigms
Anomaly Detection, Data Science
Tools
Rasa.ai
Education
Master's Degree in Mechanical Engineering
The Georgia Institute of Technology - Atlanta, USA
Master's Degree in Mechanical Engineering
National School of Arts and Crafts - Paris, France