Benjamin Breton, Developer in South Bend, IN, United States
Benjamin is available for hire
Hire Benjamin

Benjamin Breton

Verified Expert  in Engineering

Data Scientist and Developer

Location
South Bend, IN, United States
Toptal Member Since
November 7, 2022

Benjamin is passionate about data science and enjoys operating in different sectors. His mission is to identify business needs, design an adapting solution, and create value from data. Benjamin has prolific professional experience and has collaborated with 25 startups and large companies during 35 missions.

Portfolio

Mindee
Python 3, TensorFlow
Orange Bank
Python 3, Scikit-learn, Rasa.ai, Rasa NLU, Pandas, SQL, Data Analysis...
Clustaar
Python 3, Scikit-learn, SciPy, NumPy, Scala, Spark, Python, Analytics...

Experience

Availability

Part-time

Preferred Environment

Python 3, TensorFlow, Scikit-learn, Pandas, Flask

The most amazing...

...thing I've achieved is the state-of-the-art result on an OCR document, reducing response time by 75%.

Work Experience

Senior Data Scientist

2019 - 2022
Mindee
  • Directed the continuous improvement of a receipt-processing API that extracts essential information from images. Reduced response time and memory footprint by 75% and improved accuracy.
  • Developed deep-learning computer vision algorithms for document processing in TensorFlow, such as OCR, segmentation, and classification.
  • Designed synthetic data generators to train these models without manually labeled data.
  • Created a cleaning tool to improve data quality automatically.
Technologies: Python 3, TensorFlow

Data Scientist

2017 - 2019
Orange Bank
  • Managed a team of two data scientists for a fraud detection task. Tested, supervised (XGBoost), and unsupervised (auto-encoders) algorithms with financial analysts and achieved a recall of 85%.
  • Developed NLP algorithms to improve conversational frameworks like Rasa and Watson, including sentiment analysis, entity extraction, and intent classification.
  • Aggregated and cleaned online posts from various sources, such as Twitter, Facebook, app stores, and blogs, to prepare training corpora adapted to the mobile banking industry.
  • Designed a social media post analysis tool for the marketing team.
Technologies: Python 3, Scikit-learn, Rasa.ai, Rasa NLU, Pandas, SQL, Data Analysis, Data Science

Data Scientist

2015 - 2017
Clustaar
  • Developed an NLP platform in French and English using Python and Scala.
  • Built an entity and intents extractor to populate chatbot conversations automatically and reduce the bot design time.
  • Installed and optimized a parallel calculus framework, Spark, to achieve the NLP tools' scalability.
Technologies: Python 3, Scikit-learn, SciPy, NumPy, Scala, Spark, Python, Analytics, API Integration, Data Science

IT Consultant

2014 - 2015
Mazars USA
  • Developed a fraud-detection system using machine learning.
  • Completed IT general-control audits, including security review, risk assessment, and automation of these processes.
  • Performed consulting technology missions, such as data mining and penetration testing in the energy and financial sectors.
Technologies: Fraud Audits, Anomaly Detection, Know Your Customer (KYC), Python 3, Excel VBA

Twitter Dashboard | French 2017 Elections

https://bbreton3.github.io/big-bang-data/
Developed a microservice-based app to track the evolution of topics and sentiments mentioned on Twitter during the 2017 French presidential elections. The topic modeling was updated daily based on the latest trends.

Discrete Simulation Monte Carlo

Developed a new method for the US Air Force to model fluid flow in a very low-pressure environment. I wrote a model to simulate a Couette flow between two plates in Fortran 90. Established an explicit coupling between the direct simulation Monte Carlo and the Navier-Stokes equation.

Languages

Python 3, Scala, Excel VBA, Fortran, SQL, Python

Libraries/APIs

TensorFlow, Scikit-learn, Pandas, SciPy, NumPy, Rasa NLU

Other

Machine Learning, Data Analysis, Natural Language Processing (NLP), Computer Vision, GPT, Generative Pre-trained Transformers (GPT), Statistics, Time Series, Fraud Audits, Know Your Customer (KYC), Numerical Methods, Simulations, Stochastic Modeling, Mechanical Engineering, Fluid Mechanics, Vibration Analysis, Physics, Mathematics, Calculus, Engineering, Chemistry, Linear Algebra, Advanced Physics, Analytics, API Integration

Frameworks

Flask, Spark

Paradigms

Anomaly Detection, Data Science

Tools

Rasa.ai

2012 - 2014

Master's Degree in Mechanical Engineering

The Georgia Institute of Technology - Atlanta, USA

2007 - 2012

Master's Degree in Mechanical Engineering

National School of Arts and Crafts - Paris, France