Bernardt Duvenhage, Developer in Rocky Mountain House, AB, Canada
Bernardt is available for hire
Hire Bernardt

Bernardt Duvenhage

Verified Expert  in Engineering

Machine Learning Developer

Rocky Mountain House, AB, Canada

Toptal member since October 21, 2020

Bio

Bernardt is passionate about developing technology that fundamentally improves lives and broadens our knowledge. He led teams and developed software for computer vision, natural language understanding, modeling and simulation, and computer graphics projects. Bernardt has experience programming in C++ and Python, using frameworks such as scikit-learn and PyTorch for machine learning and deep learning, transformers, Torchvision, CUDA, OpenGL, OpenSceneGraph, OpenCV, and NLTK.

Portfolio

Pipio.ai
Artificial Intelligence (AI), Software Development...
Helm.africa
Data Analytics, REST APIs, Google Cloud, BigQuery, Google BigQuery...
Council for Scientific and Industrial Research
3D Math, OpenGL, Embedded Software, Assembly, OpenSceneGraph, GLSL, OpenCV, C++...

Experience

  • C++ - 20 years
  • Linux - 15 years
  • Computer Vision - 10 years
  • Machine Learning - 10 years
  • Computer Graphics - 10 years
  • Python - 8 years
  • PyTorch - 5 years
  • NLU - 4 years

Availability

Part-time

Preferred Environment

C++, Python, MacOS, Linux

The most amazing...

...project I've worked on is a digital twin generative AI service.

Work Experience

Head of R&D | Co-founder

2020 - 2024
Pipio.ai
  • Co-founded a generative AI company called Pipio.ai as the technical co-founder and developed the technology that helped the company attract the first round of funding and grow.
  • Built a generative AI company using in-house developed algorithms and models and the backend render service.
  • Collaborated with my co-founder and successfully raised capital and grew the R&D team and company.
Technologies: Artificial Intelligence (AI), Software Development, Generative Artificial Intelligence (GenAI), Minimum Viable Product (MVP), Team Leadership, Product Discovery, Full-stack, Model Deployment, Speech to Text, Speech Recognition, Text to Speech (TTS), Automatic Speech Recognition (ASR), Speech to Text AI, Data Annotation, Audio Processing, Machine Learning Operations (MLOps), Whisper, Azure Text to Speech, Real-time Audio Processing, Data Science, Data Cleansing, Data Analysis, Generative Adversarial Networks (GANs), Image Generation, Stable Diffusion, Text to Image AI, Containerization, Architecture, Kubernetes, AI Modeling, JavaScript, LoRa, AI Prompts, Text to Image, Prompt Engineering, DALL-E, AI Art Visualization, OpenCV, Unity, Hand Tracking, FastAPI, OpenAI API, Node.js, Azure, AI Content Creation, Transformers, Diffusion Models, Diffusion-based AI Models, AI Chatbots, Amazon Web Services (AWS), Speech Analytics, Graphics Processing Unit (GPU), CTO, Agile, CI/CD Pipelines, Technical Leadership, Data Pipelines, Hugging Face, Open-source LLMs

Machine Learning Lead

2017 - 2023
Helm.africa
  • Developed a natural language processing (NLP) and computer vision service to build task-oriented conversational agents for low-resource languages. Technologies include transformers, PyTorch, Torchvision, Flask-RESTful, and PostgreSQL.
  • Created intent classification, sentiment, and entity extraction models that rely on various deep learning and classical machine learning techniques to accommodate various languages. Used transformers, scikit-learn, PyTorch, and TensorFlow.
  • Developed a machine vision-based quality assurance application with a machine vision camera interface. The technologies include Torchvision, Flask-RESTful, GenICam, and React. Used deep transfer learning to improve sample efficiency of clients' data.
Technologies: Data Analytics, REST APIs, Google Cloud, BigQuery, Google BigQuery, Deep Neural Networks (DNNs), Neural Networks, Data Science, Generative Pre-trained Transformers (GPT), Natural Language Processing (NLP), Artificial Intelligence (AI), Google Cloud Platform (GCP), Flask-RESTful, Python, Computer Vision, Natural Language Understanding (NLU), Machine Learning, Sentiment Analysis, Text Classification, Image Classification, Leadership, GPU Computing, Image Recognition, Speech to Text, Text to Speech (TTS), Google Speech API, Speech Recognition, Large Language Models (LLMs), Team Leadership, Full-stack, Model Deployment, Technical Writing, Writing & Editing, Automatic Speech Recognition (ASR), Speech to Text AI, Data Annotation, Audio Processing, Machine Learning Operations (MLOps), Call Centers, Real-time Audio Processing, Data Cleansing, Data Analysis, Optical Character Recognition (OCR), Containerization, Architecture, Kubernetes, AI Modeling, JavaScript, AI Prompts, Prompt Engineering, OpenCV, ChatGPT, FastAPI, OpenAI API, Node.js, Transformers, AI Chatbots, Amazon Web Services (AWS), Speech Analytics, NVIDIA NeMo, Graphics Processing Unit (GPU), Agile, CI/CD Pipelines, Technical Leadership, Data Pipelines, TensorFlow, Hugging Face, Open-source LLMs

Research Group Leader | Principal Research Scientist

2014 - 2017
Council for Scientific and Industrial Research
  • Developed a real-time image processing framework as well as image enhancement, target detection, object tracking, and state estimation algorithms.
  • Developed software interfaces to hardware devices like cameras, pan-tilt-zoom systems, communication radios, and real-time clocks.
  • Led the company's image processing team of full-time employees and students, published some papers, and helped with talent management and screening.
Technologies: 3D Math, OpenGL, Embedded Software, Assembly, OpenSceneGraph, GLSL, OpenCV, C++, Real-time Vision Systems, Leadership, GPU Computing, Image Recognition, Video Capture, Facial Tracking, Facial Recognition, Team Leadership, Full-stack, Model Deployment, Technical Writing, Writing & Editing, Data Science, Data Cleansing, Data Analysis, Architecture, Internet of Things (IoT), AI Modeling, Unity, Graphics Processing Unit (GPU), Agile, Technical Leadership, Data Pipelines, TensorFlow

Senior Research Scientist, Optronic Sensor Systems

2008 - 2014
Council for Scientific and Industrial Research
  • Developed models for a physically based optronics scene simulator. The models were developed in C++ and an in-house 3D modeling tool.
  • Developed physically based renderers for the long and medium-wave infrared bands. The full software and hardware accelerated renderers were developed in C++ and OpenGL with GLSL. The rendering algorithms ranged from simple to path tracing.
  • Developed physically based renderers for the short wave (reflective) and visual bands. The software and hardware accelerated renderers were developed in C++ and OpenGL with GLSL. The rendering algorithms ranged from simple to path tracing.
Technologies: 3D Math, Rendering, OpenGL, C++, Real-time Vision Systems, GPU Computing, Video Capture, Facial Tracking, Facial Recognition, Full-stack, Model Deployment, Technical Writing, Writing & Editing, Data Cleansing, Data Analysis, Architecture, Internet of Things (IoT), OpenCV, Graphics Processing Unit (GPU), Agile, Data Pipelines

Senior Research Scientist, Modelling and Simulation

2004 - 2008
Council for Scientific and Industrial Research
  • Developed a faster-than-real-time distributed modeling and simulation framework for wargaming-type simulations. The simulation framework was implemented in C++ and employed TCP communication between the nodes.
  • Built vehicle and equipment models for wargaming-type simulations. The models ranged from behavioral to physically based and were implemented in C++.
  • Created a 3D simulation viewer and analysis tool using OpenSceneGraph and osgEarth.
Technologies: Simulations, Algorithms, OpenSceneGraph, Modeling, OpenGL, C++, GPU Computing, NVIDIA CUDA, Video Capture, Architecture, Graphics Processing Unit (GPU), Agile, Data Pipelines

Talking Head | Avatar API

https://www.pipio.ai
In-house AI and API for talking head/avatar rendering where I created a photo-realistic AI for creating stock and custom avatars for a video editing and content generation application. I also developed the cloud service to make the AI available to the application team.

A Natural Language Understanding and Computer Vision Cloud Service

A Python-based cloud service for training and deploying natural language understanding and computer vision models for task-oriented conversational agents. I was responsible for developing the data science workflow, model, and data management as well as the production service. I was also responsible for implementing and training a number of models for image segmentation and classification, intent detection, sentiment, and entity extraction. The technologies used include PyTorch, Torchvision, TensorFlow, scikit-learn, Pandas, Swagger/OpenAPI, Flask, and PostgreSQL.

Real-time Image Processing Framework

An image processing and machine vision framework implemented in C++, CUDA, and GLSL and optimized for real-time execution on multiple simultaneous high-resolution camera streams. The framework also included my own long-range and low light image enhancement algorithms, image stabilization, object detection, and target tracking implementations as well as software interfaces to hardware devices like machine vision cameras, pan-tilt-zoom systems, and communication radios.

Physically-based Path Tracing Renderer

A physically-based renderer for indoor scenes written in C++, where the renderer used path tracing and other ray tracing variants to create simulated images of indoor and outdoor scenes for a surveillance modeling application. The implementation included area and skylights, various bidirectional scattering functions, and acceleration structures to support large complex scenes, as well as concepts such as flux, radiance, irradiance, and brightness.
2008 - 2015

PhD in Computer Science

University of Pretoria - South Africa

SEPTEMBER 2020 - PRESENT

Deeplearning.ai NLP Specialization (In Progress: 3/4 Course Certificates Completed)

Coursera

DECEMBER 2019 - PRESENT

Deep Reinforcement Learning for Enterprise Nanodegree

Udacity

SEPTEMBER 2018 - OCTOBER 2020

Google Cloud Certified - Professional Data Engineer

Google Cloud

AUGUST 2018 - PRESENT

Deep Learning 5-course Specialization by Deeplearning.ai

Coursera

Libraries/APIs

PyTorch, OpenCV, REST APIs, Flask-RESTful, OpenGL, Scikit-learn, Natural Language Toolkit (NLTK), NumPy, Pandas, TensorFlow, Google Speech API, OpenAI API, Node.js

Tools

BigQuery, Whisper, AI Prompts, OpenSceneGraph, Torchvision, ChatGPT

Languages

Python, C++, GLSL, Assembly, JavaScript

Paradigms

Agile, ETL

Platforms

Linux, MacOS, Google Cloud Platform (GCP), NVIDIA CUDA, Kubernetes, Docker, Azure, Amazon Web Services (AWS), NVIDIA NeMo

Storage

Data Pipelines, Google Cloud

Frameworks

Unity

Other

NLU, Computer Vision, Computer Graphics, Machine Learning, Computer Science, Artificial Intelligence (AI), Natural Language Processing (NLP), Data Science, Neural Networks, Text Classification, Image Classification, AI Design, Generative Adversarial Networks (GANs), Generative Pre-trained Transformers (GPT), Generative Artificial Intelligence (GenAI), Image Generation, Stable Diffusion, Architecture, AI Modeling, AI Content Creation, CI/CD Pipelines, Technical Leadership, Hugging Face, Modeling, Transformers, Google BigQuery, Data Analytics, Simulations, Facial Recognition, Facial Tracking, Video Capture, Image Recognition, Rendering, Visual Computing, Sentiment Analysis, Real-time Vision Systems, Leadership, GPU Computing, Speech to Text, Text to Speech (TTS), Speech Recognition, Large Language Models (LLMs), Minimum Viable Product (MVP), Team Leadership, Full-stack, Model Deployment, Technical Writing, Writing & Editing, Automatic Speech Recognition (ASR), Speech to Text AI, Data Annotation, Audio Processing, Machine Learning Operations (MLOps), Azure Text to Speech, Real-time Audio Processing, Data Cleansing, Data Analysis, Text to Image AI, Containerization, Internet of Things (IoT), Text to Image, Prompt Engineering, Hand Tracking, FastAPI, Diffusion Models, Diffusion-based AI Models, AI Chatbots, Speech Analytics, Graphics Processing Unit (GPU), CTO, Open-source LLMs, Mathematics, Physics, Molecular Biology, Hardware Development, Data Engineering, Deep Reinforcement Learning, Deep Learning, Natural Language Understanding (NLU), Embedded Software, Image Processing, Hardware Drivers, Graphical User Interface (GUI), Ray Tracing, Algorithms, 3D Math, Deep Neural Networks (DNNs), Software Development, Product Discovery, Text to Video, Call Centers, Optical Character Recognition (OCR), LoRa, DALL-E, AI Art Visualization

Collaboration That Works

How to Work with Toptal

Toptal matches you directly with global industry experts from our network in hours—not weeks or months.

1

Share your needs

Discuss your requirements and refine your scope in a call with a Toptal domain expert.
2

Choose your talent

Get a short list of expertly matched talent within 24 hours to review, interview, and choose from.
3

Start your risk-free talent trial

Work with your chosen talent on a trial basis for up to two weeks. Pay only if you decide to hire them.

Top talent is in high demand.

Start hiring