Derrick Bakhuis, Developer in Rotterdam, Netherlands

Derrick Bakhuis

Verified Expert in Engineering

Data Warehousing Developer

Rotterdam, Netherlands

Toptal member since October 13, 2021

Bio

Derrick is a data engineer with an academic background focused on building advanced data platforms in Azure. He specializes in Azure Cloud and has built, migrated, and maintained data platforms for different types of organizations. His ability to adapt, coach, and deliver sets him apart from the rest. Over the past three years, his customers have rated him 8.7 out of 10.

Portfolio

Inlichtingenbureau
Databricks, Azure DevOps, Azure Synapse, Scrum Master, UX Research...
Croonwolter & Dros
Azure Synapse, Azure Data Lake, Databricks, Delta Lake, Data Warehousing, Azure...
Marlink
Databricks, PySpark, Python, CI/CD Pipelines, EventHub, Streaming Data...

Experience

  • Python - 6 years
  • Data Engineering - 5 years
  • Data Warehousing - 5 years
  • Databricks - 4 years
  • Database Design - 4 years
  • Big Data Architecture - 4 years
  • Azure DevOps - 4 years
  • PySpark - 2 years

Availability

Part-time

Preferred Environment

Azure, Databricks, Spark, Python, Azure Data Lake, Azure Data Factory (ADF), Delta Lake, PostgreSQL, GitHub, Azure DevOps

The most amazing...

...project I've delivered is a migration to Azure Databricks, coaching a team of eight, setting up CI/CD pipelines, and making use of cutting-edge frameworks.

Work Experience

Data Architect | Engineer | Scrum Master

2021 - 2023
Inlichtingenbureau
  • Developed a data mesh for a government organization that delivers data models and Power BI reports (embedded) to 200+ municipalities.
  • Designed and led UI/UX workshops to define KPIs and dashboard requirements with over 20 end users.
  • Helped the data engineers build up knowledge about Apache Spark and Delta Lake on the Azure Databricks platform. Now, they are a completely self-sufficient team.
Technologies: Databricks, Azure DevOps, Azure Synapse, Scrum Master, UX Research, Microsoft Power BI, Data Mesh, Data Warehousing, Azure, Data Engineering, Azure Databricks, Apache Spark, Apache Maven, Azure Blob Storage API

Senior Data Engineer

2021 - 2021
Croonwolter & Dros
  • Migrated a pilot that used Azure SQL databases to a production system using Azure Synapse, Databricks, IoT Hubs, and ADLS.
  • Helped design the new architecture of the data platform to be event-driven during the follow-up project. Built it using Delta Lake and optimized performance and costs.
  • Coached the project manager in monitoring the solution through the Synapse pipelines and Spark SQL notebooks.
Technologies: Azure Synapse, Azure Data Lake, Databricks, Delta Lake, Data Warehousing, Azure, Data Engineering, Azure Databricks, Continuous Integration (CI), Continuous Delivery (CD)

Databricks Architect and Lead Data Engineer

2021 - 2021
Marlink
  • Architected and implemented a migration from Azure Data Lake Analytics (ADLA) to Azure Databricks. Together with a team of eight, we migrated 100+ U-SQL scripts and several terabytes of data using Spark and the Delta Framework.
  • Implemented Delta Lake and combined batch and streaming workloads. Rolled out the deployment of libraries (pip and Maven) through development, test, acceptance, and production using CI/CD pipelines.
  • Coached a team of eight in using Python, Spark, and the Delta Framework. Provided workshops aimed at the development workflow. The coaching sessions were focused on helping individual members of the team make the most of the platform.
Technologies: Databricks, PySpark, Python, CI/CD Pipelines, EventHub, Streaming Data, Big Data Architecture, Scrum, Data Warehousing, Azure, Data Engineering, Azure Databricks, Continuous Integration (CI), Continuous Delivery (CD), Apache Spark, Azure Blob Storage API

Senior Data Engineer

2020 - 2021
ABN AMRO Bank
  • Built ELT pipelines that processed new data sources dynamically at various intervals. Business analysts could easily add new data to the sandbox environment and obtain their results within a 15-minute interval.
  • Prepared several T-SQL templates that the development team could reuse. These templates allowed analysts to safely develop code and easily manage the sandbox environment.
  • Coached analysts in using the templates and ensured that the knowledge handover was delivered through workshops and documented.
Technologies: T-SQL (Transact-SQL), SQL Stored Procedures, Azure SQL Databases, Azure Data Factory (ADF), Data Warehousing, Azure, Databricks, Data Engineering, Azure Databricks

Senior Consultant

2018 - 2021
Motion10
  • Developed a standard for working with Azure Databricks and trained colleagues in Spark architecture.
  • Developed a machine learning operations (MLOps) proposition by gathering stakeholders, building pilot projects, and training colleagues to use the technology stack.
  • Got nominated for the culture award for promoting and facilitating a learning environment within the organization.
Technologies: Azure, Databricks, Azure DevOps, Azure Machine Learning, Python, Data Warehousing, Data Engineering, Azure Blob Storage API

Senior Data Engineer

2020 - 2020
LOI
  • Migrated an on-premises data warehouse to an Azure data warehouse. The on-premises solution was built mainly using SSIS packages and SQL stored procedures. All existing and new functionality needed to be migrated to Azure.
  • Built the data warehouse using metadata-driven extraction, templates for automation, automatic scaling, and CI/CD for development products, and built Kimball data models.
  • Coached the data architect in monitoring the data platform. Also helped deploy new Tabular data models in Azure Analysis Services to test and production.
Technologies: Azure Analysis Services, Azure Data Lake, SQL Stored Procedures, Azure Data Factory (ADF), Coaching, Azure Logic Apps, CI/CD Pipelines, SQL Server Integration Services (SSIS), Data Warehousing, Azure, Data Engineering

Junior Data Engineer

2018 - 2020
TBI Holdings
  • Supported the BI department in managing and building new data models for several internal stakeholders.
  • Worked in a team of eight project members and made use of the Scrum project methodology on a daily basis.
  • Oversaw daily ELT workloads, worked on bug fixes, and used CI/CD pipelines to improve production.
Technologies: SQL Stored Procedures, Microsoft Power BI, Azure SQL Databases, Azure Logic Apps, Azure Data Factory (ADF), Azure Active Directory, Azure Active Directory Graph API, Data Warehousing, Azure, Data Engineering, Continuous Integration (CI), Continuous Delivery (CD)

Experience

Web Scraper for Extraction of Quarterly Financial Statements

https://github.com/TheDataDexter/StockScraper
A Python-based web scraper using the BS4 and Selenium frameworks for extracting data from publicly traded companies. Stock price data is readily available via APIs; however, the quarterly earnings are not. This project is a component of a larger personal project.

Nubovi

http://www.nubovi.com
A cloud cost management tool that helps organizations better allocate their cloud costs to business processes. Additionally, we integrated OpenAI models to generate recommendations based on core resource metrics. This tool is now being used by a handful of organizations that spend over 10 million dollars on Azure.

Education

2016 - 2018

Master's Degree in Business Information Management

Rotterdam School of Management, Erasmus University - Rotterdam, The Netherlands

2012 - 2016

Bachelor's Degree in International Business

Rotterdam Business School - Rotterdam, The Netherlands

Certifications

OCTOBER 2021 - PRESENT

How to Coach Effectively

Harmoniq

JANUARY 2021 - PRESENT

Databricks Certified Associate Developer for Apache Spark 3.0

Databricks

JANUARY 2021 - PRESENT

Associate SQL Analyst

Databricks

AUGUST 2020 - AUGUST 2022

Azure Data Engineer

Microsoft

JULY 2020 - JULY 2022

Azure AI Engineer

Microsoft

JANUARY 2020 - PRESENT

MCSE Data Management and Analytics

Microsoft

DECEMBER 2019 - PRESENT

MCSA: SQL 2016 Database Development

Microsoft

SEPTEMBER 2019 - PRESENT

Data Streaming (Kafka and Spark Streaming)

Udacity

AUGUST 2019 - PRESENT

Applied Machine Learning in Python

University of Michigan | via Coursera

JULY 2019 - JULY 2021

Azure Data Scientist

Microsoft

MARCH 2018 - PRESENT

Artificial Intelligence Bootcamp

Turing Society

OCTOBER 2017 - PRESENT

Web Development Basics

Turing Society

Skills

Libraries/APIs

PySpark, Beautiful Soup, Azure Active Directory Graph API, Spark Streaming, Azure Blob Storage API, Azure Cognitive Services, React

Tools

Spark SQL, Azure Logic Apps, GitHub, Azure Machine Learning, Azure Search, Microsoft Power BI, Apache Maven

Languages

T-SQL (Transact-SQL), Stored Procedure, Python, Scala

Frameworks

Spark, Spark Structured Streaming, Apache Spark, Selenium, Django REST Framework

Paradigms

Database Design, Azure DevOps, Scrum, Continuous Integration (CI), Continuous Delivery (CD)

Platforms

Azure, Databricks, Azure Synapse, Azure SQL Data Warehouse, Docker, Amazon Web Services (AWS), Dedicated SQL Pool (formerly SQL DW)

Storage

Azure SQL Databases, SQL Server 2016, Azure SQL, SQL Stored Procedures, Azure Active Directory, PostgreSQL, Azure Cosmos DB, SQL Server Integration Services (SSIS)

Other

Azure Data Lake, Azure Data Factory (ADF), Indexing, Database Optimization, Delta Lake, Azure Databricks, Data Warehousing, Data Engineering, Streaming Data, Big Data Architecture, Azure Analysis Services, Coaching, CI/CD Pipelines, EventHub, Data Mesh, Business Cases, Machine Learning, Azure Cognitive Search, Data Science, Scrum Master, UX Research
