
Derrick Bakhuis
Verified Expert in Engineering
Data Warehousing Developer
Rotterdam, Netherlands
Toptal member since October 13, 2021
Derrick is a data engineer with an academic background focused on building advanced data platforms in Azure. He's specialized in Azure Cloud, and he has made, migrated, and maintained data platforms for different types of organizations. His ability to adapt, coach, and deliver sets him out from the rest. Over the past three years, his customers rated him with 8.7 of 10.
Portfolio
Experience
- Python - 6 years
- Data Engineering - 5 years
- Data Warehousing - 5 years
- Databricks - 4 years
- Database Design - 4 years
- Big Data Architecture - 4 years
- Azure DevOps - 4 years
- PySpark - 2 years
Preferred Environment
Azure, Databricks, Spark, Python, Azure Data Lake, Azure Data Factory (ADF), Delta Lake, PostgreSQL, GitHub, Azure DevOps
The most amazing...
...project I've delivered is a migration to Azure Databricks, coaching a team of eight, setting up CI/CD pipelines, and making use of cutting-edge frameworks.
Work Experience
Python Developer
BNG Bank
- Helped deliver a migration of one of the critical systems from on-premises to the cloud. Helped the team move on-premises workloads to Azure by porting vendor functionality to the API layer and event-driven systems.
- Built APIs using Python and Azure Functions, in combination with API management and data transportation layers such as Service Bus and Azure Data Factory.
- Delivered the migration on schedule, within one business day, and with zero service interruptions from users and downstream applications.
- Contributed to the implementation of an organization-wide monitoring framework and played a key role in setting up OpenTelemetry to achieve end-to-end observability.
Data Engineer
VvAA
- Built a metadata-driven data platform that has been used for important data migration from one operational system to another operational system for insurance.
- Created a report on the data for internal audit and integrated feedback from logs after a trial migration.
- Oversaw the implementation of a metadata-driven data platform for migrating the back end of insurance software from one vendor to another.
- Helped analysts through the data dictionary by building a simple HTML parser using Python, which was heavily used by analysts for developing metadata mappings.
CTO
Nubovi
- Designed, architected, and built a data-driven SaaS solution on Azure. The startup's goal was to deliver Cost and Savings insights to business users.
- Built a platform through which data could be extracted from multiple Azure cloud environments, processed in a timely and secure manner, and provided end users with insights into managing wasted resources in the cloud.
- Used a technology stack that involved GitHub environments and CI/CD pipelines to automate data pipelines, front- and back-end models, and core Azure infrastructure. Additionally, we made use of Azure Container Apps to host the solution.
- Gained invaluable experience, as all aspects of designing, implementing, and operating a product were involved.
Data Architect | Engineer | Scrum Master
Inlichtingenbureau
- Developed a data mesh for a government organization that delivers data models and Power BI reports (embedded) to 200+ municipalities.
- Designed and led UI/UX workshops to define KPIs and dashboard requirements with over 20 end users.
- Helped the data engineers build up knowledge about Apache Spark and Delta Lake on the Azure Databricks platform. Now, they are a completely self-sufficient team.
Senior Data Engineer
Croonwolter & Dros
- Migrated a pilot that used Azure SQL databases to a production system using Azure Synapse, Databricks, IoT Hubs, and ADLS.
- Helped design the new architecture of the data platform to be event-driven during the follow-up project. Built it using Delta Lake and optimized performance and costs.
- Coached the project manager in monitoring the solution through the Synapse pipelines and Spark SQL notebooks.
Databricks Architect and Lead Data Engineer
Marlink
- Architected and implemented a migration from Azure Data Lake Analytics (ADLA) to Azure Databricks. Together with a team of eight, we migrated 100+ U-SQL scripts and several terabytes of data using Spark and the Delta Framework.
- Implemented Delta Lake and combined batch and streaming workloads. We rolled out the deployment of libraries (pip and maven) through development, test, acceptance, and production using CI/CD pipelines.
- Coached a team of eight in using Python, Spark, and the Delta Framework. Provided workshops aimed at the development workflow. The coaching sessions were focused on helping individual members of the team make the most of the platform.
Senior Data Engineer
ABN AMRO Bank
- Built ELT pipelines processed that new data sources dynamically, at various intervals. Business analysts could easily add new data to the sandbox environment and obtain their results within a 15-minute interval.
- Prepared several T-SQL templates that the development team could reuse. These templates allowed analysts to safely develop code and easily manage the sandbox environment.
- Coached analysts into using the templates and ensured that the knowledge hand-over was presented through workshops and documented.
Senior Consultant
Motion10
- Developed a standard for working with Azure Databricks and trained colleagues in Spark architecture.
- Developed a machine learning operations (MLOps) proposition by gathering stakeholders, building pilot projects, and training colleagues to use the technology stack.
- Got nominated for the culture award for promoting and facilitating a learning environment within the organization.
Senior Data Engineer
LOI
- Migrated an on-premise data warehouse to an Azure data warehouse. The on-premise solution was built mainly using SSIS packages and SQL stored procedures. All existing functionality and new functionality needed to be migrated to Azure.
- During this project, we built the data warehouse making use of metadata for extraction, built templates for automation, automatic scaling, CI/CD of development products, and built Kimball data models.
- Coached the data architect in monitoring the data platform. Also, helped the deployment of new Tabular data models in Azure Analysis Services to test and production.
Junior Data Engineer
TBI Holdings
- Supported the BI department in managing and building new data models for several internal stakeholders.
- Worked in a team of eight project members and made use of the Scrum project methodology on a daily basis.
- Oversaw daily ELT workloads, worked on bug fixes, and used CI/CD pipelines to improve production.
Experience
Web Scraper for Extraction of Quarterly Financial Statements
https://github.com/TheDataDexter/StockScraperNubovi
Education
Master's Degree in Business Information Management
Rotterdam School of Management, Erasmus University - Rotterdam, The Netherlands
Bachelor's Degree in International Business
Rotterdam Business School - Rotterdam, The Netherlands
Certifications
How to Coach Effectively
Harmoniq
Databricks Certified Associate Developer for Apache Spark 3.0
Databricks
Associate SQL Analyst
Databricks
Azure Data Engineer
Microsoft
Azure AI Engineer
Microsoft
MCSE Data Management and Analytics
Microsoft
MCSA: SQL 2016 Database Development
Microsoft
Data Streaming (Kafka and Spark Streaming)
Udacity
Applied Machine Learning in Python
University of Michigan | via Coursera
Azure Data Scientist
Microsoft
Artificial Intelligence Bootcamp
Turing Society
Web Development Basics
Turing Society
Skills
Libraries/APIs
PySpark, Beautiful Soup, Azure Active Directory Graph API, Spark Streaming, Azure Blob Storage API, Azure Cognitive Services, React
Tools
Spark SQL, Azure Logic Apps, GitHub, Azure Machine Learning, Azure Search, Microsoft Power BI, Apache Maven
Languages
Transact-SQL (T-SQL), Stored Procedure, Python, Scala, Bicep
Frameworks
Spark, Spark Structured Streaming, Apache Spark, Selenium, Django REST Framework
Paradigms
Database Design, Azure DevOps, Scrum, Continuous Integration (CI), Continuous Delivery (CD)
Platforms
Azure, Databricks, Azure Synapse, Azure SQL Data Warehouse, Docker, Amazon Web Services (AWS), Dedicated SQL Pool (formerly SQL DW), Azure Functions, Oracle
Storage
Azure SQL Databases, SQL Server 2016, Azure SQL, SQL Stored Procedures, Azure Active Directory, PostgreSQL, Azure Cosmos DB, SQL Server Integration Services (SSIS)
Other
Azure Data Lake, Azure Data Factory (ADF), Indexing, Database Optimization, Delta Lake, Azure Databricks, Data Warehousing, Data Engineering, Streaming Data, Big Data Architecture, Azure Analysis Services, Coaching, CI/CD Pipelines, EventHub, Data Mesh, Business Cases, Machine Learning, Azure Cognitive Search, Data Science, Scrum Master, UX Research, Metadata, Azure Service Bus, OpenTelemetry, API Management, GitHub Actions
How to Work with Toptal
Toptal matches you directly with global industry experts from our network in hours—not weeks or months.
Share your needs
Choose your talent
Start your risk-free talent trial
Top talent is in high demand.
Start hiring