
Derrick Bakhuis
Verified Expert in Engineering
Data Warehousing Developer
Rotterdam, Netherlands
Toptal member since October 13, 2021
Derrick is a data engineer with an academic background focused on building advanced data platforms in Azure. He's specialized in Azure Cloud, and he has made, migrated, and maintained data platforms for different types of organizations. His ability to adapt, coach, and deliver sets him out from the rest. Over the past three years, his customers rated him with 8.7 of 10.
Portfolio
Experience
- Python - 6 years
- Data Engineering - 5 years
- Data Warehousing - 5 years
- Databricks - 4 years
- Database Design - 4 years
- Big Data Architecture - 4 years
- Azure DevOps - 4 years
- PySpark - 2 years
Availability
Preferred Environment
Azure, Databricks, Spark, Python, Azure Data Lake, Azure Data Factory (ADF), Delta Lake, PostgreSQL, GitHub, Azure DevOps
The most amazing...
...project I've delivered is a migration to Azure Databricks, coaching a team of eight, setting up CI/CD pipelines, and making use of cutting-edge frameworks.
Work Experience
Data Architect | Engineer | Scrum Master
Inlichtingenbureau
- Developed a data mesh for a government organization that delivers data models and Power BI reports (embedded) to 200+ municipalities.
- Designed and led UI/UX workshops to define KPIs and dashboard requirements with over 20 end users.
- Helped the data engineers build up knowledge about Apache Spark and Delta Lake on the Azure Databricks platform. Now, they are a completely self-sufficient team.
Senior Data Engineer
Croonwolter & Dros
- Migrated a pilot that used Azure SQL databases to a production system using Azure Synapse, Databricks, IoT Hubs, and ADLS.
- Helped design the new architecture of the data platform to be event-driven during the follow-up project. Built it using Delta Lake and optimized performance and costs.
- Coached the project manager in monitoring the solution through the Synapse pipelines and Spark SQL notebooks.
Databricks Architect and Lead Data Engineer
Marlink
- Architected and implemented a migration from Azure Data Lake Analytics (ADLA) to Azure Databricks. Together with a team of eight, we migrated 100+ U-SQL scripts and several terabytes of data using Spark and the Delta Framework.
- Implemented Delta Lake and combined batch and streaming workloads. We rolled out the deployment of libraries (pip and maven) through development, test, acceptance, and production using CI/CD pipelines.
- Coached a team of eight in using Python, Spark, and the Delta Framework. Provided workshops aimed at the development workflow. The coaching sessions were focused on helping individual members of the team make the most of the platform.
Senior Data Engineer
ABN AMRO Bank
- Built ELT pipelines processed that new data sources dynamically, at various intervals. Business analysts could easily add new data to the sandbox environment and obtain their results within a 15-minute interval.
- Prepared several T-SQL templates that the development team could reuse. These templates allowed analysts to safely develop code and easily manage the sandbox environment.
- Coached analysts into using the templates and ensured that the knowledge hand-over was presented through workshops and documented.
Senior Consultant
Motion10
- Developed a standard for working with Azure Databricks and trained colleagues in Spark architecture.
- Developed a machine learning operations (MLOps) proposition by gathering stakeholders, building pilot projects, and training colleagues to use the technology stack.
- Got nominated for the culture award for promoting and facilitating a learning environment within the organization.
Senior Data Engineer
LOI
- Migrated an on-premise data warehouse to an Azure data warehouse. The on-premise solution was built mainly using SSIS packages and SQL stored procedures. All existing functionality and new functionality needed to be migrated to Azure.
- During this project, we built the data warehouse making use of metadata for extraction, built templates for automation, automatic scaling, CI/CD of development products, and built Kimball data models.
- Coached the data architect in monitoring the data platform. Also, helped the deployment of new Tabular data models in Azure Analysis Services to test and production.
Junior Data Engineer
TBI Holdings
- Supported the BI department in managing and building new data models for several internal stakeholders.
- Worked in a team of eight project members and made use of the Scrum project methodology on a daily basis.
- Oversaw daily ELT workloads, worked on bug fixes, and used CI/CD pipelines to improve production.
Experience
Web Scraper for Extraction of Quarterly Financial Statements
https://github.com/TheDataDexter/StockScraperNubovi
http://www.nubovi.comEducation
Master's Degree in Business Information Management
Rotterdam School of Management, Erasmus University - Rotterdam, The Netherlands
Bachelor's Degree in International Business
Rotterdam Business School - Rotterdam, The Netherlands
Certifications
How to Coach Effectively
Harmoniq
Databricks Certified Associate Developer for Apache Spark 3.0
Databricks
Associate SQL Analyst
Databricks
Azure Data Engineer
Microsoft
Azure AI Engineer
Microsoft
MCSE Data Management and Analytics
Microsoft
MCSA: SQL 2016 Database Development
Microsoft
Data Streaming (Kafka and Spark Streaming)
Udacity
Applied Machine Learning in Python
University of Michigan | via Coursera
Azure Data Scientist
Microsoft
Artificial Intelligence Bootcamp
Turing Society
Web Development Basics
Turing Society
Skills
Libraries/APIs
PySpark, Beautiful Soup, Azure Active Directory Graph API, Spark Streaming, Azure Blob Storage API, Azure Cognitive Services, React
Tools
Spark SQL, Azure Logic Apps, GitHub, Azure Machine Learning, Azure Search, Microsoft Power BI, Apache Maven
Languages
T-SQL (Transact-SQL), Stored Procedure, Python, Scala
Frameworks
Spark, Spark Structured Streaming, Apache Spark, Selenium, Django REST Framework
Paradigms
Database Design, Azure DevOps, Scrum, Continuous Integration (CI), Continuous Delivery (CD)
Platforms
Azure, Databricks, Azure Synapse, Azure SQL Data Warehouse, Docker, Amazon Web Services (AWS), Dedicated SQL Pool (formerly SQL DW)
Storage
Azure SQL Databases, SQL Server 2016, Azure SQL, SQL Stored Procedures, Azure Active Directory, PostgreSQL, Azure Cosmos DB, SQL Server Integration Services (SSIS)
Other
Azure Data Lake, Azure Data Factory (ADF), Indexing, Database Optimization, Delta Lake, Azure Databricks, Data Warehousing, Data Engineering, Streaming Data, Big Data Architecture, Azure Analysis Services, Coaching, CI/CD Pipelines, EventHub, Data Mesh, Business Cases, Machine Learning, Azure Cognitive Search, Data Science, Scrum Master, UX Research
How to Work with Toptal
Toptal matches you directly with global industry experts from our network in hours—not weeks or months.
Share your needs
Choose your talent
Start your risk-free talent trial
Top talent is in high demand.
Start hiring