Innocent Musanzikwa
Verified Expert in Engineering
Data Engineer and Developer
Calgary, AB, Canada
Toptal member since August 10, 2021
Inno is a seasoned data engineer and developer who's worked at IRI—a top retail data analytics company—in Africa and North America for the past decade and as a freelance consultant for the past couple of years. As a SQL and ETL developer, he has created quality data warehouses using industry-standard techniques like Kimball and DataVaults. As a data engineer, Inno has built highly robust and scalable data pipelines both on-premise and on the cloud using several latest cutting-edge technologies.
Portfolio
Experience
Availability
Preferred Environment
SQL, PySpark, Python, Hadoop, Azure Data Factory, Data Warehousing, Snowflake, Databricks, Amazon Web Services (AWS), Informatica
The most amazing...
...big data warehousing and data integration solution I've designed—using Python, SQL, ADF, Hadoop, Hive, and Spark—won an RFP in Canada out of six competitors.
Work Experience
Data Engineer
Darwill, Inc.
- Built Tableau dashboards and visualizations using AWS Redshift and Aurora databases.
- Created AWS Lambda functions running Python for custom ETL tasks and ad-hoc requests.
- Managed AWS Redshift and Aurora databases and designed data warehouses and data migrations.
- Redesigned the client's data warehouse using the AWS tech stack and improved their migration process by introducing federated queries and Lambda functions running Python pipelines, as well as overhauling their Tableau dashboards.
Data Engineer
SFL Scientific LLC
- Consulted on an existing SSIS poorly designed data integration project and helped identify bottlenecks and inefficiencies.
- Redesigned the existing data pipeline using SSIS to be efficient and scalable.
- Performed SQL tuning and SQL code review for process efficiencies.
BI and Data Warehouse Expert
Airiam Holdings, LLC
- Designed and developed data pipelines to integrate data from Quickbooks API, Sage Intacct API, and spreadsheets into Azure SQL.
- Designed and developed a data warehouse in Azure SQL.
- Designed and created business reports and KPI dashboards using Power BI.
- Developed complex SQL scripts to manage data transformations and speed up integration.
Data Analyst for Migration Project
JLL - JLLT Data
- Developed the data pipeline to integrate data from Salesforce to Microsoft SQL.
- Designed advanced SQL code, e.g., CTE, stored procedures, and functions to manage data transformations.
- Performed SQL tuning to improve ETL efficiencies and process scalability.
- Consulted on standard operating procedures and best case scenarios.
Director | Data Engineering
IRI
- Developed Azure Data Factory pipelines to integrate data from Apache Hive, HDFS, OAuth 2 APIs, and various flat-file types into Azure SQL.
- Managed a team of onshore and offshore big data developers, assigning tasks and tracking the progress on Jira.
- Oversaw data strategy and recommendations for new data sources and ongoing projects.
- Mentored big data engineers to help them develop their skills.
- Architected new data models and upgraded old data warehouses as per client request or technology change.
ETL Architect
IRI
- Developed SQL-based data warehouses on-premise and on the cloud.
- Integrated various data sources from flat files to cloud-based data sources like Snowflake, AWS and data lakes into Azure Data Warehouse, and Apache Hive on Hadoop.
- Created scalable data pipelines and improved efficiencies on the existing ones.
- Trained and upskilled new data developers and participated in code reviews.
- Maintained system documentation of all business data components and strategies.
SQL Lead Developer
IRI
- Developed SQL-based data warehouses and data marts.
- Wrote SQL queries to provide data for SSRS reports.
- Used SSIS, Talend, and DataStage for ETL processes depending on the client's requirements.
- Created custom business reports using SQL Server Reporting Services (SSRS).
- Managed junior developers and ran stand-up development meetings.
SQL/ETL Developer and Consultant
Mi9 Retail (formerly JustEnough Software Corporation)
- Managed SQL replication between mobile devices and SQL Server.
- Created SQL data warehouses using the Kimball methodology for reporting purposes.
- Designed and developed ETL packages using SQL Server Integration Services (SSIS).
- Designed and developed reports in SQL Server Reporting Services (SSRS).
- Performed database tuning and code reviews for any code being deployed to production.
Experience
Data Migration from Azure SQL to Snowflake
https://github.com/innowarue/ADFI replaced the authentic data sources with my Azure and Snowflake accounts to make the project publicly available without compromising confidentiality.
Data Integration from OAuth2 API
SQL Server Replication to Mobile Devices
In-place Data Integration for an Acquisition
Kafka Streaming and Data Integration
Education
Bachelor's Degree in Information Technology
University of South Africa - Pretoria, South Africa
Certifications
Databricks Certified Data Engineer Associate
Databricks
SnowPro Core
Snowflake
Certified Apache Spark and Hadoop Developer
Cloudera
Analyzing Big Data with Hive
LinkedIn Learning
Advanced NoSQL for Data Science
LinkedIn Learning
Skills
Libraries/APIs
PySpark, REST APIs, Spark Streaming
Tools
Microsoft Power BI, Tableau, BigQuery, Synapse, SSAS, Apache Airflow, Amazon Elastic MapReduce (EMR), Git, Google Sheets
Languages
SQL, Python, Bash Script, T-SQL (Transact-SQL), Snowflake, Stored Procedure, SQL DML, Scala, JavaScript, Bash
Frameworks
Hadoop, Spark, Windows PowerShell, ADF
Paradigms
ETL, Business Intelligence (BI), Dimensional Modeling, Database Development, Database Design
Platforms
Amazon Web Services (AWS), AWS Lambda, Azure SQL Data Warehouse, Dedicated SQL Pool (formerly SQL DW), Azure, Microsoft Power Automate, Azure Synapse, Oracle, Databricks, Apache Kafka, Salesforce, Zeppelin
Storage
Apache Hive, MySQL, SQL Server Integration Services (SSIS), SQL Server Reporting Services (SSRS), PSQL, Microsoft SQL Server, SQL Stored Procedures, PostgreSQL, Databases, Data Pipelines, Data Integration, Relational Databases, Database Architecture, RDBMS, Database Modeling, Dynamic SQL, NoSQL, SQL Server DBA, Database Replication, Azure SQL, MariaDB
Other
Azure Data Factory, Data Warehousing, Data Analysis, Data Engineering, Data, Data Architecture, Big Data Architecture, Data Migration, ELT, Data Warehouse Design, Data Transformation, Database Schema Design, ETL Tools, Scripting Languages, Data Analytics, Data Visualization, SSRS Reports, SQL Server 2015, Entity Relationships, Business Analytics, Performance Tuning, Informatica, Data Modeling, Cloud, APIs, Dashboard Design, Dashboards, Web Scraping, Data Build Tool (dbt), iPaaS, CI/CD Pipelines, DAX, Data Cleansing, Data Science, Azure Databricks
How to Work with Toptal
Toptal matches you directly with global industry experts from our network in hours—not weeks or months.
Share your needs
Choose your talent
Start your risk-free talent trial
Top talent is in high demand.
Start hiring