Mary Baculi, Developer in Calgary, AB, Canada

Mary Baculi

Verified Expert in Engineering

Bio

Mary is a results-driven data architect who delivers data analytics to senior management. She is a detail-oriented and organized professional with 21 years of expertise in all phases of system development and project life cycles. Mary has collaborated with SMEs, project managers, business analysts, developers, quality assurance specialists, vendors, and cloud providers to deliver information solutions following Agile methodology.

Portfolio

Samuel Son & Co.
Design, Architecture, Azure, Data Governance...
Inter Pipeline
Azure PaaS, SAP S/4HANA Cloud, PySpark, Azure DevOps, OSIsoft PI...
EY
Azure PaaS, Identity & Access Management (IAM), Azure Synapse, Azure...

Experience

  • Azure Data Factory - 4 years
  • Azure Data Lake - 3 years
  • Azure PaaS - 3 years
  • Databricks - 3 years
  • Azure Synapse - 3 years
  • Security - 3 years

Availability

Part-time

Preferred Environment

DevOps, Collibra, Azure, Azure Synapse, Azure Data Lake, Azure Databricks, Azure Data Factory, Azure Event Hubs, Azure IoT Hub, Azure SQL Data Warehouse, Dedicated SQL Pool (formerly SQL DW)

The most amazing...

...thing I've architected is the implementation of modern data warehousing, cloud migration, and data analytics at several companies, such as EY and Tech Mahindra.

Work Experience

Senior Data Architect

2021 - PRESENT
Samuel Son & Co.
  • Architected and designed the data protection and security requirements for data governance, including data sensitivity and classification (PII data), metadata ingestion, data catalog, and end-to-end data lineage from on-premises and cloud data sources.
  • Migrated all ERP systems, such as Microsoft D365, JD Edwards, AS400, IFS, and TruckMate, into the data lake. Transformed and loaded the accounts receivable (AR) balances into the FIS Collection system.
  • Designed and architected the data warehouse for BI dashboards and reports.
  • Ingested data from IoT devices into the data lake and connected it to the Majik data warehouse and PostgreSQL.
Technologies: Design, Architecture, Azure, Data Governance, Enterprise Resource Planning (ERP), Business Intelligence (BI), ETL, Reports, BI Reporting, Reporting, Microsoft Power BI, Key Performance Indicators (KPIs), Databases, Microsoft Excel, Dashboard Development, Information Gathering, Data Transformation, Data Profiling, Data Cleansing, Data Management

Enterprise Data Architect

2021 - PRESENT
Inter Pipeline
  • Designed, architected, and implemented the enterprise data management system, which includes a data warehouse.
  • Planned, architected, and implemented the Dynamics 365 CRM geo migration project.
  • Architected and designed the data protection and security requirements for Azure Purview, including data sensitivity and classification for PII data. Built the metadata and schema ingestion, catalog, classification, and lineage.
  • Conducted project assessment and high-level business requirements. Provided work breakdown structure (WBS) on different projects and set priorities. Estimated project timeline and resources that are needed in each project based on the scope of work.
  • Architected the data ingestion of 12 data sources (SAP S/4HANA, Tieto EC, Bourque Logistics, Endur Openlink, OSI PI Historian, Power Apps Dataverse, Workday, Enablon, SABA, IBM Maximo, Primavera P6, and EcoSys) into the data lake using Azure Synapse.
  • Created and designed the data integration using Azure Service Bus and Logic Apps that provide event-driven data.
  • Designed enterprise data modeling using Azure IDW. Guided and supported the Power BI team on data consumption best practices. Provided a self-service analytics framework.
  • Engineered the data ingestion from different data sources to Azure Data Lake and the Azure Synapse dedicated pool (data warehouse).
  • Ensured security and access control on the Enterprise Data Management System (EDMS).
  • Worked with a Microsoft team on Geo Migration of CRM. Ingested CRM data (Dataverse) to Data Lake.
Technologies: Azure PaaS, SAP S/4HANA Cloud, PySpark, Azure DevOps, OSIsoft PI, Microsoft Power Apps, Dynamics CRM 365, Primavera, Workday, Enablon, TietoEC, IBM Maximo, SABA, Azure Synapse, Azure, Data Engineering, ETL, Azure Blobs, Data Pipelines, Data Lakes, ADF, Data Integration, Large Data Sets, PostgreSQL, Dedicated SQL Pool (formerly SQL DW), Azure SQL Data Warehouse, Data Warehousing, Reports, BI Reporting, Reporting, Microsoft Power BI, Key Performance Indicators (KPIs), Databases, Microsoft Excel, Dashboard Development, Information Gathering, Data Transformation, Data Profiling, Data Cleansing, Data Management

Data Architect

2021 - 2021
EY
  • Delivered the security architecture documentation for Collibra SaaS, JobServer, and Lineage.
  • Provided security best practices and security architecture documentation of the end-to-end Azure Data Analytics using Data Lake, Data Factory, Databricks, Synapse Analytics, Azure Analysis Services, SQL Server, Power BI, Key Vault, and Log Analytics.
  • Supported the EY cloud security engagement lead on different data analytics resources.
  • Provided guidance and support to the EY IAM specialist on RBAC of different Azure Data Analytics resources.
  • Reviewed and assessed the current security architecture of the data management cloud platform.
  • Delivered the onboarding checklists and a framework for data source ingestion in Collibra.
  • Delivered best-practice security and access control in Collibra (Data Governance).
Technologies: Azure PaaS, Identity & Access Management (IAM), Azure Synapse, Azure, Data Engineering, ETL, Azure Blobs, Data Pipelines, Data Lakes, Data Integration, Microsoft Excel, Information Gathering, Data Transformation, Data Profiling, Data Cleansing, Data Management

Data Architect

2019 - 2021
TransAlta
  • Designed, architected, and implemented the Collibra Data Governance and modern data architecture project with Data Lake and Delta Lake. Provided technical requirements for API integration with Azure.
  • Oversaw the creation of the workflow and data dictionary; planned and implemented corporate reporting for safety, operations, and finance. Designed and architected data warehousing.
  • Designed, architected, and implemented the IoT streaming for a wind farm solution and Sarnia Plant, as well as wind power prediction using machine learning.
  • Created multiple project data pipelines using Azure Data Factory and Databricks. Developed data flows for structured, semi-structured, and unstructured datasets.
  • Replicated the SAP roles and authorization access control into the cloud for MDA.
  • Created DevOps pipelines and releases using Azure DevOps (CI/CD).
  • Designed, tested, and implemented MLOps using Databricks MLflow, Azure ML, ACI, and Kubernetes.
  • Created a wind power prediction model with machine learning. Developed ML batch and live scoring. Completed pitch curve data analysis for underperforming turbines. Developed MLOps for ML deployment to production and for retraining the models.
  • Designed and implemented the MDA operating model and the MDA security model. Architected and productionized the MDA using Lambda architecture. Created logging and monitoring using Azure Log Analytics, and designed key management.
  • Conducted project assessments, converting business and functional requirements into cost-effective and robust solutions. Engaged business stakeholders, data owners, and the SOX compliance team while assessing each project for data classification.
Technologies: Azure PaaS, PySpark, Azure DevOps, Databricks, Collibra, Azure Machine Learning, MLflow, Azure Synapse, Azure, Data Engineering, ETL, Azure Blobs, Data Pipelines, Data Lakes, ADF, Data Integration, Large Data Sets, Oracle, Azure SQL Data Warehouse, Dedicated SQL Pool (formerly SQL DW), Data Warehousing, Reports, BI Reporting, Reporting, Microsoft Power BI, Key Performance Indicators (KPIs), Databases, Microsoft Excel, Statistical Modeling, Dashboard Development, Information Gathering, Data Transformation, Data Profiling, Data Cleansing, Data Management

Data Architect

2020 - 2020
Tech Mahindra
  • Developed Azure Logic Apps and designed the data model and data mart.
  • Oversaw the offshore technical development and guided the offshore team on best practices for SQL scripts and stored procedures.
  • Guided the offshore team on Azure Analysis Services and Power BI.
Technologies: Azure, SQL, Microsoft Power BI, Data Engineering, ETL, Azure Blobs, Data Pipelines, Data Integration, Key Performance Indicators (KPIs), Databases, Microsoft Excel, Information Gathering, Data Transformation, Data Profiling, Data Cleansing

Solution Designer

2018 - 2019
Suncor
  • Designed, architected, and oversaw the implementation of the digital hub cloud project.
  • Planned and implemented the improving management project execution performance (IMPEP) project. Oversaw the implementation of Suncor's digital hub in Microsoft Azure for six pilot projects.
  • Wrote PowerShell scripts for automated deployment. Architected MLOps and integrated it with DevOps.
  • Constructed and configured security, high availability, and recovery in the Azure cloud.
  • Designed and configured identity access management, role-based access control, and access control lists.
  • Built the advanced analytics model management. Designed and configured the workspace and user table security in Databricks. Configured and tested other machine learning resources, namely Azure ML Studio, ML Services, and Cognitive Services.
  • Created, configured, and tested data integration and live streaming from OSI PI Historian to Azure IoT.
  • Designed, configured, and tested the batch data ingestion and processing of SAP BW, SAP ECC, CXL, flat files, and image data from on-premises data sources to Azure Data Lake Gen2.
  • Created and configured enterprise data warehouse and ensured compliance with an enterprise reference model (Power Designer). Designed the data architecture and data modeling.
  • Designed and configured the data catalog and Azure Log Analytics.
Technologies: Azure PaaS, Azure, Data Engineering, ETL, Azure Blobs, Data Pipelines, Data Lakes, ADF, Data Integration, Large Data Sets, Azure SQL Data Warehouse, Dedicated SQL Pool (formerly SQL DW), Data Warehousing, Reports, BI Reporting, Reporting, Databases, Microsoft Excel, Statistical Modeling, Dashboard Development, Information Gathering, Data Transformation, Data Profiling, Data Cleansing, Tableau, Data Management

Business Systems Analyst

2017 - 2018
Shaw
  • Implemented various systems for Shaw’s operational support system (OSS) transformation projects. Participated in scope definition of the project management plan (charter).
  • Conducted stakeholder analysis. Elicited, documented, and maintained the requirements package (business requirements document). Verified and validated requirements with stakeholders. Organized and facilitated workshops.
  • Documented as-is and to-be states, gap analysis, use cases, and user stories.
  • Liaised between technical and non-technical resources. Facilitated and led technical requirements sessions.
  • Ensured requirements traceability through architectural design, detailed design, and testing.
  • Documented the acceptance criteria and business processes.
  • Participated in the request for information (RFI), request for proposal (RFP), vendor demo, and vendor selection processes.
  • Collaborated with the project teams to ensure that delivered projects met business needs. Worked with senior management in defining KPIs and reporting needs.
  • Conducted value stream mapping (VSM) and helped the delivery managers prepare return on investment (ROI). Prepared data dictionary, data lineage, and data mapping.
Technologies: ServiceNow, REST APIs, HDFS, Amazon S3 (AWS S3), ETL, Amazon Web Services (AWS), Reporting, Key Performance Indicators (KPIs), Databases, Microsoft Excel, Dashboard Development, Information Gathering, Data Transformation, Data Profiling, Data Cleansing

Business Intelligence Analyst

2016 - 2017
University of Calgary
  • Designed and implemented a data mart for the development office at the University of Calgary.
  • Reviewed, analyzed, and evaluated the business processes and associated IT application requirements.
  • Identified and verified data and business rules towards fulfilling the requirements of current and future ad hoc and pre-determined BI reporting requests.
  • Communicated and collaborated with the development office, BI team, and end users to analyze and transform needs and goals, documented functional requirements, and delivered the appropriate artifacts as needed.
  • Prepared business requirements documents (BRD), report designs (RDs), test plans, requirement traceability matrix, user training materials, and other related documents.
  • Analyzed business workflow and system needs for conversions and migrations. Assisted in data mapping.
  • Designed the report mockups and built dashboards and reports using Tableau.
  • Developed and executed test strategies, plans, and scenarios and tracked the resolution of identified defects.
  • Participated in all stages of project development, including design, programming, testing, and implementation to ensure the released product met the intended functional and operational requirements.
  • Contributed to the project's status update meetings and documents on status and issues.
Technologies: Business Intelligence (BI), ETL, Data Warehousing, Reports, BI Reporting, Reporting, Microsoft Power BI, Key Performance Indicators (KPIs), Databases, Microsoft Excel, Dashboard Development, Information Gathering, Data Transformation, Data Profiling, Data Cleansing, Tableau

Business Intelligence Analyst

2014 - 2016
Sanjel Energy Services
  • Worked closely with the VPs in implementing business intelligence strategy and roadmap. Performed technical analysis and data extracts of Microsoft AX 2012 financials for BI. Designed and implemented data warehouse using the Tabular model.
  • Developed business case, gathered and documented requirements, created use and test cases, executed test cases, and assisted the SMEs in doing UAT. Analyzed the eService, Inthinc, Intelex, and Microsoft AX revenue data.
  • Played a key role in a fuel efficiency project that showed a 26% improvement in the low-idle percentage of total trip time through vehicle low- and high-idle analysis, identifying the top and bottom 50 drivers.
  • Contributed to the fracturing operational efficiency project by analyzing preparation time, operation time, and post job time with the event tracker and job performance data in eService.
  • Analyzed market data using advanced BI, time intelligence functions, statistical methodologies, and global market intelligence tools.
Technologies: Business Intelligence (BI), Microsoft Power BI, ETL, Data Warehousing, Reports, BI Reporting, Reporting, Key Performance Indicators (KPIs), Databases, Microsoft Excel, Dashboard Development, Information Gathering, Data Transformation, Data Profiling, Data Cleansing, Tableau

Intermediate Business Analyst

2012 - 2014
Savanna Energy Services Corp.
  • Gathered business requirements and assessed the suitability of an existing Savanna software system. Analyzed business needs and translated them into application and operational requirements. Conducted fit-gap analysis.
  • Facilitated design workshops (as-is and to-be process, report design). Created and maintained the requirements traceability matrix. Wrote business requirement documentation.
  • Participated in the review of the two requisitioning applications that addressed the requirements of purchasing and approvals. Created the RFP and evaluated the vendor’s reply to the RFP.
  • Prepared the solution proposal document, business process design document (BPDD), functional specification, approval, and role-based security matrix.
  • Maintained the decision registry and secured signoff for every business decision or rule made by the team and business (SME).
  • Prepared the use cases and QA test plan. Assisted the users in developing the unit acceptance test scripts.
  • Collaborated with the business system analyst and third-party vendor on the customizations, interfaces, and reports development. Prepared the training plan and training document.
Technologies: Business Analysis, ETL, Reports, BI Reporting, Reporting, Key Performance Indicators (KPIs), Databases, Microsoft Excel, Dashboard Development, Information Gathering, Data Transformation, Data Profiling, Data Cleansing

Business Intelligence Data Analyst

2010 - 2012
Shaw
  • Performed a hybrid role (BA and DA) and oversaw the delivery of complex analytical reports to Shaw’s Group VPs and senior executives, including sales acquisition, disconnect analysis, churn analysis, migration analysis, and promotional analysis.
  • Acted as the point-of-contact accountable for delivering intelligence to satisfy monthly information requests for supporting customer acquisition, customer retention, product optimization, strategic planning, and operational efficiency.
  • Chaired team meetings, prioritized requests, and addressed other concerns of the stakeholders.
  • Gathered requirements for new requests and analyzed the impact on the existing reports.
  • Prepared the business intelligence requirement document (BIRD) and designed the report mockups.
  • Created prototypes, developed analytical and operational reports, and prepared test plans.
Technologies: SQL, ETL, Reports, BI Reporting, Reporting, Microsoft Power BI, Key Performance Indicators (KPIs), Databases, Microsoft Excel, Dashboard Development, Information Gathering, Data Transformation, Data Profiling, Data Cleansing

Enterprise Data Management System (EDMS)

My responsibilities included the following activities:
• Project assessment
• Work breakdown structure (WBS) on different projects
• Estimating project timeline and resources
• Architecting and designing the data ingestion of the 12 data sources, including SAP S/4HANA, Tieto EC, Bourque Logistics, Endur Openlink, OSI PI Historian, Power Apps Dataverse, Workday, Enablon, SABA, IBM Maximo, Primavera P6, and EcoSys to Data Lake using Azure Synapse
• Architecting and designing the data integration using Azure Service Bus and Logic Apps that provide the event-driven data
• Architecting and designing the data protection and security requirements for Azure Purview
• Enterprise data modeling using Azure IDW
• Engineering the data ingestion from different data sources to Azure Data Lake and Azure Synapse Dedicated Pool (data warehouse)
• Security and access control on the enterprise data management system (EDMS)
• Self-service analytics framework
• Guiding and supporting the Power BI team on data consumption best practices
• Building the metadata ingestion, catalog, and lineage and integrating Azure Purview with the different Azure resources used in EDMS
• Chairing the Data Governance Council
• Training the junior data engineers

Modern Data Architecture (MDA) | Azure Cloud

My tasks included the following:
• Architecting and productionizing the MDA using lambda architecture, which included designing and spinning up the Data Lake and Delta Lake, designing and implementing data integration and analytics, monitoring using Azure Log Analytics, designing key management with Azure Key Vault, creating and configuring VMs, VNets, and NICs, and designing and implementing the MDA operating and security models
• Developing data lake governance and data engineering governance
• Creating DevOps pipeline and releases using Azure DevOps (CI/CD)
• Working collaboratively with the Microsoft solution architect, data engineers, data scientists, AltaML data scientists, the Databricks solution architect, and third-party vendors
• Project assessments and converting business and functional requirements into a cost-effective and robust solution
• Guiding the junior data engineers in data preparation and transformation
• Documenting solution architecture and design (SAD), which includes conceptual, logical, and physical design

IoT Streaming for Wind Farm Solution and Sarnia Plant

I architected batch streaming (IoT), implemented the design for pulling data from AspenTech IP2, and loaded data for 21 wind farms, BC Hydro, and Sarnia plants. In addition, I architected the live streaming (IoT), implemented the design, and completed the proof of concept (POC) for five wind farms and device event logs. I played a key role in engaging with the business stakeholders, data owners, and SOX compliance team while assessing each project for data classification and sensitivity, NERC security, and personally identifiable information (PII).

Wind Power Prediction (Machine Learning) and MLOps

My responsibilities included:
• Architecting MLOps and integrating it with DevOps
• Creating a wind power prediction model using machine learning
• Developing ML batch and live-scoring
• Creating pitch curve data analysis for underperforming turbines
• Developing MLOps for ML deployment to production and retraining the models

Corporate Reporting for Safety, Operations, and Finance

I developed a data flow for structured, semi-structured, and unstructured datasets by pulling data from SAP BW, SAP HANA, SharePoint with Logic Apps, Oracle databases, SQL on-premises, Azure SQL DB with Azure Data Factory, and from different websites (secured and public).

Digital Hub (Cloud) Project

As an architecture owner following the Agile methodology, I worked closely with the product owner, scrum master, and Microsoft team, including the solution architect, data and AI consultants, and project manager. I wrote the PowerShell scripts, oversaw the implementation of Suncor’s Digital Hub in Microsoft Azure for six pilot projects, and designed and configured the following items:
• security, high availability, and recovery in Azure Cloud,
• identity access management, role-based access control, and access control list,
• key management using Azure Key Vault,
• workspace and user table security in Databricks,
• advanced analytics model management,
• data integration and live streaming from OSI PI Historian to Azure IoT Hub,
• batch data ingestion and processing of SAP BW, SAP ECC, flat files, and image data from on-premises data sources to Azure Data Lake Gen2,
• enterprise data warehouse and compliance with an enterprise reference model,
• data catalog, architecture, and data modeling, and
• Azure Log Analytics.

Data Governance and Azure Security Assessment

I delivered the security architecture documentation for Collibra SaaS, JobServer, and Lineage; the onboarding checklist and framework for data source ingestion in Collibra; and best-practice security and access control. I also provided security best practices, guidance, and support, and reviewed and assessed the current security architecture of the data management cloud platform.

Designing Collibra Data Governance

I worked closely with the enterprise architect to develop standard processes and procedures, define the responsibilities of the different personas, configure JobServer, and integrate SQL servers with Collibra using the native connector. I architected and designed the custom integration of Azure services with Collibra using the REST API, provided technical requirements for API integration with Azure, and oversaw the creation of the workflow and the data and business dictionaries.

Improving Management Project Execution Performance (IMPEP) Project

My responsibilities included the following activities:
• Designing the project control conversion program in SAP ABAP by loading, translating, and pushing vendor’s data from spreadsheets to SAP HANA
• Designing data marts using SAP HANA model that integrates SAP Project Systems, SAP FICO, Primavera, and the conversion program data
• Creating the segregation of duties and roles
• Ensuring the proper security in Power BI, SAP HANA, and SAP ABAP
• Coordinating with the SAP security specialist and governance risk compliance (GRC) team to provide appropriate security for the SAP ABAP conversion program and SAP HANA models
• Technical analysis, documenting the current and future state design, and preparing the solution design, security design, and technical design documentation

PowerApps (Dataverse) Geo Migration

I worked with the Microsoft team on the geo migration of Dynamics CRM, changed the canvas apps' data source from SQL DB to Data Lake, ingested CRM data (Dataverse) into Data Lake, and configured Azure Synapse Link.

2010 - 2010

Certification in Business Analysis

University of Calgary - Calgary, Alberta, Canada

1996 - 2000

Bachelor of Science Degree in Computer Science

San Sebastian College-Recoletos - Manila, Philippines

DECEMBER 2020 - PRESENT

Scalable Machine Learning with Apache Spark

Databricks

JULY 2020 - PRESENT

Fundamentals of Delta Lake

Databricks

JULY 2020 - PRESENT

Introduction to Big Data

Databricks

JANUARY 2020 - PRESENT

Azure Solution Architect (AZ-300 and AZ-301)

Simplilearn

JANUARY 2017 - PRESENT

Big Data Modeling and Management Systems Certificate

University of California | via Coursera

DECEMBER 2016 - PRESENT

Introduction to Big Data Certificate

University of California | via Coursera

APRIL 2010 - PRESENT

Business Analyst Certificate

University of Calgary, Calgary, Canada

Libraries/APIs

PySpark, REST APIs

Tools

Azure IoT Hub, Microsoft Power BI, Microsoft Excel, Collibra, Tableau, Azure ML Studio, Microsoft Power Apps, Azure Machine Learning, Primavera, IBM Maximo

Languages

SQL, Python

Frameworks

ADF

Paradigms

App Development, Business Intelligence (BI), ETL, DevOps, Azure DevOps, Agile

Platforms

Azure Synapse, Azure Event Hubs, Databricks, Azure PaaS, Azure, Azure SQL Data Warehouse, Dedicated SQL Pool (formerly SQL DW), Amazon Web Services (AWS), Oracle

Storage

Data Lake Design, Azure Blobs, Data Pipelines, Data Lakes, Data Integration, Databases, HDFS, Amazon S3 (AWS S3), SAP S/4HANA Cloud, PostgreSQL

Other

Azure Data Lake, Azure Databricks, Azure Data Factory, Function App, Business Systems Analysis, Business Process Analysis, Software Development, Delta Lake, Azure Data Lake Analytics, Identity & Access Management (IAM), Data Architecture, Data Governance, Business Analysis, Data Engineering, Large Data Sets, Data Warehousing, Reports, BI Reporting, Reporting, Key Performance Indicators (KPIs), Dashboard Development, Information Gathering, Data Transformation, Data Profiling, Data Cleansing, Data Management, Security, Big Data Architecture, Big Data, Statistical Modeling, Modeling, Enterprise, Machine Learning, MLflow, Dynamics CRM 365, ServiceNow, OSIsoft PI, Workday, Enablon, TietoEC, SABA, Design, Architecture, Enterprise Resource Planning (ERP)
