Verified Expert in Engineering
Data Engineer and Developer
Awais is a data engineer and advanced data analyst with 14 years of experience in consultancy and in-house development. He's been working on data engineering and advanced analytics projects. As a data engineer, Awais has architected and developed big data clouds and data warehouses that handle billions of rows. He has led a data analyst team of up to eight developers and worked extensively with consumer packaged goods (CPG), construction, semiconductor, and medical device industries.
Microsoft Power BI, Azure Data Factory, Azure Cosmos DB, Azure Synapse, AutoML, Spark, Azure, Snowflake
The most amazing...
...projects I've worked on were featured as case studies on the Microsoft website.
Principal Data Architect
- Designed and built a data platform capable of holding 240 billion rows in a data warehouse (DWH) and doing real-time or batch analysis of those rows. Implemented data-driven culture for more than 15 clients using Power BI and the Azure Data Platform.
- Managed client meetings and requirement gathering phases of projects as well as business analysis, business process re-engineering, and information management practices and protocols.
- Conducted a number of webinars on big data analytics. Published four branded case studies with Microsoft. Collaborated with Microsoft's sales team.
- Oversaw a business intelligence team that worked with various clients, created custom product lifecycle management (PLM) solutions for them, and designed and implemented native SQL Server 2016 DBA applications.
- Created large-scale DWH solutions, including SSIS packages, stored procedures, SQL Server Agent jobs, and native DBA jobs, to administer and maintain DWH in production. Set up SSAS cubes that aggregated millions of rows for the SSRS analysis.
- Worked with Fortune 100 companies around the world and conducted a number of workshops on SQL Server services.
California Electricity Meter Data: Data Engineering and Analytics
Depletion and Shipment Forecasting
• Data collection from SAP and VIP.
• Feature engineering.
• Forecasting model identification using AutoML.
• Analytics on Power BI.
Drinking Event Identification from PPM Sensors Data Using AI
• Gathered data in both controlled and uncontrolled contexts.
• Information obtained from a breathalyzer and wearable devices.
• It detects drinking episodes within a thirty-minute span.
Data Platform for Wearable Devices
For the near-real-time analytics, we employ analytical stores on collections and then consume these data through an Azure Synapse link, which subsequently displays information in Power BI. On the other hand, for batch analytics, we put in place archiving mechanisms that migrate data from Cosmos to hot or cold storage and finally to Azure Synapse data warehouse for reporting. Finally, all of the Power BI analytics were embedded in an ISV application built using Node.js.
• A budget to actual variance analysis.
• Commission calculation analytics.
• Accounts receivable aging tabulated via an aged receivables report.
• An aging inventory report.
• An order summary report.
• Portfolio management.
• Warehouse management analytics.
Email Statistics and SLA Reports for Customer Support – Dynamics 365
Call Rotation and Campaign Analytics
• Capture deadlock history, long-running connections, slow running queries, and space used by databases.
• Back up and index management jobs.
• Monitor and kill blocking sessions.
• Utilize the Notification Framework.
• Conduct the SQL Server Integration Services (SSIS) log analysis.
Data Warehouse for a Bank
Analytics for Construction
CPG – Market Void Identification Using Depletion and Market Channel Data
The project begins by collecting and preprocessing the data from VIP, Nielsen, and retail chain sources using Azure Data Factory. It will then integrate the datasets, aligning them geographically for state-level analysis. The market potential will be estimated using Nielsen data, which provides insights into consumer behavior and purchasing patterns.
Through a comprehensive comparison of depletion and market potential, the Power BI report highlights areas with significant gaps between supply and demand, indicating potential market voids. These findings help businesses identify expansion opportunities, refine marketing strategies, and optimize product distribution to better cater to consumer needs in specific regions.
CPG – Promotion Effectiveness Analytics
Power BI creates interactive dashboards for real-time insights, aiding data-driven decisions. Azure Synapse handles big data analytics and warehousing, efficiently processing large-scale retail data for in-depth analysis. Its seamless integration with Power BI enhances data flow.
Azure Data Factory orchestrates data pipelines, automating data movement and transformation for faster, more efficient analysis. The data-driven approach provides valuable insights into consumer preferences and behaviors during different seasons.
Optimized pricing strategies maximize sales and revenue. The project examines SKU interactions within the same brand, refining cross-selling and brand loyalty tactics.
The goal is to offer actionable recommendations to stakeholders, enabling informed decisions on promotions and yielding the best results for various seasons and product categories. Utilizing Power BI, Azure Synapse, and Azure Data Factory enhances marketing strategies and competitiveness in the market.
CPG – Stock-out Alerts
The project predicts potential stock-outs in advance by integrating VIP, Salient, and retail data. This proactive approach helps retailers and suppliers prevent inventory shortages, optimize supply chain management, and maintain customer satisfaction.
CPG – Supplier Onboarding Process
Power Automate flows are critical in automating and orchestrating the onboarding process. They guide suppliers through the necessary steps, ensuring they provide all the essential documents and pricing information for their products. This automated approach saves time and reduces manual errors, expediting supplier integration.
The Power BI dashboard acts as a central monitoring tool for the onboarding process. It provides real-time updates on the status of each supplier and SKU integration. Distribution companies can track the progress of multiple suppliers simultaneously, ensuring the timely completion of onboarding tasks.
CPG – Supplier Pricing Tool
CPG – On Shelf Expiry
By issuing timely alerts, the project enables stakeholders to take proactive measures to address the identified high-risk items. Distributors and store managers can make informed decisions, such as implementing targeted promotions, adjusting pricing, or relocating items to increase visibility and sales velocity.
Supplier Pricing Tool
1. User-Friendly Interface.
2. Integrated Power Automate Flows: Utilizing Power Automate, our solution automates the entire approval workflow. As Suppliers set product pricing, the system efficiently manages the approval and rejection processes, enhancing speed and accuracy.
3. Bulk Update/Edit/Approve/Reject.
Bulk Inserts: Simplify adding new pricing entries with the bulk insert functionality. This feature facilitates the efficient onboarding of new products and pricing information in large quantities.
Approval Dashboard: A dedicated dashboard provides a consolidated view of all pricing approvals, making it easy for authorized personnel to monitor and manage the entire approval pipeline.
Java, SQL, C#.NET, Snowflake, Python, T-SQL (Transact-SQL), Java 9
ODBC, REST APIs, PySpark
Microsoft Power BI, Microsoft Excel, Excel 2016, Microsoft Access, Microsoft Power Apps, Looker, AutoML
ETL, Business Intelligence (BI), Application Architecture, Dimensional Modeling, Database Design, Data Science
Azure SQL Data Warehouse, Azure, Dedicated SQL Pool (formerly SQL DW), Azure Synapse, Microsoft Power Automate, Microsoft
SQL Server Analysis Services (SSAS), SQL Server Integration Services (SSIS), SQL Server Reporting Services (SSRS), Microsoft SQL Server, Data Lakes, Data Pipelines, Databases, Database Administration (DBA), Database Architecture, PostgreSQL, Database Structure, Database Transactions, MySQL, Azure Cosmos DB, Oracle PL/SQL, Azure SQL Databases, SQL Server 2014, SQL Server 2008, SQL Server DBA
Azure Data Factory, Data Warehousing, Azure Analysis Services, Data Engineering, Big Data Architecture, Big Data, Data Warehouse Design, Cloud Architecture, Data Analysis, Data Flows, Dataverse, Data Visualization, Dashboards, Data Analytics, Team Leadership, Architecture, Data-level Security, Training, Leadership, DAX, Data Modeling, Database Optimization, Data Architecture, Data Management, PL/SQL Tuning, Reports, Excel 365, Consumer Packaged Goods (CPG), Data Extraction, CSV, CSV Export, Reporting, Algorithms, Data Cleaning, Data Quality, Data Matching, Healthcare Services, APIs, Machine Learning, Azure Data Lake Analytics, Google Data Studio, Unix Shell Scripting, API Design, Enterprise Resource Planning (ERP), Scripting, Financials, Azure Virtual Machines, Azure Data Lake, Dynamics CRM 365, NetSuite, Product Lifecycle Management (PLM)
Bachelor's Degree in Computer Science
Punjab University College of Information Technology - Lahore, Pakistan
Designing a Business Intelligence Infrastructure Using Microsoft SQL Server (MCITP)
Microsoft SQL Server 2008 – Business Intelligence Development and Maintenance (MCTS)
How to Work with Toptal
Toptal matches you directly with global industry experts from our network in hours—not weeks or months.
Share your needs
Choose your talent
Start your risk-free talent trial
Top talent is in high demand.Start hiring