Sayed Farhan Amjad
Verified Expert in Engineering
Data Engineer and Developer
Sialkot, Punjab, Pakistan
Toptal member since November 17, 2022
Sayed is a senior data engineer with seven years of experience developing extract, transform, and load (ETL) pipelines and managing data warehouses, business intelligence (BI) reports, and interactive dashboards. Specializing in SQL, Python programming, and data visualization, Sayed has a wide range of interests, including machine learning (ML), automation, and the Internet of Things (IoT), and he is eager to take on new challenges.
Portfolio
Experience
- Python 3 - 7 years
- SQL - 7 years
- Microsoft Power BI - 5 years
- Relational Databases - 5 years
- MySQL - 3 years
- Azure Databricks - 3 years
- Azure Functions - 3 years
- Machine Learning - 3 years
Availability
Preferred Environment
Python 3, MySQL, MacOS, PyCharm, MySQL Workbench, Slack, Microsoft Power BI, Tableau
The most amazing...
...project I've delivered was automating data and feature extraction and a multi-class classification pipeline for ranking publicly traded Japanese companies.
Work Experience
Power BI Developer
Tropicana Brands - Main
- Loaded large datasets into PowerBI, optimizing and aggregating historical data and enabling incremental load for efficient dashboard refreshes.
- Integrated PowerBI with Azure DevOps to enable source control for PowerBI dashboard development using Power BI Desktop projects file format .pbix.
- Implemented data modeling in Azure Databricks and connected PowerBI directly to the gold layer of medallion architecture with optimized data that can fulfill data visualization needs.
Senior Software Engineer
Techlogix
- Developed and maintained data warehouse and ETL pipelines for multiple large-scale businesses. Managed multitenant databases and Microsoft Power Bi dashboards with row-level security (RLS) and embedded them into websites and mobile applications.
- Delivered data analytics and machine learning projects, including balance prediction and bank statement processing using computer vision techniques.
- Trained junior developers and helped resolve everyday technical issues faced by the data team.
- Worked for a sales and distribution company where we received near-real-time data from hundreds of trucks and delivery vans. We designed a Power BI dashboard to monitor the business flow and identify real-time choke points.
- Analyzed historical data for a streaming giant, STARZ, to understand user segmentation and deeper analysis of churned customers. I presented the results in an interactive visual form using Power BI with options to dissect results using handy filters.
Software Engineer
Techlogix
- Created ETL pipelines for a campus management system and sales and distribution system.
- Developed a data extraction module for seasonal data extraction automation for a campus management system to extract the data related to admission cycles with options to select filters before extracting the data.
- Worked on a sales and distribution network solution with sophisticated route planning and optimization capabilities and a near real-time dashboard to track anomalies in the distribution of goods.
- Participated in a churn prediction project for the STARZ PLAY streaming service by analyzing customer viewing habits, payment methods, and content consumption patterns.
- Designed and optimized a Power BI dashboard for a hospital management system company (vicenna.com) with a multi-tenant reporting database, row-level security, optimized screens for portable devices, and the ability to drill down to lower data dimensions.
- Implemented hierarchical data access inside the Power BI dashboards based on the position of the user in their organization.
Intern
Emerging Technology Lab
- Participated in a research project to detect behavioral anomalies in employees caused by stress and mental health issues using internal communication in a corporate enterprise.
- Worked as a research intern with PhD scholars on an NLP project aimed at the early detection of problems with employees' mental and physical health, focusing on implementing preventative measures to maintain a work and life balance.
- Performed literature review, data cleaning and preparations, and documentation of results.
Experience
Data Warehouse for Hospital Management System
http://www.vicenna.comReporting Database and Dashboards for CMS Project
https://almusnet.com/campus-on-cloud/I created admission data extraction modules that let the system clients request an extract of data based on a number of parameters such as admission cycle, application status, fee payment status, and more. The module was based on SQL and C# and deployed in queue-triggered Azure Function, and the status of extraction requests was maintained in a database table.
I worked on the design of the reporting database and the development of ETL pipelines in SSIS that were deployed in Azure VMs with email notification system for critical alerts.
I also worked on developing Power BI dashboards for different modules of the products, such as code, admissions, academics, and financials. We designed and embedded RLS-based multi-tenant interactive dashboards to add value for our clients.
Businesses Classification Based on Financial Transparency
To achieve end-to-end automation of the whole process, I first downloaded bundled reports as a ZIP file and extracted financial files with relevant data. Then, I ran feature extraction and financial information extraction for HTML files, processed and stored the information, and finally trained and tested an ensemble model to classify companies into five classes.
Previously, the client did all these steps manually, spending days processing and extracting the information. After the project was complete, the whole process took less than 10 minutes to run end-to-end and produce classification results.
Churn Prediction for Video Streaming Platform
Education
Master's Degree in Data Science/Computer Science
Aalto University - Espoo, Finland
Bachelor's Degree in Computer Engineering
National University of Sciences and Technology - Islamabad, Pakistan
Certifications
Introduction to Cybersecurity Tools & Cyber Attacks
IBM
Advanced Database Queries
New York University
Starting up
MinnaLearn
Neural Networks and Deep Learning
DeepLearning.AI
R Programming
Johns Hopkins University
Microsoft Certified: Azure Data Scientist Associate
Microsoft
Machine Learning for Data Science and Analytics
Columbia University
Skills
Libraries/APIs
Pandas, PyTorch
Tools
Microsoft Power BI, Power Query, Rundeck, PyCharm, MySQL Workbench, Slack, MATLAB, Tableau, Jira, Microsoft Report Builder
Languages
SQL, Python 3, Stored Procedure, C#, Python
Storage
Relational Databases, MySQL, Microsoft SQL Server, DB, Data Pipelines, Databases, RDBMS, SQL Server Management Studio (SSMS), SQL Stored Procedures, SQL Server DBA, Database Architecture, PostgreSQL, SQL Server Integration Services (SSIS), Azure Blobs
Paradigms
ETL, Business Intelligence (BI), Database Development, Automation, Azure DevOps
Platforms
Azure Functions, Azure, Pentaho, MacOS, Amazon Web Services (AWS), Databricks, SharePoint
Industry Expertise
Cybersecurity
Other
BI Reports, Microsoft Data Transformation Services (now SSIS), Machine Learning, Azure Databricks, Task Scheduling, Data Engineering, Data Architecture, DAX, APIs, Dashboards, Software Engineering, Artificial Intelligence (AI), Deep Learning, Entrepreneurship, Natural Language Processing (NLP), Computer Vision, Machine Learning Automation, Cloud, R Programming, Data Analysis, Metabase, Tableau Server, Data Warehouse Design, Amazon Redshift, Virtual Machines, Data Visualization, Generative Pre-trained Transformers (GPT), Azure Data Studio
How to Work with Toptal
Toptal matches you directly with global industry experts from our network in hours—not weeks or months.
Share your needs
Choose your talent
Start your risk-free talent trial
Top talent is in high demand.
Start hiring