Tarviha Fatima, Developer in Lahore, Punjab, Pakistan

Tarviha Fatima

Verified Expert in Engineering

Data Engineer and Developer

Location
Lahore, Punjab, Pakistan
Toptal Member Since
November 11, 2022

Tarviha is a seasoned data engineer with 3+ years of experience designing and developing robust, automated, and highly performant end-to-end data pipelines. She has a solid technical background, specializing in data extraction, cleaning, standardization, modeling, transformation, visualization, and analysis. Tarviha enjoys collaborative efforts and loves to be recognized as the go-to person by fellow team members.

Experience

Availability

Part-time

Preferred Environment

Python 3, SQL, Microsoft Power BI, Amazon Web Services (AWS), Apache Spark, Greenplum, Microsoft Excel, Tableau, Google BigQuery, Talend

The most amazing project I've automated and standardized is the bounty disbursement process for a finance team, an improvement that made me the top employee of the year.

Work Experience

Business Intelligence Developer (Freelance)

2022 - 2022
Freelance
  • Developed interactive visualizations, reports, and dashboards using Power BI to derive actionable insights for business decisions.
  • Developed complex calculated measures in Power BI using Data Analysis Expressions (DAX).
  • Participated in requirements gathering and data mapping sessions to outline business needs and assisted with existing client dashboards and data sources.
Technologies: Microsoft Power BI, Data Analysis, Data Visualization, Dashboard Design, Dashboard Development, Dashboards, Business Intelligence (BI), Databases, Data Processing, CSV, SQL Views, Data Architecture, Datasets, Analytics, Query Composition, DAX, Data

Data Engineer II

2021 - 2022
Afiniti
  • Developed and maintained an automated revenue calculation design and deployed the architecture for 38 clients using MySQL, PostgreSQL, MS SQL, Greenplum, Python, and Talend.
  • Designed and developed client performance and statistics dashboards for global visibility to the leadership using Microsoft Power BI.
  • Created a Python-based utility that migrates data from MySQL to Greenplum (PostgreSQL) in an optimized way using the PXF Reader and gpfdist services.
  • Contributed actively to the code migration from MySQL to PostgreSQL, converting approximately 5,000 lines of code from one system to another.
  • Automated and designed the process of employees' bounty distribution to assist the finance team in payroll tasks using MySQL, PostgreSQL, MS SQL, Python, Talend, and Microsoft Excel.
Technologies: Data Engineering, ETL, SQL, Microsoft Power BI, MySQL, Greenplum, Talend ETL, Microsoft SQL Server, Python 3, Data Analysis, Business Intelligence (BI), Data Warehousing, Data Modeling, Microsoft Excel, Apache Airflow, Talend, PostgreSQL, Command Prompt (CMD), Data Cleaning, Dimensional Modeling, Named-entity Recognition (NER), Data Visualization, Dashboard Design, Dashboard Development, Dashboards, ETL Tools, Databases, Data Transformation, Data Processing, Data Migration, Database Optimization, CSV, Python, Stored Procedure, SQL Stored Procedures, SQL Views, User-defined Functions (UDF), SQL Triggers, Database Triggers, Data Architecture, Database Migration, Data Pipelines, Datasets, Data Warehouse Design, Query Composition, Query Optimization, Performance Tuning, DAX, Pipelines, NumPy, Pandas, Data, GitHub, Reporting, APIs

Data Engineer I

2020 - 2021
Afiniti
  • Built a custom Python-based utility for module-based data transfers from multiple client servers to a global server over SMTP and FTP.
  • Optimized the model deployment process on the production environment, replacing the time-consuming stored procedures and Talend jobs with efficient MySQL views, triggers, and events.
  • Maintained and developed an optimized data pipeline for three North American clients using Talend, MySQL, PostgreSQL, and Python. The optimization shortened the process by approximately one hour.
Technologies: ETL, ETL Tools, Talend ETL, Python 3, SQL, MySQL, Microsoft SQL Server, PostgreSQL, Databases, Data Transformation, Data Cleaning, Data Processing, Data Modeling, Data Migration, Database Optimization, CSV, JSON, Python, Stored Procedure, SQL Stored Procedures, SQL Views, User-defined Functions (UDF), SQL Triggers, Database Triggers, Data Architecture, Data Pipelines, Datasets, Query Composition, Query Optimization, Pipelines, Data, GitHub, Reporting

Data Engineer

2019 - 2020
Xavor
  • Handled data pipeline implementation for a utility that fetches data from social platforms on user-defined topics and generates insights using Microsoft Power BI and AWS Cloud Computing Services, including S3, Redshift, Lambda, Glue, and API Gateway.
  • Played a significant part in data gathering, implementation, testing, and optimization of a Python-based utility to help generate autonomous intelligent suggestions for data analysis by understanding data semantics using artificial intelligence.
  • Performed day-to-day large-scale data transformations using Oracle SQL.
Technologies: Amazon Web Services (AWS), Oracle SQL, Microsoft Power BI, Python 3, Talend ETL, Microsoft SQL Server, SQL, Data Engineering, ETL, Business Intelligence (BI), Data Warehousing, Microsoft Excel, Amazon S3 (AWS S3), Amazon Redshift, AWS Lambda, Command Prompt (CMD), Data Cleaning, Named-entity Recognition (NER), Dimensional Modeling, Artificial Intelligence (AI), Machine Learning, Deep Learning, Neural Networks, Java, Data Visualization, Dashboard Development, Dashboard Design, Dashboards, ETL Tools, Databases, Data Transformation, Data Processing, Database Optimization, Amazon RDS, CSV, JSON, Python, Stored Procedure, SQL Stored Procedures, SQL Views, User-defined Functions (UDF), SQL Triggers, Database Triggers, Data Architecture, Data Pipelines, Datasets, Redshift, Query Composition, Query Optimization, Performance Tuning, SQL Server Integration Services (SSIS), Pipelines, BigQuery, NumPy, Pandas, Data, Reporting, APIs

MySQL to Greenplum (PostgreSQL) Data Migration

https://github.com/tarvihafatima/MySQL-to-GreenPlum-Data-Migration
I created a utility that allows efficient table creation and data transfer from a MySQL database platform to Greenplum (PostgreSQL) during platform migration using two Greenplum frameworks (PXF Reader and gpfdist). This utility simplifies data transfer by offering an easy-to-use interface and configuration setup.
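
As a rough illustration of the gpfdist path described above, the following minimal sketch builds the SQL that a loader might run on Greenplum: a readable external table pointing at CSV files served by gpfdist, followed by an insert into the target table. This is not the code from the repository; the function name, the `ext_` prefix, the host, the port, and the file pattern are all hypothetical, and it assumes the target table already exists and that the CSV files carry a header row.

```python
def build_gpfdist_load_sql(table, host, port=8080, pattern="*.csv"):
    """Return SQL statements that load gpfdist-served CSV files into `table`.

    Hypothetical helper for illustration only. Identifiers are interpolated
    without quoting, so it assumes trusted table and host names.
    """
    ext = f"ext_{table}"  # assumed naming convention for the staging table
    location = f"gpfdist://{host}:{port}/{pattern}"
    return [
        f"DROP EXTERNAL TABLE IF EXISTS {ext};",
        # Reuse the target table's column definitions for the external table.
        f"CREATE EXTERNAL TABLE {ext} (LIKE {table}) "
        f"LOCATION ('{location}') FORMAT 'CSV' (HEADER);",
        # Segments pull the files in parallel from gpfdist during this insert.
        f"INSERT INTO {table} SELECT * FROM {ext};",
    ]


statements = build_gpfdist_load_sql("orders", "etl-host", 8081, "orders_*.csv")
for statement in statements:
    print(statement)
```

In such a setup, the MySQL side would first export each table to CSV in the directory that gpfdist serves, and the statements above would then be executed through any PostgreSQL-compatible client.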

AI-based Data Discovery Tool

I was actively involved in an R&D project that aimed to generate autonomous, intelligent data analysis suggestions by understanding the semantics of unstructured data using artificial intelligence. The project's scope included data collection, cleaning, and labeling, as well as training, testing, and enhancing the models.

Indoor Food Growing System

This tool was made to assist people in growing plants and keeping them healthy with the help of artificial intelligence. This project's scope covered gathering positive and negative images of plants for datasets, training, testing, and improving AI models.

Traffic Violation Detection and Analysis System

The system is an AWS-based traffic violation detection, analysis, and suggestion tool capable of detecting vehicles and their type in traffic videos with 97% accuracy. In addition, this tool detects four types of traffic violations with an overall accuracy of 82%.

Education

2015 - 2019

Bachelor's Degree in Computer Science

National University of Computer and Emerging Sciences (FAST) - Lahore, Pakistan

2013 - 2015

High School Diploma in Pre-engineering

Kinnaird College - Lahore, Pakistan

Certifications

SEPTEMBER 2022 - PRESENT

Taming Big Data with Apache Spark and Python - Hands On!

Udemy

AUGUST 2022 - PRESENT

Apache Spark (TM) SQL for Data Analysts

Coursera

JANUARY 2022 - PRESENT

Working with BigQuery

Coursera

Languages

Python 3, SQL, Python, Stored Procedure, XML, Java, Scala

Tools

Microsoft Power BI, Talend ETL, Microsoft Excel, Tableau, BigQuery, Qlik Sense, Apache Airflow, Amazon Elastic MapReduce (EMR), Spark SQL, AWS Glue, Amazon QuickSight, Named-entity Recognition (NER), GitHub

Paradigms

ETL, Business Intelligence (BI), Dimensional Modeling

Platforms

Talend, AWS Lambda, Amazon Web Services (AWS), Amazon EC2, Databricks

Storage

Greenplum, MySQL, Microsoft SQL Server, PostgreSQL, JSON, SQL Stored Procedures, SQL Views, SQL Triggers, Database Triggers, Data Pipelines, Oracle SQL, Amazon S3 (AWS S3), Database Migration, Redshift, Databases, SQL Server Integration Services (SSIS)

Other

Data Analysis, Command Prompt (CMD), Data Cleaning, Data Visualization, ETL Tools, Data Transformation, Data Processing, CSV, User-defined Functions (UDF), Datasets, Query Composition, Query Optimization, Pipelines, Data, Data Engineering, Data Warehousing, Data Modeling, Google BigQuery, Dashboard Design, Dashboard Development, Dashboards, Data Migration, Database Optimization, Amazon RDS, Data Architecture, Data Cleansing, Analytics, Data Warehouse Design, DAX, Reporting, APIs, Amazon Redshift, Artificial Intelligence (AI), Machine Learning, Deep Learning, Neural Networks, Big Data Architecture, Data Profiling, Performance Tuning

Libraries/APIs

NumPy, Pandas, PySpark, Amazon Rekognition

Frameworks

Apache Spark, Spark
