Tarviha Fatima
Verified Expert in Engineering
Data Engineer and Developer
Lahore, Punjab, Pakistan
Toptal member since November 11, 2022
Tarviha is a seasoned data engineer with 3+ years of experience designing and developing robust, automated, and highly performant end-to-end data pipelines. She has a solid technical background, specializing in data extraction, cleaning, standardization, modeling, transformation, visualization, and analysis. Tarviha enjoys collaborative efforts and loves to be recognized as the go-to person by fellow team members.
Portfolio
Experience
- Microsoft Power BI - 3 years
- Python 3 - 3 years
- SQL - 3 years
- ETL - 3 years
- Data Analysis - 3 years
- Business Intelligence (BI) - 3 years
- Data Engineering - 3 years
- Talend - 3 years
Availability
Preferred Environment
Python 3, SQL, Microsoft Power BI, Amazon Web Services (AWS), Apache Spark, Greenplum, Microsoft Excel, Tableau, Google BigQuery, Talend
The most amazing...
...project I've automated and standardized is the bounty disbursement process for a finance team, an improvement that made me the top employee of the year.
Work Experience
Business Intelligence Developer (Freelance)
Freelance
- Developed interactive visualizations, reports and dashboards using Power BI to derive actionable insights for business decisions.
- Developed complex calculated measures in Power BI using Data Analysis Expression language (DAX).
- Participated in requirements gathering and data mapping sessions to underline business needs and assisted in existing client dashboards and data sources.
Data Engineer ||
Afiniti
- Developed and maintained an automated revenue calculation design and deployed the architecture on 38 clients. I used MySQL, PostgreSQL, MS SQL, Greenplum, Python, and Talend.
- Designed and developed client performance and statistics dashboards for global visibility to the leadership using Microsoft Power BI.
- Created a Python-based utility that migrates the data from MySQL to Greenplum (PostgreSQL) in an optimized way using PFX Reader and gpfdist services.
- Contributed actively to the code migration from MySQL to PostgreSQL, converting approximately 5,000 lines of code from one system to another.
- Automated and designed the process of employees' bounty distribution to assist the finance team in payroll tasks using MySQL, PostgreSQL, MS SQL, Python, Talend, and Microsoft Excel.
Data Engineer |
Afiniti
- Built a Python-based custom utility for module-based data transfers from multiple client servers to a global server using SMTP and FTP mediums.
- Optimized the model deployment process on the production environment, replacing the time-consuming stored procedures and Talend jobs with efficient MySQL views, triggers, and events.
- Maintained and developed an optimized data pipeline for three North American clients using Talend, MySQL, PostgreSQL, and Python. This optimization resulted in saving approximately one hour in the process.
Data Engineer
Xavor
- Handled data pipeline implementation for a utility that fetches data from social platforms on user-defined topics and generates insights using Microsoft Power BI and AWS Cloud Computing Services, including S3, Redshift, Lambda, Glue, and API Gateway.
- Played a significant part in data gathering, implementation, testing, and optimization of a Python-based utility to help generate autonomous intelligent suggestions for data analysis by understanding data semantics using artificial intelligence.
- Performed day-to-day large-scale data transformations using Oracle SQL.
Experience
MySQL to Greenplum (PostgreSQL) Data Migration
https://github.com/tarvihafatima/MySQL-to-GreenPlum-Data-MigrationAI-based Data Discovery Tool
Indoor Food Growing System
Traffic Violation Detection and Analysis System
Education
Bachelor's Degree in Computer Science
National University of Computer and Emerging Sciences (FAST) - Lahore, Pakistan
High School Diploma in Pre-engineering
Kinniard College - Lahore, Pakistan
Certifications
Taming Big Data with Apache Spark and Python - Hands On!
Udemy
Apache Spark (TM) SQL for Data Analysts
Coursera
Working with BigQuery
Coursera
Skills
Libraries/APIs
NumPy, Pandas, PySpark, Amazon Rekognition
Tools
Microsoft Power BI, Talend ETL, Microsoft Excel, Tableau, BigQuery, Qlik Sense, Apache Airflow, Amazon Elastic MapReduce (EMR), Spark SQL, AWS Glue, Amazon QuickSight, Named-entity Recognition (NER), GitHub
Languages
Python 3, SQL, Python, Stored Procedure, XML, Java, Scala
Paradigms
ETL, Business Intelligence (BI), Dimensional Modeling
Platforms
Talend, AWS Lambda, Amazon Web Services (AWS), Amazon EC2, Databricks
Storage
Greenplum, MySQL, Microsoft SQL Server, PostgreSQL, JSON, SQL Stored Procedures, SQL Views, SQL Triggers, Database Triggers, Data Pipelines, Oracle SQL, Amazon S3 (AWS S3), Database Migration, Redshift, Databases, SQL Server Integration Services (SSIS)
Frameworks
Apache Spark, Spark
Other
Data Analysis, Command Prompt (CMD), Data Cleaning, Data Visualization, ETL Tools, Data Transformation, Data Processing, CSV, User-defined Functions (UDF), Datasets, Query Composition, Query Optimization, Pipelines, Data, Data Engineering, Data Warehousing, Data Modeling, Google BigQuery, Dashboard Design, Dashboard Development, Dashboards, Data Migration, Database Optimization, Amazon RDS, Data Architecture, Data Cleansing, Analytics, Data Warehouse Design, DAX, Reporting, APIs, Amazon Redshift, Artificial Intelligence (AI), Machine Learning, Deep Learning, Neural Networks, Big Data Architecture, Data Profiling, Performance Tuning
How to Work with Toptal
Toptal matches you directly with global industry experts from our network in hours—not weeks or months.
Share your needs
Choose your talent
Start your risk-free talent trial
Top talent is in high demand.
Start hiring