Govind Tiwari, Developer in Pune, Maharashtra, India
Govind is available for hire
Hire Govind

Govind Tiwari

Bio

Govind is a results-oriented data engineer and scientist with over five years of industry experience, including four years of experience working with Snowflake and AWS cloud platforms. With strong proficiency in Python, SQL, and machine learning techniques, he builds and maintains scalable data pipelines and automation workflows. Leveraging cloud-native solutions, Govind optimizes system performance and ensures the delivery of reliable, high-quality data.

Portfolio

Morgan Stanley
Data Engineering, Databricks, Amazon S3 (AWS S3), Snowflake, Teradata, Tableau...
TIAA
Python, SQL, Snowflake, Machine Learning
KPMG
Alteryx, SQL, Python, Data Engineering

Experience

  • Python - 6 years
  • SQL - 6 years
  • Cloud - 6 years
  • Snowflake - 5 years
  • Data Engineering - 4 years
  • APIs - 3 years
  • ETL - 3 years
  • Machine Learning - 3 years

Preferred Environment

Windows 10, Python, SQL, Cloud, APIs, Snowflake, Data Engineering, Machine Learning, Apache Airflow, ETL

The most amazing...

...solution I've built is a Python-based ETL scheduler that automates data pipelines and ensures reliable, on-schedule processing without manual intervention.

Work Experience

Data Scientist

2026 - 2026
Morgan Stanley
  • Architected and delivered an end-to-end data POC on Databricks, covering ingestion, transformation, and analytics; defined data models, optimized storage (Delta), and ensured scalable pipeline design.
  • Engineered robust ingestion pipelines for Teradata to Databricks and Snowflake to Databricks, utilizing Databricks Volumes and Azure Data Lake Storage (ADLS); handled schema mapping, data validation, and performance tuning.
  • Automated end-to-end data engineering workflows using Databricks Genie and internal ingestion frameworks, including job orchestration, table creation, dependency management, and reusable pipeline templates.
  • Built an AI-enabled analytics layer using Snowflake Cortex and Streamlit, developing interactive dashboards with conversational query capabilities; replicated Genie-like experience and enabled business-friendly data exploration.
Technologies: Data Engineering, Databricks, Amazon S3 (AWS S3), Snowflake, Teradata, Tableau, Data Scientist, SQL, Amazon SageMaker, Cloud Architecture

Data Engineer

2023 - 2025
TIAA
  • Developed and maintained data pipelines in Snowflake, combining raw information from diverse sources, enhancing data accuracy by 20%.
  • Engineered and deployed a data quality framework in Snowpark, improving data validation and reducing inconsistencies by 25%.
  • Managed data pipelines in AWS and dbt, improving data accuracy by 20%.
  • Designed machine learning models for mutual funds and retirement products using Snowflake Python and scikit-learn.
  • Optimized data storage and retrieval by leveraging Snowflake features such as clustering, partitioning, and caching.
Technologies: Python, SQL, Snowflake, Machine Learning

Analyst Engineer

2021 - 2023
KPMG
  • Presented data-driven recommendations to stakeholders, resulting in a 15% increase in project efficiency. Built Alteryx models for data processing, reducing preparation time by 20%.
  • Designed and maintained Snowflake databases to support large-scale financial analytics, leveraging expertise in visualizing and manipulating big datasets and using data replication for secure cross-account sharing and failover management.
  • Communicated with IT contacts and business partners to design, develop, and troubleshoot end-to-end technical solutions, and coordinated data migration processes using ETL tools such as Airflow and AWS Glue.
  • Created Alteryx models for data preprocessing and workflow management, and developed databases, tables, views, stored procedures, functions, common table expressions (CTEs), and indexes as needed.
Technologies: Alteryx, SQL, Python, Data Engineering

Data Analyst

2020 - 2021
SK Fincorp
  • Supported users by developing comprehensive documentation and occasionally performed ad hoc reporting to address specific business questions from upper management.
  • Coordinated test data creation, system testing, and documentation across all phases of ETL data migration processes.
  • Automated ETL pipelines from Oracle and Microsoft SQL Server to Snowflake using Airflow and dbt.
  • Reduced processing time by 40% through incremental data loading and transformation.
Technologies: APIs, Data Engineering, Cloud, Apache Airflow, OLAP, Snowflake, Snowpark

Experience

Data Quality Control Framework with Snowpark

I designed and implemented automated data quality checks across over 120 pipelines, resulting in a 25% reduction in processing errors. This involved comprehensive validation of data accuracy, consistency, the absence of duplicate records, referential integrity, uniqueness, and overall data validity, ensuring that business-critical analytics and reporting were reliable and error-free. The framework improved operational efficiency, minimized manual intervention, and enhanced confidence in decision-making across the organization.

ETL Scheduler

I developed a Python-based ETL scheduler to automate data pipelines, ensuring reliable, on-schedule ETL without manual intervention. I also integrated Snowflake and Tableau to streamline data availability, increasing access by 40% and supporting faster, data-driven decision-making.

Sales Analytics Pipeline

I automated ETL pipelines from Oracle and Microsoft SQL Server to Snowflake using Airflow and dbt, and designed and maintained Snowflake databases to support large-scale financial analytics. I also optimized data processing by implementing incremental loading and transformation, resulting in a 40% reduction in overall processing time.

Education

2015 - 2019

Bachelor's Degree in Electrical Engineering

Rajasthan Technical University (RTU) - Jaipur, India

Skills

Libraries/APIs

Snowpark

Tools

Amazon SageMaker, Apache Airflow, dbt Cloud, Tableau

Languages

SQL, Snowflake, Python

Paradigms

ETL, OLAP

Platforms

Alteryx, Oracle, Databricks

Storage

Amazon S3 (AWS S3), Teradata

Other

Cloud, Data Engineering, Cloud Architecture, Windows 10, APIs, Machine Learning, Electrical Engineering, Simulations, Data Scientist

Collaboration That Works

How to Work with Toptal

Toptal matches you directly with global industry experts from our network in hours—not weeks or months.

1

Share your needs

Discuss your requirements and refine your scope in a call with a Toptal domain expert.
2

Choose your talent

Get a short list of expertly matched talent within 24 hours to review, interview, and choose from.
3

Start your risk-free talent trial

Work with your chosen talent on a trial basis for up to two weeks. Pay only if you decide to hire them.

Top talent is in high demand.

Start hiring