Sumati Jain, Developer in Delhi, India
Sumati is available for hire
Hire Sumati

Sumati Jain

Verified Expert  in Engineering

Bio

Sumati is a seasoned data engineer with over 13 years of IT consulting experience, specializing in the data quality (DQ) tool Ataccama. He has a proven track record of creating bespoke solutions for banking customers that meet complex data management needs and ensure strict compliance with regulatory requirements. With a keen eye for detail and a passion for excellence, Sumati is a go-to expert for all things data.

Availability

Part-time

Preferred Environment

SQL, Ataccama, Collibra, ETL, Data Engineering, Data Quality, Data Management, Google Cloud Platform (GCP), Cloud Architecture, Data Architecture

The most amazing...

...thing I've done is lead the end-to-end implementation of a DQ reporting layer and integrate the summarized DQ results and associated metadata into Collibra.

Work Experience

Technical Architect – Data Management

2016 - PRESENT
Tata Consultancy Services
  • Increased Ataccama adoption by 40%. Developed and implemented a DQ reporting layer through ETL batch processing and integrated the results into Collibra—empowering application and data owners across five business units.
  • Designed a DQ framework in Ataccama that complies with the enterprise data governance model, including data stewardship, user activity monitoring, business metadata maintenance, and handling human-centered design, reporting, and monitoring.
  • Helped two business units confidently archive their databases of eight applications by leading the design, development, and implementation of a self-serviceable data discovery solution in Power BI using the application's data profiling results.
Technologies: SQL, Data Engineering, Ataccama, Collibra, Google Cloud, ETL, Data Quality

Ataccama Integration with Collibra

Increased Ataccama adoption by 40% by empowering data and application owners across five business units.

The client used Ataccama and Collibra as strategic tools for DQ and metadata management, respectively. As part of the data foundations program, I ensured that each business unit complied with centrally laid data management principles. One of the requirements was to publish data quality rules and results metadata for critical data elements to Collibra.

I created a proposal and implemented a DQ reporting layer through data engineering pipelines built using the Ataccama IDE. The IDE fetched data from a highly normalized Ataccama operational database into a denormalized dimensional schema using data warehousing methodologies. This enabled end users to create bespoke data visualization using tools of their choice and acted as an intermediate stage before data gets pushed to Collibra.

Additionally, I used Collibra exposed API endpoints to push this denormalized data from the reporting layer while maintaining proper business metadata lineage. This demonstrated data quality at various levels, from the physical table to the data domain level, helping data owners, application owners, and data stewards keep track of data quality.

Data Quality Dashboard for Insurance Business Units

A Power BI-based data quality dashboard whose purpose is to give a summary of data quality metrics for hosts of applications used by insurance business units in a top British financial institution.

This dashboard aimed to give a holistic understanding of data quality by summarizing results measured across various DQ dimensions like completeness, validity, accuracy, uniqueness, etc. while providing a view of DQ over time. This was built on top of a reporting layer that captured historical results of DQ evaluations performed by data asset stewards using Ataccama.

Further, multiple reports were built to provide drill-down capability by allowing users to look into the details of DQ rules that were used, various glossary terms that are tagged to it indicating critical data elements (CDE), total records that were processed, and number of passed and failed records.
JUNE 2022 - JUNE 2024

Google Cloud Certified – Professional Cloud Architect

Google Cloud

JANUARY 2022 - PRESENT

Ataccama ONE v13 | ONE Desktop Core

Ataccama

Tools

Microsoft Power BI, Collibra, Power BI Desktop

Languages

SQL

Paradigms

ETL, Business Intelligence (BI)

Platforms

Ataccama, Google Cloud Platform (GCP)

Storage

Google Cloud

Other

Data Engineering, Data Quality, Data Management, Cloud Architecture, Data Analytics, Data Reporting, Data Analysis, Data Visualization, Dashboards, Data Architecture, Data Profiling, Data Transformation, Data Modeling, Data Warehousing, Data Governance

Collaboration That Works

How to Work with Toptal

Toptal matches you directly with global industry experts from our network in hours—not weeks or months.

1

Share your needs

Discuss your requirements and refine your scope in a call with a Toptal domain expert.
2

Choose your talent

Get a short list of expertly matched talent within 24 hours to review, interview, and choose from.
3

Start your risk-free talent trial

Work with your chosen talent on a trial basis for up to two weeks. Pay only if you decide to hire them.

Top talent is in high demand.

Start hiring