Sumati Jain
Verified Expert in Engineering
Data Engineer and Developer
Delhi, India
Toptal member since February 19, 2024
Sumati is a seasoned data engineer with over 13 years of IT consulting experience, specializing in the data quality (DQ) tool Ataccama. He has a proven track record of creating bespoke solutions for banking customers that meet complex data management needs and ensure strict compliance with regulatory requirements. With a keen eye for detail and a passion for excellence, Sumati is a go-to expert for all things data.
Preferred Environment
SQL, Ataccama, Collibra, ETL, Data Engineering, Data Quality, Data Management, Google Cloud Platform (GCP), Cloud Architecture, Data Architecture
The most amazing...
...thing I've done is lead the end-to-end implementation of a DQ reporting layer and integrate the summarized DQ results and associated metadata into Collibra.
Work Experience
Technical Architect – Data Management
Tata Consultancy Services
- Increased Ataccama adoption by 40%. Developed and implemented a DQ reporting layer through ETL batch processing and integrated the results into Collibra—empowering application and data owners across five business units.
- Designed a DQ framework in Ataccama that complies with the enterprise data governance model, covering data stewardship, user activity monitoring, business metadata maintenance, human-centered design, reporting, and monitoring.
- Helped two business units confidently archive the databases of eight applications by leading the design, development, and implementation of a self-service data discovery solution in Power BI based on the applications' data profiling results.
Experience
Ataccama Integration with Collibra
The client used Ataccama and Collibra as strategic tools for DQ and metadata management, respectively. As part of the data foundations program, I ensured that each business unit complied with centrally defined data management principles. One of the requirements was to publish data quality rules and results metadata for critical data elements to Collibra.
I proposed and implemented a DQ reporting layer through data engineering pipelines built in the Ataccama IDE. The pipelines loaded data from a highly normalized Ataccama operational database into a denormalized dimensional schema using data warehousing methodologies. This enabled end users to create bespoke data visualizations using tools of their choice and served as an intermediate stage before the data was pushed to Collibra.
Additionally, I used Collibra's exposed API endpoints to push this denormalized data from the reporting layer while maintaining proper business metadata lineage. This surfaced data quality at various levels, from the physical table to the data domain level, helping data owners, application owners, and data stewards keep track of data quality.
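As a rough illustration of the denormalization step described above, the sketch below flattens a small normalized model into one wide reporting table. It uses SQLite only so that it runs anywhere. The table and column names (dq_rule, dq_asset, dq_run, dq_report) are hypothetical and do not reflect Ataccama's actual operational schema or the client's reporting layer.

```python
import sqlite3

# Hypothetical normalized source tables; NOT Ataccama's actual schema.
SOURCE_DDL = """
CREATE TABLE dq_rule  (rule_id INTEGER PRIMARY KEY, rule_name TEXT, dimension TEXT);
CREATE TABLE dq_asset (asset_id INTEGER PRIMARY KEY, table_name TEXT, data_domain TEXT);
CREATE TABLE dq_run   (rule_id INTEGER, asset_id INTEGER, run_date TEXT,
                       passed INTEGER, failed INTEGER);
"""

# One wide row per rule evaluation, so BI tools can read it without joins.
DENORMALIZE_SQL = """
CREATE TABLE dq_report AS
SELECT a.data_domain, a.table_name, r.rule_name, r.dimension,
       n.run_date, n.passed, n.failed
FROM dq_run n
JOIN dq_rule  r ON r.rule_id  = n.rule_id
JOIN dq_asset a ON a.asset_id = n.asset_id;
"""


def build_report(conn: sqlite3.Connection) -> list:
    """Create the denormalized table and return its rows."""
    conn.executescript(DENORMALIZE_SQL)
    query = "SELECT * FROM dq_report ORDER BY table_name, rule_name"
    return conn.execute(query).fetchall()


def demo() -> list:
    """Load one made-up rule result and flatten it."""
    conn = sqlite3.connect(":memory:")
    conn.executescript(SOURCE_DDL)
    conn.execute("INSERT INTO dq_rule VALUES (1, 'email_not_null', 'completeness')")
    conn.execute("INSERT INTO dq_asset VALUES (10, 'customer', 'Party')")
    conn.execute("INSERT INTO dq_run VALUES (1, 10, '2024-01-31', 95, 5)")
    return build_report(conn)
```

Keeping the data domain and table name on every row is what would let the same table feed both table-level and domain-level views without further joins.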
Data Quality Dashboard for Insurance Business Units
This dashboard aimed to give a holistic understanding of data quality by summarizing results measured across DQ dimensions such as completeness, validity, accuracy, and uniqueness, while also providing a view of DQ over time. It was built on top of a reporting layer that captured historical results of DQ evaluations performed by data asset stewards using Ataccama.
Further, multiple reports were built to provide drill-down capability, allowing users to look into the details of the DQ rules that were used, the glossary terms tagged to them indicating critical data elements (CDEs), the total records processed, and the number of passed and failed records.
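The summary view described above can be sketched as a pass-rate roll-up per DQ dimension and evaluation date. This is a minimal illustration only: the record keys (dimension, run_date, passed, failed) are assumed for the example, and the actual dashboard was built with BI tooling rather than Python.

```python
from collections import defaultdict


def pass_rate_by_dimension(results):
    """Return {(dimension, run_date): pass rate in percent}.

    `results` is an iterable of dicts with hypothetical keys
    'dimension', 'run_date', 'passed', and 'failed'.
    """
    totals = defaultdict(lambda: [0, 0])  # key -> [passed, total processed]
    for row in results:
        key = (row["dimension"], row["run_date"])
        totals[key][0] += row["passed"]
        totals[key][1] += row["passed"] + row["failed"]
    # Skip groups with no processed records to avoid dividing by zero.
    return {
        key: round(100.0 * passed / total, 2)
        for key, (passed, total) in totals.items()
        if total > 0
    }


# Made-up rule results for illustration only.
sample = [
    {"dimension": "completeness", "run_date": "2024-01", "passed": 90, "failed": 10},
    {"dimension": "completeness", "run_date": "2024-01", "passed": 80, "failed": 20},
    {"dimension": "validity", "run_date": "2024-01", "passed": 50, "failed": 50},
]
rates = pass_rate_by_dimension(sample)
```

Summing passed and processed counts before dividing weights each rule by its record volume, which is one reasonable choice; averaging per-rule percentages would give a different figure.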
Certifications
Google Cloud Certified – Professional Cloud Architect
Google Cloud
Ataccama ONE v13 | ONE Desktop Core
Ataccama
Skills
Tools
Microsoft Power BI, Collibra, Power BI Desktop
Languages
SQL
Paradigms
ETL, Business Intelligence (BI)
Platforms
Ataccama, Google Cloud Platform (GCP)
Storage
Google Cloud
Other
Data Engineering, Data Quality, Data Management, Cloud Architecture, Data Analytics, Data Reporting, Data Analysis, Data Visualization, Dashboards, Data Architecture, Data Profiling, Data Transformation, Data Modeling, Data Warehousing, Data Governance