
Bibeksheel Kaur
Verified Expert in Engineering
Data Engineer and Developer
Kalimpong, West Bengal, India
Toptal member since June 24, 2020
Bibeksheel has over 12 years of experience in the IT industry, specializing in the design, development, maintenance, and support of database applications across various platforms and cloud implementations. She is skilled in Microsoft SQL Server, SSIS, Azure Data Factory, Azure Databricks, and Azure Data Lake Storage, and proficient with Snowflake, AWS, and Amazon S3. A quick learner and self-motivated professional, Bibeksheel excels in multitasking and consistently delivers high-quality results.
Portfolio
Experience
- T-SQL (Transact-SQL) - 12 years
- ETL - 12 years
- PySpark - 6 years
- Databricks - 6 years
- Data Engineering - 6 years
- Microsoft Azure - 6 years
- Azure Databricks - 6 years
- Azure Data Factory (ADF) - 6 years
Availability
Preferred Environment
Microsoft Azure, Microsoft SQL Server, Amazon Web Services (AWS), PySpark, Azure Databricks, Azure Data Factory (ADF), Data Engineering, Data Warehousing, SQL, Databases
The most amazing...
...thing I've redesigned is the statement mailing process for a banking client to reduce the overall cost by $0.9 million.
Work Experience
Senior Data Engineer (via Toptal)
PepsiCo Global - PepsiCo International Limited
- Collaborated with stakeholders to understand business requirements and design effective data architectures. This involved creating data models, defining data storage strategies, and ensuring data quality and integrity.
- Implemented data ingestion pipelines to extract data from various sources, transform it, and load it into Azure data storage solutions.
- Developed data transformation workflows to clean, enrich, and aggregate data to perform data wrangling, integration, and orchestration tasks.
- Monitored data pipelines and workflows to ensure data integrity, performance, and reliability. Identified and resolved data ingestion, transformation, storage, and processing issues.
- Automated and optimized data solutions for cost-efficiency and performance.
- Collaborated with cross-functional teams, such as data scientists, business analysts, and software developers, to understand their needs and provide the required data solutions.
- Communicated effectively with stakeholders to gather requirements, provide updates, and present insights.
- Developed an ROI solution comprising marketing and sales data. It involves multiple data sources to help the business team understand net spending versus revenue for PepsiCo products.
- Built several dbt models and orchestrated pipelines using Airflow. Optimized DAGs and models two times to improve the overall execution time.
- Designed final reporting tables in Snowflake. Played an important role in the data vault architecture of initial tables and evolved models over time to satisfy business needs.
Technical Lead
HCL Technologies
- Migrated the existing on-prem ETL solutions residing locally to a global cloud platform. Implemented end-to-end data solutions (storage, integration, processing, and visualization) in Azure.
- Recreated the functionality of the current on-prem ETL solution by scripting the data transformations in Databricks using Spark SQL, PySpark, and Scala.
- Created end-to-end database solutions in accordance with business requirements. Optimized existing database solutions and enhanced the performance of databases.
- Created analytical reports for database solutions to help business users make future decisions accordingly.
- Supported various BAU activities such as defect prevention, change, incident, and problem management.
- Worked in an interactive Agile and DevOps environment.
Senior Systems Engineer
Infosys, Ltd.
- Redesigned the statement mailing process for the client to reduce the overall cost by US $0.9 million. Supported the critical process of sending the annual statements to 11 million savings account stakeholders.
- Optimized and reengineered the ETL solution to implement the travel insurance process for a client. Wrote and tuned stored procedures, subqueries, functions, triggers, and views to maintain referential integrity and implement complex business logic.
- Handled business queries and CSR and was involved in 120+ mart development and enhancement. Made recommendations for performance improvement in hosted databases involving partitioning, index creation, index removal, and index modification.
Experience
BI Modernization
ACCOMPLISHMENTS
• Used Azure Data Factory to build pipelines for extracting source data from SQL tables and FlatFiles.
• Rescripted the transformation logic by scripting it in Spark SQL, PySpark, and Scala.
• Recreated existing business logic and functionality using Azure stack.
• Designed, developed, and deployed the solution.
• Implemented end-to-end data solutions (storage, integration, processing, visualization) in Azure.
GSS-DW
- Migrate the existing ETL stack from SSIS to the Azure platform.
- Redesign the existing DW solution by writing it in Databricks and integrating it.
- Architect and implement medium to large scale BI solutions on Azure using Azure Data Platform services (Azure Data Lake, Data Factory, Azure SQL DW, and HDInsight/Databricks)
- Recreating existing application logic and functionality in the Azure Data Lake, Data Factory, SQL Database and SQL data warehouse environment. experience in DWH/BI project implementation using Azure DF.
- Propose architectures considering cost/spend in Azure and develop recommendations to right-size data infrastructure
- Implement end-to-end data solutions (storage, integration, processing, visualization) in Azure
- Migrate on-premise data (Oracle/SQL Server/Mainframe) to Azure Data Lake Storage using Azure Data Factory.
- Create dashboards on the delta tables in Azure Databricks and present the analytics per requirements
Ropes And Repairs
I populated the data in Azure tables (CosmosDB).
ACCOMPLISHMENTS
• Created Python scripts to refine the Mainframe historical files so that they are eligible for processing in Azure.
• Created Delta tables from the flat files in Azure Databricks and created a data model on top of it.
• Set up the ETL process for rendering monthly reports from Azure for future use.
FMG Data Warehouse
• Deployed and uploaded the SSRS reports to SharePoint Server for the end-users and involved in enhancements and modifications.
• Managed the subscription reports for the end users.
• Created databases and schema objects, including tables, indexes, and applied constraints.
• Connected various applications to the database, written functions, and stored procedures and triggers.
• Used SSIS transformations such as lookup, derived column, data conversion, aggregate, conditional splits, SQL tasks, script tasks, send mail tasks, and more.
• Used execution plans, SQL profiler, and database engine tuning advisor to optimize queries and enhance the performance of databases.
• Supported various BAU activities such as defect prevention, change management, incident management, and problem management.
• Monitored and made recommendations for performance improvement in hosted databases involving partitioning, index creation, index removal, index modification, and adding scheduled jobs to re-index and update database statistics.
• Monitored 100+ jobs and troubleshoot and reported in case of failure.
Flex Accounts Travel Insurance
Statement Efficiency
Team Tidbit
Corporate Analytical Warehouse
Aetna Quoting Centre (AQC)
Microsoft IT (MSIT)
• Made necessary updates and generated reports using SQL queries and worked in Active Directory.
• Created T-SQL stored procedures and traced and analyzed the stored procedures for performance and cost optimization.
• Developed tools in .NET to automate various infrastructure management service activities to deliver exceptional client service.
Education
Bachelor's Degree in Electronics and Communication Engineering
Panjab University - Chandigarh
Certifications
70-433 Microsoft SQL Server 2008, Database Development
Microsoft Technologies
70-461 Querying Microsoft SQL Server 2012
Microsoft Technologies
Skills
Libraries/APIs
PySpark
Tools
Spark SQL
Languages
T-SQL (Transact-SQL), SQL, Scala, Python, Java, Snowflake
Paradigms
Agile, ETL, DevOps, Azure DevOps
Platforms
Databricks, Azure PaaS, Azure, Amazon Web Services (AWS), Jakarta EE
Storage
SQL Server Integration Services (SSIS), SQL Server 2012, SQL Server Reporting Services (SSRS), Microsoft SQL Server, Azure SQL, Azure Table Storage, SQL Server 2008, Amazon S3 (AWS S3), Databases
Frameworks
Apache Spark
Other
Azure Data Lake, Data Warehousing, Data Analysis, Azure Data Factory (ADF), Microsoft Azure, Data Warehouse Design, Data Engineering, Azure Databricks, ServiceNow, Performance Tuning, Storage, Electronics
How to Work with Toptal
Toptal matches you directly with global industry experts from our network in hours—not weeks or months.
Share your needs
Choose your talent
Start your risk-free talent trial
Top talent is in high demand.
Start hiring