Rajib Baruah
Verified Expert in Engineering
Data Developer
Macungie, PA, United States
Toptal member since April 28, 2022
Rajib is a senior data engineer with 23 years of experience in T-SQL coding and building SQL Server databases. He builds ETL data pipelines in the Azure cloud using Azure Data Factory (ADF) and on premises using SQL Server Integration Services (SSIS) to ingest and process data. An expert in Python and PySpark, he creates Databricks notebooks for data transformation and loading. With his extensive experience, Rajib will be an excellent addition to any team.
Preferred Environment
SQL, Python, PySpark, Azure Data Factory, SQL Server Integration Services (SSIS), T-SQL (Transact-SQL), Databricks, Microsoft SQL Server, ETL, Databases, Relational Databases, Azure, Azure SQL, Agile, Scrum, Data Modeling, ETL Development
The most amazing...
...data engineering solution I’ve developed prepares raw historical data that data scientists use to forecast the future pricing of online ads.
Work Experience
Data Engineer
Assurant
- Created data pipelines using Azure Data Factory (ADF), Databricks, Python, and PySpark. Converted on-premises ETL processes into ADF pipelines for scalability.
- Designed relational and denormalized star schema-based databases and built multiple SQL databases. Wrote extensive T-SQL code in the form of stored procedures, functions, views, and ad hoc SQL scripts.
- Handled ongoing performance tuning of SQL databases and T-SQL code. Identified and resolved performance-related issues in production.
- Created many SQL Server Integration Services (SSIS) packages to load and process inbound files from external vendors and other internal systems. Also created SSIS packages to transform data and generate outbound file feeds.
- Interacted with end customers and business analysts to get a thorough understanding of the business requirements and deliver the right solutions.
- Created several Power BI and SQL Server Reporting Services (SSRS) reports for business users.
- Oversaw a team of engineers working on the project and program launch.
DBA | Data Architect
Innovative Control Systems
- Designed relational and star schema-based databases for the online transaction processing (OLTP) and reporting systems, respectively.
- Created the first SQL Data Warehouse for the company from the relational SQL Server database, providing the company's decision-makers with easy, almost real-time access to the sales data at the regional and store levels.
- Set up SQL Server replication from the stores to the central database. Created SSIS packages to load data from the OLTP database to the SQL Data Warehouse.
- Built cubes using SQL Server Analysis Services (SSAS), enabling the generation of sales and activity reports by time and location.
- Maintained databases, handling various actions, such as backup and restore, indexing, performance tuning, and monitoring.
Consultant | Data Architect
RSG Media
- Created SSIS packages to process raw impression data, transform it into structured relational data, and load it into the SQL Server database that data scientists and analysts use to generate future pricing and demand predictions.
- Collaborated with data scientists to understand their data needs and design the database accordingly.
- Oversaw and mentored two database developers in the team.
Lead DBA
KGB.com
- Designed, built, and maintained multiple databases for the company's application used by call center agents, their QA monitoring system, and their reporting database.
- Created a data warehouse on top of the OLTP databases that contained the call center activities of the directory assistance business, generating almost real-time reports on the call volumes.
- Implemented database partitioning to speed up search performance on a database table with over a billion records. The goal was to keep all searches under one second, helping the company process more calls with fewer call center agents.
- Set up real-time offshore-to-onshore replication between SQL Server databases.
- Handled ongoing performance tuning, indexing, database backup and restore, and log shipping.
Experience
Data Pipelines in Azure and Sync On-prem Database
Shipping Terminal Management System
Automated Enrollment Correction
ETL for Predictive Analysis
Education
Engineer's Degree in Computer Science
Maulana Azad National Institute of Technology Bhopal - Bhopal, India
Certifications
Microsoft Certified: Azure AI Fundamentals
Microsoft
Microsoft Certified: Azure Data Fundamentals
Microsoft
Skills
Libraries/APIs
PySpark
Tools
Microsoft Excel, GitHub, Microsoft Power BI, SSAS
Languages
SQL, Python, T-SQL (Transact-SQL), Excel VBA
Paradigms
ETL, Database Design, Agile, Scrum, Business Intelligence (BI)
Platforms
Databricks, Azure, Azure Synapse
Storage
SQL Server Integration Services (SSIS), Microsoft SQL Server, Databases, Relational Databases, Azure SQL Databases, SQL Performance, Azure SQL, Data Pipelines, Database Management, Azure Cosmos DB, SQL Server Reporting Services (SSRS), SQL Server Analysis Services (SSAS)
Frameworks
Apache Spark, .NET
Industry Expertise
Insurance, Telecommunications
Other
Azure Data Factory, Data Engineering, Data Analysis, Data Modeling, ETL Development, Media, Transportation & Shipping, Excel 365, Data Warehousing, Software Development, Software Architecture, Azure Data Lake, Blob Storage, Messaging, Electronic Data Interchange (EDI), MSMQ, Data Visualization, CSV