Arnab Saha
Verified Expert in Engineering
SQL Developer
Arnab is a business intelligence professional with experience in analyzing, designing, and developing large-scale data warehouses with SQL and NoSQL, business intelligence solutions, ETL, and Reporting solutions. As a Microsoft certified solution expert, he builds end-to-end business intelligence systems to solve complex data analytics problems. Having strong analytical, problem-solving, and troubleshooting skills, Arnab enjoys handling assignments for the entire project lifecycle.
Portfolio
Experience
Availability
Preferred Environment
SQL Server Data Tools (SSDT), SQL, Azure Data Factory, Business Intelligence (BI), Microsoft Power BI, Azure SQL, Microsoft Azure, ETL, Azure Data Lake, Microsoft Power Apps, Data Analysis Consultant, Data Management, Data Warehousing, Performance Tuning, Snowflake, Analytics, Back-end
The most amazing...
...thing I've developed is an enterprise data warehouse for Assurant, a leading real estate insurance company, to track their business in a single platform.
Work Experience
Principal Data and Business Intelligence Consultant
Neudesic
- Collaborated with the client to identify existing data flow and successfully migrated whole data processing layer to Azure environment, processing more than 4 TB of data.
- Used the Agile process while handling all projects and Azure DevOps as a scrum board. Worked with the scrum master to do story pointing and to track bugs. Used various source control systems and the CI/CD process for deployment.
- Contributed to several organization-wide initiatives, created a logging framework for on-premise SSIS to monitor each pipeline's status—a PnP component that helps other projects—and created a framework to load data to ADLS.
- Maintained the hiring process by working with the recruitment team and conducted more than 200 screening and selection interviews.
- Mentored several colleagues and freshers on data and SQL to help them gain technical knowledge and work with the client.
Business Intelligence/Data Engineer Consultant
Freelance
- Worked with team to help create an OLAP data warehouse. Made different dimensions and facts. Developed and deployed SSIS packages for data movement. Also worked on audit framework to handle error/ failure.
- Used Azure DevOps VSTO for project management. Used Git/TFS for source control and created user stories/tasks to track progress in the sprint.
- Trained team on Azure Data resources, like ADF, Azure SQL, Databricks, Synapse, and more, and made them proficient at working in projects.
- Created a tabular model and deployed it in the Azure Analysis service. Also worked on different refresh models and roles (for security).
- Worked on Power BI reports to create different visuals and published them in the workspace to share among stakeholders.
System Analyst
Eastman Chemicals India Pvt Ltd
- Designed and created a data mart to store entry and exit time for contractors, helping them successfully verify time and payment system (actual and business time).
- Followed the Agile process and used the TFS scrum board to handle daily activities and track bugs.
- Partnered with the recruitment team and conducted interviews for screening in the BI domain.
Software Engineer
MAQ Software, LLC.
- Learned about SQL and MSBI tools and successfully delivered several projects. Communicated with the team lead and client to understand the requirements.
- Took several Microsoft courses on SQL and BI and became a Microsoft Certified Solution Expert in business intelligence, SQL Server, and data warehouse solutions.
- Gained training on the Agile process and was accustomed to Sprint, Scrum, and different terms of Agile.
Experience
Assurant Property Insurance
I was a part of the team that got daily files of various sizes from different clients and created four logical divisions to process the data (pre-stage, stage, native, and SSP). We also verified the file quality and loaded the whole file in Prestage in a single column with different tables for each client. Then we bifurcated data, loaded the columns with Truncate, and pushed data. Finally, we performed various business transformations and loaded them into a single SSP layer. We used a logging framework to log the status of each pipeline to track it correctly. During the project delivery, we followed the Agile model, TFS/Azure DevOps as source control, and CI/CD process for deployment.
Migration of Data Process Layer (on Premise to Cloud Environment)
To migrate the whole processing layer, we mainly used the following cloud resources:
1. Azure Data Factory
2. Azure Databricks
3. Azure Data Lake Storage
4. Azure Event Hub
5. Azure Application Insights and Log Analytics
They have some streaming data sources (in Avro format), RDBMS tables, and File storage. We used Event Hub and Databricks to handle streaming and Data Factory as orchestration tools. We used a Databricks Notebook for all transformations activities, and using Notebook activity in ADF, invoked Databricks Notebook. We have been used Pipeline triggers (dependent triggers) and the scheduled trigger.
For the storage, we mainly used ADLS and created three logical divisions in the ADLS—stage, discovery, and insight layer. Through the insight layer, we connected with reporting tools for different insights and visualizations
Sales Insight
We designed and created a data warehouse (OLAP system) to store data from different channels in a unified fashion and used SQL (Azure SQL) as a storage model.
We have created different DB from the stage, native, and dimension modeling. Also, we created an audit framework to handle the error. As data size was massive, we created a separate tabular model and hosted it in the Azure Analysis service.
We connected the Power Bi report (Live Connection) with AAS, created different visuals for reporting purposes, and hosted in Power Bi service.
We use ADF as an orchestration pipeline, which will get the data from on-premise sources, then process through SP-Activity and refresh tabular model in AAS and send successful/failure alert through Azure Logic App.
Education
Bachelor's Degree in Computer Science
Heritage Institute of Technology - Kolkata, India
Certifications
Azure Databricks Developer Essentials
Microsoft
Designing Business Intelligence Solutions
Microsoft
Querying MS Server
Microsoft
Skills
Libraries/APIs
PySpark
Tools
Microsoft Power BI, Azure Logic Apps, Microsoft Power Apps
Languages
SQL, Python, Snowflake, C, C#
Storage
RDBMS, Azure SQL, Databases, Relational Databases, Microsoft SQL Server, Database Structure, SQL Server Integration Services (SSIS), SQL Server Reporting Services (SSRS), Azure SQL Databases, Database Architecture, Data Pipelines, MySQL, Data Lakes, Database Transactions, SQL Server Data Tools (SSDT), SQL Server Analysis Services (SSAS), Azure Cosmos DB
Paradigms
Business Intelligence (BI), ETL, Database Design, High-performance Computing
Frameworks
Data Lakehouse, .NET
Platforms
Databricks, Azure, Microsoft Power Automate, Azure Synapse
Other
Data Engineering, Data Analysis Consultant, Data Management, Business Intelligence Consultant, Data Warehousing, Performance Tuning, Query Optimization, Azure Data Factory, Azure Data Lake, Agile Practices, MSBI, Data Transformation, Database Schema Design, Data Visualization, Dashboard Design, Dashboards, Data Analytics, Data Modeling, Data Analysis, Cloud Storage, Query Composition, Data Warehouse Design, Analytics, Back-end, BI Reports, Relational Database Design, Data Architecture, Excel 365, Transactions, Microsoft Azure, Azure Analysis Services, Big Data
How to Work with Toptal
Toptal matches you directly with global industry experts from our network in hours—not weeks or months.
Share your needs
Choose your talent
Start your risk-free talent trial
Top talent is in high demand.
Start hiring