Raminder Singh Bagga, Developer in Bengaluru, Karnataka, India
Raminder is available for hire
Hire Raminder

Raminder Singh Bagga

Verified Expert  in Engineering

Data Engineer and Developer

Bengaluru, Karnataka, India

Toptal member since August 6, 2024

Bio

Raminder has over a decade of hands-on experience in end-to-end data engineering solutions, including cloud and on-premise services and API integrations. He specializes in AWS and Azure platforms, including EMR, Glue, Athena, and Amazon Managed Streaming for Apache Kafka (MSK). Raminder excels in guiding projects from initial estimations and design to implementing fault-tolerant, complex data pipelines for large-scale systems.

Portfolio

BNY Mellon
Spark, Scala, PySpark, EMR, Apache Kafka, Amazon Kinesis, AWS Glue...
Mindtree
Adobe Experience Platform, Amazon Web Services (AWS), Apache Airflow...

Experience

  • Spark - 10 years
  • Adobe Experience Platform - 10 years
  • Data Engineering - 10 years
  • Python - 10 years
  • Amazon Web Services (AWS) - 10 years
  • SQL - 10 years
  • Apache Kafka - 4 years
  • Apache Airflow - 4 years

Availability

Part-time

Preferred Environment

Apache Airflow, AWS Glue, EMR, Snowflake, Spark, Adobe Experience Platform, Python, SQL

The most amazing...

...thing I've created and implemented is a highly scalable data ingestion framework on AWS, handling 2TB+ of data monthly through hundreds of orchestrated jobs.

Work Experience

Senior Data Engineer

2021 - PRESENT
BNY Mellon
  • Created enterprise applications for data ingestion and processing using Spark with Scala and AWS services like EMR, Glue, Athena, and Amazon MSK. Orchestrated workflows using Apache Airflow, AWS Step Functions, Docker, and Workflow Engine.
  • Managed the data engineering team on custom integration projects in EMEA and APAC and handled estimations, architecture, design, development, and deployment strategies.
  • Led design and architectural discussions in markets around custom solutions that integrated applications and customers' enterprise applications while creating robust, fault-tolerant, real-time data processing.
  • Collaborated closely with data analysts and data scientists, fostering a multidisciplinary approach to data projects.
Technologies: Spark, Scala, PySpark, EMR, Apache Kafka, Amazon Kinesis, AWS Glue, Data Engineering, Adobe Experience Platform, SQL, Python, Apache Airflow, AWS Lambda

Module Lead

2017 - 2021
Mindtree
  • Crafted robust data pipelines for customer data platforms for companies' digital transformational goals using on-prem and cloud services for batch and real-time processing.
  • Created Spark scripts using Scala on low-level and high-level APIs as required.
  • Maintained multi-pipeline architecture design for data ingestion, ETL, and data analysis from different sources.
Technologies: Adobe Experience Platform, Amazon Web Services (AWS), Apache Airflow, Data Engineering, Spark, ETL, SQL

Experience

Custom Integration with Downstream Applications

Processed data on an allocated live 30-node Hadoop cluster. Key activities include:

• Understanding client requirements and preparing high-level designs.
• Creating Spark scripts using Scala with low- and high-level APIs, per the requirements.
• Maintaining a multi-pipeline architecture design for ingestion, ETL, and analysis of data from different sources.
• Leveraging Sqoop and Informatica for data ingestion for efficiency.
• Working collaboratively with the data science team to maintain data pipelines and automate consumption processes.

Integration with Adobe Experience Platform

I passionately developed and maintained multiple integrations with Adobe Experience Platform by building robust data pipelines for batch and real-time processing. The integration was done on both on-prem and cloud sources.

Education

2009 - 2013

Bachelor's Degree in Computer Science

Nagpur University - Nagpur, India

Certifications

JULY 2022 - PRESENT

Microsoft Certified: Azure Data Engineer Associate

Microsoft

JULY 2019 - PRESENT

AWS Certified Solutions Architect – Associate

Amazon Web Services

Skills

Libraries/APIs

PySpark

Tools

AWS Glue, Apache Airflow

Languages

Python, SQL, Snowflake, Scala

Frameworks

Spark

Platforms

Adobe Experience Platform, AWS Lambda, Amazon Web Services (AWS), Apache Kafka

Storage

Amazon S3 (AWS S3)

Paradigms

ETL

Other

Data Engineering, Azure Data Factory (ADF), Amazon Kinesis, Amazon Redshift, EMR, Big Data

Collaboration That Works

How to Work with Toptal

Toptal matches you directly with global industry experts from our network in hours—not weeks or months.

1

Share your needs

Discuss your requirements and refine your scope in a call with a Toptal domain expert.
2

Choose your talent

Get a short list of expertly matched talent within 24 hours to review, interview, and choose from.
3

Start your risk-free talent trial

Work with your chosen talent on a trial basis for up to two weeks. Pay only if you decide to hire them.

Top talent is in high demand.

Start hiring