Chirag Dhawan, Developer in Dubai, United Arab Emirates

Chirag Dhawan

Verified Expert in Engineering

Data Engineer and Developer

Dubai, United Arab Emirates

Toptal member since November 24, 2022

Bio

Chirag is a data engineer with 5+ years of experience across industries such as healthcare, eCommerce, and banking. He is a quick learner with a knack for problem-solving who can be a driving force in busy teams, taking on whatever tasks a situation requires. He is also well-versed in patient-level data, making him an asset to healthcare teams. Chirag believes in making an impact and delivering value to his clients and customers from day one.

Portfolio

Amazon.com
Amazon Web Services (AWS), Python, Redshift, SQL
G42 Healthcare
Spark, PostgreSQL, SQL, Scala
ZS Associates
Spark, Python, Amazon Web Services (AWS), SQL, Apache Airflow

Experience

  • SQL - 4 years
  • Python - 4 years
  • Linux - 4 years
  • Git - 4 years
  • Redshift - 3 years
  • Amazon Web Services (AWS) - 3 years
  • Spark - 3 years
  • Docker - 1 year

Availability

Part-time

Preferred Environment

Linux, Redshift, Python, SQL, Spark, Amazon Web Services (AWS), Git, Docker

The most amazing...

...project I've worked on is setting up data pipelines with Spark at the core that can process 4+ TB of data in a few hours.

Work Experience

Data Engineer

2022 - PRESENT
Amazon.com
  • Developed, maintained, and optimized data pipelines that serve data to analytics teams.
  • Improved unit test coverage for existing pipelines using the pytest and unittest packages.
  • Gathered requirements from analysts and BI engineers and worked with stakeholders in an Agile environment to fulfill their requests within the defined SLA.
Technologies: Amazon Web Services (AWS), Python, Redshift, SQL

Data Engineer

2021 - 2022
G42 Healthcare
  • Identified areas for improvement and optimization in the in-house ETL application developed using Scala and Spark.
  • Implemented data pipelines and worked on performance tuning of existing pipelines using the in-house ETL application.
  • Analyzed and studied the source data, helping the team identify significant data quality issues at the source.
  • Developed an application using Scala to measure the data quality metrics for a given data set.
Technologies: Spark, PostgreSQL, SQL, Scala

Senior Analyst | Data Engineering

2018 - 2021
ZS Associates
  • Developed pipelines for processing high-volume patient-level data sets using Spark on EMR clusters, with the processes orchestrated by Apache Airflow.
  • Handled a dashboard's back end, which involved creating data warehouse tables with pandas, a Python library, from flat files stored in Amazon S3.
  • Collaborated on the front- and back-end development of a web-based dashboard. The application's tech stack was AngularJS and C3.js for visualizations, with Flask and pandas on the back end.
  • Developed a deep understanding of patient-level data sets and OHDSI tools, which helped me act as an SME for such data sets within the team.
  • Defined business rules for transforming these data sets into a common data model.
Technologies: Spark, Python, Amazon Web Services (AWS), SQL, Apache Airflow

Software Engineer

2017 - 2018
Newgen Software Technologies Limited
  • Developed banking systems that digitized processes such as account opening and the processing of inward SWIFT messages.
  • Developed a data archival module for a bank that archived all documents after specific timeframes, moving them from client servers to EMC Centera storage.
  • Handled direct communication with the client to gather requirements.
  • Assisted the client with user acceptance testing (UAT) and deployment on UAT and production servers.
  • Frequently led on-call support and collaborated with client teams to help resolve production issues within the defined SLA.
Technologies: Java, Jakarta Server Pages (JSP)

Experience

Unified Data Model

I developed pipelines for processing patient-level data sources using Spark on EMR clusters, with the processes orchestrated by Apache Airflow. One of these pipelines processed 4+ TB of data in a few hours. I also analyzed these data sources to define business rules for standardizing them into a common data model.

Education

2013 - 2017

Bachelor's Degree in Information Technology

Jaypee Institute of Information Technology - Noida, India

Skills

Tools

Git, Amazon Athena, AWS Glue, PyCharm, DataGrip, Apache Airflow, Amazon Elastic MapReduce (EMR)

Languages

SQL, Python, Scala, Java

Paradigms

ETL

Storage

Relational Databases, Redshift, Databases, PostgreSQL

Frameworks

Spark, Jakarta Server Pages (JSP)

Platforms

AWS Lambda, Windows, Linux, Amazon Web Services (AWS), Docker

Other

Data Modeling, Software Development, Operating Systems, Computer Networking
