Akash Roy, Developer in Bengaluru, India

Akash Roy

Verified Expert in Engineering

Data Engineer and Developer

Bengaluru, India

Toptal member since September 15, 2022

Bio

Akash is a technically sound Java developer with four years of experience designing, developing, and managing big data applications. Adept at readily gathering and translating complex requirements into viable solutions, Akash uses proven skills to design and write applications from conception to completion.

Portfolio

Airtel Africa
Hadoop, Spark, Java, Apache Maven, Git, Apache Hive, Apache Hudi, Presto...
Mobileum
Hadoop, Java, Spark, Presto, SQL, Yarn
MobiKwik
Spring, Hibernate, MySQL, Apache Maven, Git, REST

Experience

  • SQL - 6 years
  • Java - 6 years
  • Presto - 3 years
  • Hadoop - 3 years
  • Spark - 3 years
  • Apache Hive - 3 years
  • Docker - 1 year
  • Kubernetes - 1 year

Availability

Part-time

Preferred Environment

MacOS, IntelliJ IDEA, Slack, Gmail

The most amazing...

...project I've developed is Tesseract OLAP. It uses Spark and Presto to provide faster query runtime, less computational overhead, and a better user experience.

Work Experience

Senior Software Engineer

2020 - 2022
Airtel Africa
  • Computed aggregated datasets for an online analytical processing (OLAP) engine using Spark and added a cuboid partition column to store multidimensional aggregates in the same external Hive table.
  • Integrated internal APIs from Spark Structured Streaming into a Spark batch, achieving fault tolerance and exactly-once semantics.
  • Enhanced Presto Core to intercept an SQL query, analyze the dimensions it references, and rewrite it at runtime with an optimal cuboid predicate.
Technologies: Hadoop, Spark, Java, Apache Maven, Git, Apache Hive, Apache Hudi, Presto, Kubernetes, Docker

Software Engineer

2019 - 2020
Mobileum
  • Enhanced an in-house ETL framework for running a Spark job using JSON configuration files and added support for in-memory master management.
  • Analyzed data in PySpark and Presto using techniques such as row pattern matching. Devised rules to catch telecom fraud such as false answer supervision (FAS) and fixed-line fraud.
  • Implemented various user-defined functions (UDFs) and user-defined aggregate functions (UDAFs) to transform CADs into the required format. I used the HyperLogLogPlus data structure to create a unique count measure that can be merged at query time.
Technologies: Hadoop, Java, Spark, Presto, SQL, Yarn
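
The mergeable unique count mentioned above can be illustrated with a small sketch. This is not the HyperLogLogPlus implementation used in the project; it is a simplified plain HyperLogLog written for illustration, and the class name MiniHll, the register count, and the hash function are all assumptions. The point it shows is that two sketches built separately can be merged register by register, and the result matches a sketch built over the combined data.

```java
import java.nio.charset.StandardCharsets;

// Simplified HyperLogLog (illustrative stand-in, not HyperLogLogPlus).
class MiniHll {
    private static final int P = 12;        // 2^12 = 4096 registers (assumed size)
    private static final int M = 1 << P;
    private final byte[] registers = new byte[M];

    // FNV-1a over the UTF-8 bytes, followed by a 64-bit mixing step.
    private static long hash64(String s) {
        long h = 0xcbf29ce484222325L;
        for (byte b : s.getBytes(StandardCharsets.UTF_8)) {
            h ^= (b & 0xff);
            h *= 0x100000001b3L;
        }
        h ^= h >>> 33; h *= 0xff51afd7ed558ccdL;
        h ^= h >>> 33; h *= 0xc4ceb9fe1a85ec53L;
        h ^= h >>> 33;
        return h;
    }

    void offer(String value) {
        long h = hash64(value);
        int idx = (int) (h >>> (64 - P));   // top P bits select a register
        // Rank = position of the first 1-bit in the remaining bits, capped.
        int rank = Math.min(Long.numberOfLeadingZeros(h << P) + 1, 64 - P + 1);
        if (rank > registers[idx]) registers[idx] = (byte) rank;
    }

    // Merging keeps the larger rank per register, so it is order-independent.
    void merge(MiniHll other) {
        for (int i = 0; i < M; i++) {
            registers[i] = (byte) Math.max(registers[i], other.registers[i]);
        }
    }

    double estimate() {
        double sum = 0;
        int zeros = 0;
        for (byte r : registers) {
            sum += Math.pow(2, -r);
            if (r == 0) zeros++;
        }
        double alpha = 0.7213 / (1 + 1.079 / M);
        double raw = alpha * M * M / sum;
        // Small-range correction: fall back to linear counting.
        if (raw <= 2.5 * M && zeros > 0) return M * Math.log((double) M / zeros);
        return raw;
    }

    public static void main(String[] args) {
        MiniHll morning = new MiniHll();
        MiniHll evening = new MiniHll();
        for (int i = 0; i < 1000; i++) morning.offer("user-" + i);
        for (int i = 500; i < 1500; i++) evening.offer("user-" + i);
        morning.merge(evening);             // combine at "query time"
        System.out.println("approx distinct users: " + Math.round(morning.estimate()));
    }
}
```

Because only the per-register maximum is stored, partial sketches can be kept per partition and combined later without rescanning the raw rows.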

Software Developer

2018 - 2019
MobiKwik
  • Contributed to creating the Digital Gold feature using the MVC architecture, Spring, Hibernate, and MySQL. This feature enables users to buy and sell gold in real time.
  • Created an algorithm for a KYC flow and user onboarding to automatically blur sensitive information like identification numbers in government-issued IDs.
  • Wrote a script to onboard 100,000 merchants to the in-house mutual fund application. Onboarding a single merchant required filling out a form of around five pages, so we automated this flow with Selenium, which reduced manual effort.
Technologies: Spring, Hibernate, MySQL, Apache Maven, Git, REST

Experience

Tesseract OLAP

Tesseract OLAP is a modern-day engine designed to compute and query aggregated datasets.

I computed multidimensional aggregated datasets using a Spark batch job and stored them in the same external Hive table. A cuboid partition column identifies which partition stores the aggregates for which set of dimensions. I also employed Presto as the query layer.
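
A minimal sketch of this layout, in plain Java rather than Spark, may help make it concrete. Everything specific here is an assumption for illustration: the dimensions (country, plan), the revenue measure, the class name CuboidSketch, and the way a cuboid value is encoded as a comma-separated dimension list. The source does not state how the real table encodes cuboids.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

class CuboidSketch {
    // Hypothetical dimension names; not taken from the original project.
    static final String[] DIMS = {"country", "plan"};

    /**
     * Sums revenue for every subset of DIMS. The outer key plays the role of
     * the cuboid partition value; the inner key is the group-by key.
     * Each input row is {country, plan, revenue}.
     */
    static Map<String, Map<String, Long>> buildCuboids(List<String[]> rows) {
        Map<String, Map<String, Long>> cuboids = new TreeMap<>();
        // Each bit of mask decides whether the matching dimension is kept.
        for (int mask = 0; mask < (1 << DIMS.length); mask++) {
            List<String> kept = new ArrayList<>();
            for (int d = 0; d < DIMS.length; d++) {
                if ((mask & (1 << d)) != 0) kept.add(DIMS[d]);
            }
            String cuboid = kept.isEmpty() ? "ALL" : String.join(",", kept);
            Map<String, Long> groups = cuboids.computeIfAbsent(cuboid, k -> new TreeMap<>());
            for (String[] row : rows) {
                List<String> key = new ArrayList<>();
                for (int d = 0; d < DIMS.length; d++) {
                    if ((mask & (1 << d)) != 0) key.add(row[d]);
                }
                long revenue = Long.parseLong(row[DIMS.length]);
                groups.merge(String.join("|", key), revenue, Long::sum);
            }
        }
        return cuboids;
    }

    public static void main(String[] args) {
        List<String[]> rows = List.of(
            new String[] {"KE", "prepaid", "10"},
            new String[] {"KE", "postpaid", "5"},
            new String[] {"NG", "prepaid", "7"});
        buildCuboids(rows).forEach((cuboid, groups) ->
            System.out.println("cuboid=" + cuboid + " -> " + groups));
    }
}
```

All aggregates live in one structure keyed by the cuboid value, which mirrors storing every aggregation level in a single table partitioned by cuboid.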

I enhanced Presto Core to intercept an SQL query, analyze the dimensions it references, and rewrite it at runtime with an optimal cuboid predicate.
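
The rewrite step can be sketched roughly as follows. This is not Presto code and does not use Presto's parser or planner; it is a string-level illustration under several assumptions: the referenced dimensions are already known, "optimal" is taken to mean the smallest stored cuboid that covers them, the partition column is literally named cuboid, and the query has at most a WHERE and a GROUP BY clause. The class and method names are hypothetical.

```java
import java.util.List;
import java.util.Locale;
import java.util.Set;
import java.util.TreeSet;

class CuboidRewriteSketch {
    /** Picks the smallest available cuboid that contains every referenced dimension. */
    static String pickCuboid(Set<String> referenced, List<Set<String>> available) {
        Set<String> best = null;
        for (Set<String> candidate : available) {
            if (candidate.containsAll(referenced)
                    && (best == null || candidate.size() < best.size())) {
                best = candidate;
            }
        }
        if (best == null) {
            throw new IllegalArgumentException("no cuboid covers " + referenced);
        }
        // Assumed label format: sorted, comma-separated dimension names.
        return best.isEmpty() ? "ALL" : String.join(",", new TreeSet<>(best));
    }

    /** Adds "cuboid = '...'" to the WHERE clause, creating one if needed. */
    static String addPredicate(String sql, String cuboid) {
        String predicate = "cuboid = '" + cuboid + "'";
        String upper = sql.toUpperCase(Locale.ROOT);
        int where = upper.indexOf(" WHERE ");
        int tail = upper.indexOf(" GROUP BY ");
        if (tail < 0) tail = sql.length();
        if (where < 0) {
            return sql.substring(0, tail) + " WHERE " + predicate + sql.substring(tail);
        }
        int condStart = where + " WHERE ".length();
        // Parenthesize the existing condition so an OR inside it keeps its meaning.
        return sql.substring(0, condStart) + predicate + " AND ("
                + sql.substring(condStart, tail) + ")" + sql.substring(tail);
    }

    public static void main(String[] args) {
        List<Set<String>> available = List.of(
            Set.of(), Set.of("country"), Set.of("plan"), Set.of("country", "plan"));
        String sql = "SELECT country, SUM(revenue) FROM sales GROUP BY country";
        String cuboid = pickCuboid(Set.of("country"), available);
        System.out.println(addPredicate(sql, cuboid));
    }
}
```

If the chosen cuboid has more dimensions than the query references, the query's own GROUP BY re-aggregates the rows, which stays correct for additive measures such as sums.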

Education

2014 - 2018

Bachelor's Degree in Computer Engineering

Netaji Subhas Institute of Technology - Delhi, India

Skills

Tools

IntelliJ IDEA, Apache Maven, Git, Slack

Languages

SQL, Java

Frameworks

Spark, Presto, Hadoop, Yarn, Spring, Hibernate

Storage

Apache Hive, MySQL

Paradigms

REST

Platforms

Apache Hudi, Kubernetes, Docker, MacOS

Other

Gmail
