
Akash Roy
Verified Expert in Engineering
Data Engineer and Developer
Bengaluru, India
Toptal member since September 15, 2022
Akash is a technically sound Java developer with four years of experience designing, developing, and managing big data applications. Adept at readily gathering and translating complex requirements into viable solutions, Akash uses proven skills to design and write applications from conception to completion.
Experience
- SQL - 6 years
- Java - 6 years
- Presto - 3 years
- Hadoop - 3 years
- Spark - 3 years
- Apache Hive - 3 years
- Docker - 1 year
- Kubernetes - 1 year
Preferred Environment
MacOS, IntelliJ IDEA, Slack, Gmail
The most amazing...
...project I've developed is Tesseract OLAP. It uses Spark and Presto to provide faster query runtime, less computational overhead, and a better user experience.
Work Experience
Senior Software Engineer
Airtel Africa
- Computed aggregated datasets for an online analytical processing (OLAP) engine using Spark and added a cuboid partition column to store multidimensional aggregates in the same external Hive table.
- Integrated internal APIs from Spark Structured Streaming into a Spark batch, achieving fault tolerance and exactly-once semantics.
- Enhanced Presto Core to intercept an SQL query and rewrite it at runtime with an optimal cuboid predicate after analyzing referred dimensions.
Software Engineer
Mobileum
- Enhanced an in-house ETL framework for running a Spark job using JSON configuration files and added support for in-memory master management.
- Analyzed data using various techniques in PySpark and Presto, such as row pattern matching. Devised rules to catch telecom fraud like FAS and fixed-line fraud.
- Implemented various user-defined functions (UDFs) and user-defined aggregate functions (UDAFs) to transform CADs into the required format. Used the HyperLogLogPlus data structure to create a mergeable unique count measure at query time.
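To illustrate what "mergeable unique count" means in the last bullet, here is a minimal sketch in plain Python. It implements basic HyperLogLog, not the HyperLogLogPlus variant named above (no sparse representation or bias-correction tables), and it is not the UDAF from this project. The class name, method names, and the choice of SHA-1 as the hash are all assumptions made for the example.

```python
import hashlib
import math


class HyperLogLog:
    """Basic HyperLogLog with 2**p registers (illustrative, hypothetical API)."""

    def __init__(self, p=12):
        self.p = p
        self.m = 1 << p
        self.registers = [0] * self.m

    def add(self, value):
        # Take a 64-bit hash: the first p bits pick a register,
        # the remaining bits determine the rank.
        digest = hashlib.sha1(str(value).encode()).digest()
        h = int.from_bytes(digest[:8], "big")
        idx = h >> (64 - self.p)
        rest = h & ((1 << (64 - self.p)) - 1)
        # Rank = position of the leftmost 1-bit in the remaining bits.
        rank = (64 - self.p) - rest.bit_length() + 1
        self.registers[idx] = max(self.registers[idx], rank)

    def merge(self, other):
        # Merging is an element-wise max, so partial sketches built on
        # separate partitions can be combined without rescanning the data.
        if self.p != other.p:
            raise ValueError("sketches must use the same precision")
        merged = HyperLogLog(self.p)
        merged.registers = [max(a, b) for a, b in zip(self.registers, other.registers)]
        return merged

    def count(self):
        alpha = 0.7213 / (1 + 1.079 / self.m)  # constant valid for m >= 128
        raw = alpha * self.m ** 2 / sum(2.0 ** -r for r in self.registers)
        zeros = self.registers.count(0)
        if raw <= 2.5 * self.m and zeros:
            # Small-range correction: fall back to linear counting.
            return self.m * math.log(self.m / zeros)
        return raw
```

Because the merge is an element-wise maximum, a sketch stored per row group can be combined at query time and yields the same registers as a sketch built over the union of the underlying values.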
Software Developer
MobiKwik
- Contributed to creating the Digital Gold feature using the MVC architecture, Spring, Hibernate, and MySQL. This feature enables users to buy and sell gold in real time.
- Created an algorithm for a KYC flow and user onboarding to automatically blur sensitive information like identification numbers in government-issued IDs.
- Wrote a Selenium script to onboard 100,000 merchants to the in-house mutual fund application. Onboarding a single merchant had required filling out a form of around five pages by hand, and automating the flow reduced that manual effort.
Experience
Tesseract OLAP
I computed multidimensional aggregated datasets using a Spark batch job and stored them in a single Hive external table. A cuboid partition column identifies which partition stores the aggregates for which set of dimensions. I also employed Presto as the query layer.
After analyzing referred dimensions, I enhanced Presto Core to intercept an SQL query and rewrite it at runtime with an optimal cuboid predicate.
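The cuboid idea described above can be sketched in plain Python. This is only an illustration under stated assumptions, not the project's Spark or Presto code: the sample rows, the dimension names, the table name, the "fewest dimensions" selection heuristic, and every function name are hypothetical.

```python
from collections import defaultdict
from itertools import combinations

# Hypothetical fact rows and dimensions, for illustration only.
DIMENSIONS = ("country", "plan")
ROWS = [
    {"country": "KE", "plan": "prepaid", "revenue": 10},
    {"country": "KE", "plan": "postpaid", "revenue": 20},
    {"country": "NG", "plan": "prepaid", "revenue": 5},
]


def cuboid_name(dims):
    """Label a cuboid by its sorted dimension names, e.g. 'country,plan'."""
    return ",".join(sorted(dims)) if dims else "ALL"


def build_cuboids(rows, dimensions, measure):
    """Sum the measure for every subset of dimensions.

    Each top-level key plays the role of one value of the cuboid
    partition column in the shared table.
    """
    table = {}
    for k in range(len(dimensions) + 1):
        for dims in combinations(sorted(dimensions), k):
            groups = defaultdict(int)
            for row in rows:
                groups[tuple(row[d] for d in dims)] += row[measure]
            table[cuboid_name(dims)] = dict(groups)
    return table


def pick_cuboid(referred_dims, available):
    """Pick the covering cuboid with the fewest dimensions (a simple heuristic)."""
    referred = frozenset(referred_dims)
    candidates = [c for c in available if referred <= c]
    if not candidates:
        raise ValueError("no materialized cuboid covers the referred dimensions")
    return min(candidates, key=lambda c: (len(c), sorted(c)))


def rewrite(table, group_by, measure, available):
    """Build a SUM query restricted to the chosen cuboid partition."""
    cuboid = pick_cuboid(group_by, available)
    cols = ", ".join(group_by)
    select = f"{cols}, SUM({measure})" if group_by else f"SUM({measure})"
    sql = f"SELECT {select} FROM {table} WHERE cuboid = '{cuboid_name(cuboid)}'"
    if group_by:
        sql += f" GROUP BY {cols}"
    return sql
```

In this sketch, a query that refers only to `country` is pointed at the `country` partition when it exists. If only a wider cuboid is materialized, the query is pointed at that one and the `SUM` re-aggregates over the extra dimensions.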
Education
Bachelor's Degree in Computer Engineering
Netaji Subhas Institute of Technology - Delhi, India
Skills
Tools
IntelliJ IDEA, Apache Maven, Git, Slack
Languages
SQL, Java
Frameworks
Spark, Presto, Hadoop, Yarn, Spring, Hibernate
Storage
Apache Hive, MySQL
Paradigms
REST
Platforms
Apache Hudi, Kubernetes, Docker, MacOS
Other
Gmail