Sony Shrestha, Developer in Geldrop, Netherlands
Sony Shrestha

Verified Expert in Engineering

Bio

Sony is a reliable software engineer with over three years of experience as a database developer. She currently works at a data analytics company, where she handles millions of transactions per hour. Most of Sony's work involves back-end architecture design, ETL processes, performance tuning, Pentaho job optimization, interactive dashboard design, data pipeline creation, analysis of complex datasets, and mentoring interns and trainees.

Portfolio

Returnalyze
Snowflake, Amazon S3 (AWS S3), Python, PostgreSQL, Data Build Tool (dbt)
Verisk Nepal Pvt. Ltd.
Pentaho, JasperReports, Python, PostgreSQL, Amazon Web Services (AWS)...
Toptal Client
SQL, Amazon Athena, Amazon S3 (AWS S3), Python, Scraping, Spotify API...

Experience

  • Microsoft Power BI - 3 years
  • Apache Airflow - 3 years
  • PostgreSQL - 3 years
  • ETL - 3 years
  • Pentaho - 3 years
  • Microsoft SQL Server - 3 years
  • SQL - 3 years
  • Python - 2 years

Availability

Full-time

Preferred Environment

Windows, Visual Studio Code (VS Code), Git

The most amazing...

...project I've worked on is Foneloan, a service offered by banks to provide short-term loans to employed individuals through their mobile banking apps.

Work Experience

Data Engineer

2023 - PRESENT
Returnalyze
  • Developed a robust notification system to alert clients in real time when new data is loaded and insights are generated, enhancing client engagement and decision-making efficiency.
  • Designed an ETL pipeline to automate the processing of client data delta feeds.
  • Developed a pipeline to integrate with the Bazaarvoice API, enabling clients to retrieve review data efficiently.
  • Led end-to-end management of retail sales and returns processing, using dbt to integrate data from S3 into Snowflake tables.
  • Designed and implemented a robust notification system to alert users of new files, missing files, and any column mismatches detected during pipeline execution.
  • Conducted raw data analysis, providing valuable insights and automated reports to US stakeholders for decision-making.
  • Led the onboarding of 10 retailer clients, customizing data pipelines to meet unique client demands and ensuring seamless integration with company systems.
  • Designed a robust framework to automate the generation of client-specific reports.
  • Designed and implemented scalable, automated end-to-end data pipelines to ingest and process large datasets, including sales, returns, and product data from multiple retailers.
Technologies: Snowflake, Amazon S3 (AWS S3), Python, PostgreSQL, Data Build Tool (dbt)

Data Engineer

2021 - 2023
Verisk Nepal Pvt. Ltd.
  • Optimized existing Pentaho jobs and transformations, which reduced processing time by around 83%.
  • Created the end-to-end workflow to migrate legacy data from Microsoft SQL Server to PostgreSQL.
  • Implemented data masking techniques, including pseudonymization, anonymization, encryption, redaction, shuffling, and scrambling, to safeguard Personally Identifiable Information (PII) and ensure compliance with data governance regulations.
  • Implemented an automated solution using AWS Lambda to trigger the execution of the ETL pipeline upon the arrival of files from clients in an S3 bucket.
  • Identified the root cause of technical bugs raised in production and made necessary changes in ETL to fix them.
  • Prepared business reports for A-PLUS Property using Jasper scripts.
  • Evaluated the workflow and increased the efficiency of data pipelines that process a large volume of data daily.
Technologies: Pentaho, JasperReports, Python, PostgreSQL, Amazon Web Services (AWS), Amazon Aurora, Microsoft SQL Server, Data Cleansing, Data Cleaning, Data Migration, Snowpipe, Microsoft Access, Microsoft Excel, Microsoft Power BI, MySQL
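
Three of the masking techniques listed above (pseudonymization, redaction, and shuffling) can be illustrated with a minimal Python sketch. This is not the code used in the role described; the function names, the hard-coded key, and the 16-character token length are assumptions chosen for illustration, and only the standard library is used.

```python
import hashlib
import hmac
import random
from typing import List, Optional

# Hypothetical key for illustration; a real key would come from a secrets manager.
SECRET_KEY = b"example-key-not-for-production"


def pseudonymize(value: str, key: bytes = SECRET_KEY) -> str:
    """Replace a value with a keyed hash: the same input always maps to the
    same token, and the original cannot be recovered from the token."""
    digest = hmac.new(key, value.encode("utf-8"), hashlib.sha256).hexdigest()
    return digest[:16]  # truncated for readability (an assumption)


def redact(value: str, visible: int = 4, mask_char: str = "*") -> str:
    """Hide everything except the last `visible` characters."""
    if len(value) <= visible:
        return mask_char * len(value)
    return mask_char * (len(value) - visible) + value[-visible:]


def shuffle_column(values: List[str], seed: Optional[int] = None) -> List[str]:
    """Return the same values in a randomized row order, so a value no longer
    lines up with the record it came from."""
    shuffled = list(values)
    random.Random(seed).shuffle(shuffled)
    return shuffled
```

For example, `redact("1234567890")` keeps only the last four characters visible, while `pseudonymize` lets masked tables still be joined on the tokenized column because equal inputs produce equal tokens.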

Data Engineer

2022 - 2022
Toptal Client
  • Prepared a Python script to gather music-related data from the Soundchart, Chartmetric, and Spotify APIs for an app in the music industry.
  • Prepared complex SQL scripts to migrate data from the raw database to the clean database.
  • Provisioned, changed, and improved resources on AWS using Terraform (Infrastructure as Code).
  • Prepared an AWS Glue crawler with a custom classifier, enabling it to identify the schema of all sorts of data available in Amazon S3.
Technologies: SQL, Amazon Athena, Amazon S3 (AWS S3), Python, Scraping, Spotify API, Object-oriented Programming (OOP), Apache Airflow, Terraform

Data Engineer

2021 - 2021
ExtensoData
  • Prepared an ETL pipeline for ingesting transactional data across 10 primary data sources using Pentaho.
  • Prepared jobs and transformations in Pentaho Data Integration (PDI), starting from extracting data from the source, performing necessary modifications for calculating facts, and loading relevant information into final tables for reporting.
  • Assembled an ETL job that restarts its execution from a failed step, preventing the necessity of re-processing steps that succeeded in the previous execution.
  • Implemented a stepwise logging mechanism to keep track of metadata steps, i.e., when the job started, failed, or succeeded.
  • Generated dummy data for 25 tables in master and raw databases, using the Faker module in Python and Pentaho.
  • Prepared a DAG for scheduling and monitoring ETL pipelines using Apache Airflow.
  • Deployed Foneloan as a product in four different Nepalese banks.
  • Guided interns to help them understand the ETL process and Pentaho.
  • Created relational data tables and designed forms and various reports to meet management needs using Microsoft Access.
Technologies: Pentaho, MySQL, PostgreSQL, Oracle, Python, Apache Airflow, Microsoft SQL Server, Databases, Database Design, Data Migration, Data Cleaning, Data Cleansing, Data Engineering, Microsoft Power BI
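
The restart-from-failure behavior and stepwise logging described above were built in Pentaho. The same idea can be sketched in plain Python, under the assumptions that each step is a callable and that step status is kept in a small JSON state file. The names `run_job` and `state_file` are hypothetical.

```python
import json
from datetime import datetime, timezone
from pathlib import Path
from typing import Callable, Dict, List, Tuple


def run_job(steps: List[Tuple[str, Callable[[], None]]], state_file: Path) -> List[dict]:
    """Run steps in order, skipping any that succeeded in a previous execution.

    Returns a log with one entry per status change (started, succeeded,
    failed, skipped), each stamped with a UTC time.
    """
    state: Dict[str, str] = json.loads(state_file.read_text()) if state_file.exists() else {}
    log: List[dict] = []

    def record(step: str, status: str) -> None:
        log.append({"step": step, "status": status,
                    "at": datetime.now(timezone.utc).isoformat()})

    for name, func in steps:
        if state.get(name) == "succeeded":
            record(name, "skipped")  # already done in an earlier run
            continue
        record(name, "started")
        try:
            func()
            status = "succeeded"
        except Exception:
            status = "failed"
        state[name] = status
        state_file.write_text(json.dumps(state))  # persist after every step
        record(name, status)
        if status == "failed":
            return log  # stop here; the next run resumes from this step

    # Every step succeeded: clear the state so the next run starts from scratch.
    if state_file.exists():
        state_file.unlink()
    return log
```

Persisting the state after every step, rather than once at the end, is what allows a rerun to pick up at the failed step instead of re-processing the steps that already succeeded.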

Associate Data Engineer

2019 - 2021
ExtensoData
  • Evaluated the current architecture of Foneloan and designed the new one that satisfies the diverse requirements of multiple clients while keeping the ETL flow generic.
  • Automated the ETL process, eliminating the necessity of manual coding to manage the data flows.
  • Performed data cleansing and validation using Pentaho Data Integration (PDI).
  • Optimized SQL queries, reducing processing time by more than 50%.
  • Increased the efficiency of data fetching by approximately 30%, using query optimization and indexing.
  • Ingested data from disparate data sources like SQL, MySQL, Oracle, and CSV to create data views for business intelligence tools.
  • Tested the end-to-end flow of the project named Budget, raised bugs, and provided recommendations.
  • Performed web scraping using Python to generate schedules for different airlines.
  • Improved overall decision-making by 30% by designing and developing BI reports and providing invaluable insights.
Technologies: MySQL, Oracle, Pentaho, Apache Kafka, Python, Git, ETL, Microsoft SQL Server, Databases, Database Design, Data Migration, Data Cleaning, Data Engineering, Microsoft Power BI

Data Trainee

2019 - 2019
ExtensoData, F1Soft
  • Organized queries, including merge operations and window functions.
  • Assembled stored procedures for performing repetitive queries.
  • Prepared triggers to keep track of insert, delete, and update operations performed.
  • Researched different ways of handling personally identifiable information.
  • Built database structures containing different constraints.
Technologies: Agile Practices, Databases, MySQL, SQL, Microsoft Power BI, Python, Pentaho, Data Engineering
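
Triggers that track insert, update, and delete operations, as mentioned above, can be sketched as follows. The profile lists MySQL for this role; this example uses Python's built-in `sqlite3` module as a stand-in so that it runs without a database server, and the table names, column names, and sample row are hypothetical.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
-- Base table with a few constraints (primary key, NOT NULL, UNIQUE).
CREATE TABLE customer (
    id    INTEGER PRIMARY KEY,
    name  TEXT NOT NULL,
    email TEXT NOT NULL UNIQUE
);

-- Audit table that the triggers write to.
CREATE TABLE customer_audit (
    audit_id    INTEGER PRIMARY KEY AUTOINCREMENT,
    customer_id INTEGER NOT NULL,
    operation   TEXT NOT NULL CHECK (operation IN ('INSERT', 'UPDATE', 'DELETE')),
    changed_at  TEXT NOT NULL DEFAULT CURRENT_TIMESTAMP
);

CREATE TRIGGER trg_customer_insert AFTER INSERT ON customer
BEGIN
    INSERT INTO customer_audit (customer_id, operation) VALUES (NEW.id, 'INSERT');
END;

CREATE TRIGGER trg_customer_update AFTER UPDATE ON customer
BEGIN
    INSERT INTO customer_audit (customer_id, operation) VALUES (NEW.id, 'UPDATE');
END;

CREATE TRIGGER trg_customer_delete AFTER DELETE ON customer
BEGIN
    INSERT INTO customer_audit (customer_id, operation) VALUES (OLD.id, 'DELETE');
END;
""")

# Each statement below fires one trigger and adds one audit row.
conn.execute("INSERT INTO customer (name, email) VALUES (?, ?)", ("Asha", "asha@example.com"))
conn.execute("UPDATE customer SET name = ? WHERE id = ?", ("Asha K.", 1))
conn.execute("DELETE FROM customer WHERE id = ?", (1,))

audit = conn.execute(
    "SELECT customer_id, operation FROM customer_audit ORDER BY audit_id"
).fetchall()
print(audit)
```

MySQL trigger syntax differs slightly (for example, it requires `FOR EACH ROW`), so the DDL here would need adapting before use on that engine.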

Experience

Foneloan

http://foneloan.com.np/
Foneloan is a digital lending service offered by banks to provide small short-term loans instantly to employed individuals through their mobile banking apps. These pre-approved loans are available to banking customers deemed eligible by an automated analytics system according to the criteria set by banks.

Esewa

https://esewa.com.np/
Esewa is a digital wallet based in Nepal that provides instant online payment solutions. This payment gateway is Nepal's first safe and easy way to pay online without exposing credit card numbers and bank account numbers to merchants.

A-PLUS Property

Automated Property Loss Underwriting System (A-PLUS) Property collects and reports insurance claims and loss history associated with homes, automobiles, or personal property. It gives companies a standard format for reporting property and liability claims, which are then incorporated into the A-PLUS database.

Education

2015 - 2019

Bachelor's Degree in Computer Engineering

KL University - Vijayawada, India

Certifications

FEBRUARY 2022 - PRESENT

AWS Solutions Architect Associate

AWS

NOVEMBER 2021 - PRESENT

AWS Certified Cloud Practitioner

AWS

Skills

Libraries/APIs

JasperReports, Spotify API

Tools

Microsoft Power BI, Microsoft Excel, Apache Airflow, Microsoft Access, Amazon Athena, Git, Shell, Terraform, AWS Glue, AWS Batch

Languages

SQL, Python, Snowflake

Paradigms

ETL, Database Design, Object-oriented Programming (OOP)

Platforms

Pentaho, Oracle, Windows, Amazon Web Services (AWS), Visual Studio Code (VS Code), Apache Kafka, Amazon EC2

Storage

MySQL, PostgreSQL, Microsoft SQL Server, Databases, Amazon Aurora, SQL Server Integration Services (SSIS), Amazon S3 (AWS S3)

Frameworks

ASP.NET MVC

Other

Data Migration, Data Engineering, DataFrames, Data Cleaning, Data Cleansing, Data Build Tool (dbt), Agile Practices, Snowpipe, Scraping, Data Analytics, Personally Identifiable Information (PII), Amazon RDS
