
Sony Shrestha
Verified Expert in Engineering
Database Developer
Geldrop, Netherlands
Toptal member since March 22, 2022
Sony is a reliable software engineer with over three years of experience as a database developer, primarily focusing on databases. She currently works in a data analytics company and handles millions of transactions per hour. Most of Sony's work includes back-end architecture design, ETL processes, performance tuning, Pentaho job optimization, designing interactive dashboards, creating data pipelines, analyzing complex datasets, and mentoring interns and trainees.
Portfolio
Experience
- Microsoft Power BI - 3 years
- Apache Airflow - 3 years
- PostgreSQL - 3 years
- ETL - 3 years
- Pentaho - 3 years
- Microsoft SQL Server - 3 years
- SQL - 3 years
- Python - 2 years
Availability
Preferred Environment
Windows, Visual Studio Code (VS Code), Git
The most amazing...
...project I've worked on is Foneloan, a service offered by banks to provide short-term loans to employed individuals through their mobile banking apps.
Work Experience
Data Engineer
Returnalyze
- Developed a robust notification system to alert clients in real time when new data is loaded and insights are generated, enhancing client engagement and decision-making efficiency.
- Designed an ETL pipeline to automate the processing of client data delta feeds.
- Developed a pipeline to integrate with the Bazaarvoice API, enabling clients to retrieve review data efficiently.
- Led end-to-end management of retail sales and returns processing, leveraging DBT for seamless integration of data from S3 to Snowflake tables.
- Designed and implemented a robust notification system to alert users of new files, missing files, and any column mismatches detected during pipeline execution.
- Conducted raw data analysis, providing valuable insights and automated reports to US stakeholders for decision-making.
- Led the onboarding of 10 retailer clients, customizing data pipelines to meet unique client demands and ensuring seamless integration with company systems.
- Designed a robust framework to automate client-specific reports generation.
- Designed and implemented scalable, automated end-to-end data pipelines to ingest and process large datasets, including sales, returns, and product data from multiple retailers.
Data Engineer
Verisk Nepal Pvt. Ltd.
- Optimized existing Pentaho jobs and transformations, which reduced processing time by around 83%.
- Created the end-to-end workflow to migrate legacy data from Microsoft SQL Server to PostgreSQL.
- Implemented data masking techniques, including pseudonymization, anonymization, encryption, redaction, shuffling, and scrambling, to safeguard Personally Identifiable Information (PII) and ensure compliance with data governance regulations.
- Implemented an automated solution using AWS Lambda to trigger the execution of the ETL pipeline upon the arrival of files from clients in an S3 bucket.
- Identified the root cause of technical bugs raised in production and made necessary changes in ETL to fix them.
- Prepared business reports for A-PLUS Property using Jasper scripts.
- Evaluated the workflow and increased the efficiency of data pipelines that process a large volume of data daily.
Data Engineer
Toptal Client
- Prepared a Python script for scraping data from Soundchart, Chartmetric, and Spotify to gather music-related data from several APIs for an app in the music industry.
- Prepared complex SQL scripts while migrating data from raw to clean database.
- Provisioned, changed, and improved resources on AWS using Terraform (Infrastructure as a Code).
- Prepared AWS Glue Crawler that uses a custom classifier, making it able to identify schema of all sorts of data available in AWS S3.
Data Engineer
ExtensoData
- Prepared an ETL pipeline for ingesting transactional data across 10 primary data sources using Pentaho.
- Prepared jobs and transformations in Pentaho Data Integration (PDI), starting from extracting data from the source, performing necessary modifications for calculating facts, and loading relevant information into final tables for reporting.
- Assembled an ETL job that restarts its execution from a failed step, preventing the necessity of re-processing steps that succeeded in the previous execution.
- Implemented a stepwise logging mechanism to keep track of metadata steps, i.e., when the job started, failed, or succeeded.
- Generated dummy data for 25 tables in master and raw databases, using the Faker module in Python and Pentaho.
- Prepared a DAG for scheduling and monitoring ETL pipelines using Apache Airflow.
- Deployed Foneloan as a product in four different Nepalese banks.
- Guided interns to help them understand the ETL process and Pentaho.
- Created relational data tables and designed forms and various reports to meet management needs using Microsoft Access.
Associate Data Engineer
ExtensoData
- Evaluated the current architecture of Foneloan and designed the new one that satisfies the diverse requirements of multiple clients while keeping the ETL flow generic.
- Automated the ETL process, eliminating the necessity of manual coding to manage the data flows.
- Performed data cleansing and validation using Pentaho Data Integration (PDI).
- Optimized SQL queries, reducing processing time by more than 50%.
- Increased the efficiency of data fetching by approximately 30%, using query optimization and indexing.
- Ingested data from disparate data sources like SQL, MySQL, Oracle, and CSV to create data views for business intelligence tools.
- Tested the end-to-end flow of the project named Budget, raised bugs, and provided recommendations.
- Performed web scrapping using Python to generate schedules for different airlines.
- Improved overall decision-making by 30% by designing and developing BI reports and providing invaluable insights.
Data Trainee
Extensodata, F1Soft
- Organized queries, including merge operations and window functions.
- Assembled stored procedures for performing repetitive queries.
- Prepared triggers to keep track of insert, delete, and update operations performed.
- Researched different ways of handling personally identifiable information.
- Built database structures containing different constraints.
Experience
Foneloan
http://foneloan.com.np/Esewa
https://esewa.com.np/A-PLUS Property
Education
Bachelor's Degree in Computer Engineering
KL University - Vijayawada, India
Certifications
AWS Solutions Architect Associate
AWS
AWS Certified Cloud Practitioner
AWS
Skills
Libraries/APIs
JasperReports, Spotify API
Tools
Microsoft Power BI, Microsoft Excel, Apache Airflow, Microsoft Access, Amazon Athena, Git, Shell, Terraform, AWS Glue, AWS Batch
Languages
SQL, Python, Snowflake
Paradigms
ETL, Database Design, Object-oriented Programming (OOP)
Platforms
Pentaho, Oracle, Windows, Amazon Web Services (AWS), Visual Studio Code (VS Code), Apache Kafka, Amazon EC2
Storage
MySQL, PostgreSQL, Microsoft SQL Server, Databases, Amazon Aurora, SQL Server Integration Services (SSIS), Amazon S3 (AWS S3)
Frameworks
ASP.NET MVC
Other
Data Migration, Data Engineering, DataFrames, Data Cleaning, Data Cleansing, Data Build Tool (dbt), Agile Practices, Snowpipe, Scraping, Data Analytics, Personally Identifiable Information (PII), Amazon RDS
How to Work with Toptal
Toptal matches you directly with global industry experts from our network in hours—not weeks or months.
Share your needs
Choose your talent
Start your risk-free talent trial
Top talent is in high demand.
Start hiring