Satish Basetty
Verified Expert in Engineering
Database Developer
Los Angeles, CA, United States
Toptal member since December 14, 2020
Satish is a senior data engineer with over 14 years of experience in database and data warehouse projects in both on-premises and cloud. He is an expert in the design and development of ETL pipelines using Python and SQL over Cloud Dataflow orchestration with Apache Airflow. He automated processing data of royalties and copyrights for Universal Music Group. Satish has provided solutions encompassing reports and visualizations, real-time data processing, migrations, and performance tuning.
Portfolio
Experience
Availability
Preferred Environment
Slack, PyCharm, Windows, Linux, MacOS, Docker, Docker Hub, Oracle, Streaming Data, HIPAA Compliance, Amazon Web Services (AWS), Data Analytics, Tableau, Data Modeling, PostgreSQL, SQL, Google Analytics, Data Engineering, Data Analysis, APIs
The most amazing...
...solution I've provided was a highly scalable daily sales reporting process.
Work Experience
Senior Data Engineer
The Estee Lauder Companies Inc.
- Developed data pipeline jobs to ingest web search and sales data into BigQuery.
- Fixed and tracked issues that were occurring in the data pipeline using the Jira tool.
- Oversaw deployments using Git and CI/CD. Created a high-level process flow and technical design specification document.
SR Data Engineer
LA City
- Deployed the web application from on-prem to Google Cloud, developed Dataflow pipelines, and implemented CI/CD in the Cloud environment.
- Tracked bugs using Jira and troubleshot pipeline-related errors and performance tuning.
- Oversaw the workload processed from the pipeline jobs.
Senior Data Engineer
Kitchen United
- Created a daily reporting process to send out reports to members. This daily process ingests the data into the data lake then the "send email" process sends the reporting emails to all members.
- Developed the ETL pipeline to ingest the purchase data into the data lake. Created the batch job using PySpark and Apache Beam to load the third-party sales data into the data lake.
- Designed and developed the data mart that provides insights and visualization.
- Automated the process for onboarding and offboarding members.
Senior Data Engineer
Fabfitfun
- Designed a data mart to track the sales, CPA, and churns across various sales channels—provided a solution for automated AB testing.
- Developed the ETL pipeline to ingest data related to the add on purchases and seasonal box delivery to members across Fabfitfun.
- Developed the ETL pipeline for survey data ingestion.
- Designed and developed the style data mart that provides visualizations across top-selling SKUs.
Senior Data Engineer
Machinima
- Developed a process that provides video data insights.
- Designed and developed the data mart that provides visualizations on the best performing videos across channels.
- Configured the Goofys file system used as a primary source/target for most of the ELT/ETL process.
Data Engineer
PennyMac
- Gathered requirements and completed data analysis, design, and development of the ELT/ETL process using Pentaho and Python.
- Designed a data lake on AWS for various processes with data ingestion into the data warehouse Redshift and Snowflake. Worked with stakeholders in resolving issues and completing requirements.
- Oversaw performance tuning of the queries and provided operations support.
Senior Database Developer
BeachMint
- Designed and developed ELT/ETL processes using Python.
- Designed a sales data mart of complex queries.
- Oversaw performance tuning of queries.
Senior Developer
Bank of America
- Designed and developed the ETL process. Collaborated with stakeholders to resolve issues and clarify requirements.
- Designed the order data-mart and loaded the data using the ETL Pentaho and SQL.
- Managed the performance tuning of the queries.
Database developer
Universal Music Group
- Developer ETL processes using Oracle PL/SQL to extract the legacy data and load it into the data mart.
- Oversaw the performance tuning of complex queries. Gathered requirements from end-users and designed the data mart for royalties and copyrights.
- Performed data analysis for royalties and copyrights. Created an automation process for processing the data.
ETL Developer
Prokarma
- Oversaw the data migration project from the legacy system to SAP.
- Developed the ETL process to handle the car's data.
- Collaborated with stakeholders on requirements gathering. Performed data analysis.
Senior Developer
RapidIgm Consulting
- Developed an ETL process to perform data integration from various sources. Peformed analysis on the Rx and DDD data.
- Designed the sales data mart and assisted with complex queries and performance tuning.
- Collaborated with stakeholders to gather requirements and develop the data modeling.
Experience
Sales Data Ingestion
I collaborated worked with the finance, marketing, data science, and BI teams and provided solutions accordingly. I helped build the data modeling that enabled the BI team to create reports and dashboards. I created a reconciliation process to keep track of the orders, a cloud to watch alerts, error reporting, and an outbound process to various third-party vendors.
Gaming/Video Data Ingestion for Machinima
I designed the data mart to track insights at the video-id grain from various channels and collaborated with the finance, email marketing and BI teams. I developed a process to ingest the sentiment data events into the data mart and configured the Goofys file system used as the primary source/target for most of the ELT/ETL process.
Sentiment Data Analysis
Education
Master's Degree in Computer Science
Texas A & M University - College Station, Texas, USA
Skills
Libraries/APIs
Scikit-learn
Tools
AWS Glue, Apache Airflow, Amazon Athena, Apache Beam, BigQuery, Cloud Dataflow, Google Cloud Composer, Tableau, Google Analytics, PyCharm, Slack, Docker Hub, Microsoft Power BI
Languages
SQL, Python, Snowflake, Python 3, Python 2, Bash
Paradigms
ETL, Business Intelligence (BI), HIPAA Compliance
Platforms
Azure, AWS Lambda, Amazon Web Services (AWS), Docker, Oracle, Google Cloud Platform (GCP), Kubernetes
Storage
PostgreSQL, Redshift, Google Cloud, Data Pipelines, JSON
Other
Data Warehousing, Query Optimization, Data Warehouse Design, Data Engineering, Migration, Data Analytics, ETL Pipelines, Indexing, Streaming Data, Data Modeling, Data Analysis, APIs, Slurm Workload Manager, Cloud
How to Work with Toptal
Toptal matches you directly with global industry experts from our network in hours—not weeks or months.
Share your needs
Choose your talent
Start your risk-free talent trial
Top talent is in high demand.
Start hiring