Senior Data Engineer
2022 - PRESENT
The Estee Lauder Companies Inc.
- Developed data pipeline jobs to ingest web search and sales data into BigQuery.
- Tracked and fixed data pipeline issues using Jira.
- Oversaw deployments using Git and CI/CD. Created a high-level process flow and technical design specification document.
Technologies: Apache Airflow, Python, Google Cloud Platform (GCP), BigQuery, SQL, Bash, Data Pipelines, Data Engineering, HIPAA Compliance, Data Analytics, Google Analytics

SR Data Engineer
2021 - 2022
LA City
- Deployed the web application from on-prem to Google Cloud, developed Dataflow pipelines, and implemented CI/CD in the Cloud environment.
- Tracked bugs using Jira, troubleshot pipeline-related errors, and tuned pipeline performance.
- Oversaw the workload processed by the pipeline jobs.
Technologies: Apache Airflow, BigQuery, Cloud, Docker, Kubernetes, Microsoft Power BI, PostgreSQL, Google Analytics, JSON

Senior Data Engineer
2019 - 2021
Kitchen United
- Created a daily reporting process to send out reports to members. This daily process ingests the data into the data lake, then the "send email" process sends the reporting emails to all members.
- Developed the ETL pipeline to ingest the purchase data into the data lake. Created the batch job using PySpark and Apache Beam to load the third-party sales data into the data lake.
- Designed and developed the data mart that provides insights and visualization.
- Automated the process for onboarding and offboarding members.
Technologies: Data Marts, Data Lakes, Amazon Athena, PostgreSQL, AWS Glue, Python 3, ETL, Google Cloud, Docker, Apache Beam, ETL Pipelines, Python, Business Intelligence (BI), Data Engineering, Streaming Data, Amazon Web Services (AWS), Data Analytics, Data Modeling, SQL

Senior Data Engineer
2018 - 2019
Fabfitfun
- Designed a data mart to track sales, CPA, and churn across various sales channels. Provided a solution for automated A/B testing.
- Developed the ETL pipeline to ingest data related to add-on purchases and seasonal box deliveries to members across Fabfitfun.
- Developed the ETL pipeline for survey data ingestion.
- Designed and developed the style data mart that provides visualizations across top-selling SKUs.
Technologies: Data Marts, Qualtrics, Bash Script, PostgreSQL, Redshift Spectrum, Amazon EC2, Python 3, Apache Airflow, ETL, Apache Beam, ETL Pipelines, Python, Business Intelligence (BI), Data Engineering, Migration, Amazon Web Services (AWS), Tableau

Senior Data Engineer
2017 - 2018
Machinima
- Developed a process that provides video data insights.
- Designed and developed the data mart that provides visualizations on the best performing videos across channels.
- Configured the Goofys file system used as a primary source/target for most of the ELT/ETL process.
Technologies: Redshift, Bash Script, PostgreSQL, Pentaho, Python 3, ETL Pipelines, Python, Business Intelligence (BI), Data Engineering, Oracle, Data Modeling

Data Engineer
2015 - 2017
PennyMac
- Gathered requirements and completed data analysis, design, and development of the ELT/ETL process using Pentaho and Python.
- Designed a data lake on AWS for various processes, with data ingestion into the Redshift and Snowflake data warehouses. Worked with stakeholders to resolve issues and complete requirements.
- Oversaw performance tuning of the queries and provided operations support.
Technologies: Snowflake, Python 2, Pentaho, PostgreSQL, Python, Data Engineering

Senior Database Developer
2014 - 2015
BeachMint
- Designed and developed ELT/ETL processes using Python.
- Designed a sales data mart to support complex queries.
- Oversaw performance tuning of queries.
Technologies: Redshift, Bash, Python 2, PostgreSQL, Tableau

Senior Developer
2013 - 2014
Bank of America
- Designed and developed the ETL process. Collaborated with stakeholders to resolve issues and clarify requirements.
- Designed the order data mart and loaded the data using Pentaho ETL and SQL.
- Managed the performance tuning of the queries.
Technologies: Oracle, PostgreSQL, Python 2

Database Developer
2011 - 2013
Universal Music Group
- Developed ETL processes using Oracle PL/SQL to extract the legacy data and load it into the data mart.
- Oversaw the performance tuning of complex queries. Gathered requirements from end-users and designed the data mart for royalties and copyrights.
- Performed data analysis for royalties and copyrights. Created an automation process for processing the data.
Technologies: Bash Script, Oracle PL/SQL

ETL Developer
2007 - 2010
Prokarma
- Oversaw the data migration project from the legacy system to SAP.
- Developed the ETL process to handle the car's data.
- Collaborated with stakeholders on requirements gathering. Performed data analysis.
Technologies: SAP FICO, Shell, Oracle PL/SQL

Senior Developer
2006 - 2007
RapidIgm Consulting
- Developed an ETL process to perform data integration from various sources. Performed analysis on the Rx and DDD data.
- Designed the sales data mart and assisted with complex queries and performance tuning.
- Collaborated with stakeholders to gather requirements and develop the data model.
Technologies: SQL, Shell, Oracle