Svetlana Karpilovskiy
Verified Expert in Engineering
Data Warehouse Design Developer
Los Angeles, CA, United States
Toptal member since July 14, 2020
Svetlana is a top data engineer and architect with over 20 years of experience specializing in tool-based and script-based ETL/ELT for data warehouses, data lakes, and data analytical systems. She spearheaded Cedars-Sinai Medical Center's complete overhaul, including ETL pipelines, secure data extracts and transmissions, and encryptions. Her main skill set is Informatica Suite and Cloud, dbt, and she is proficient in the development and administration of ETL ecosystems and DevOps.
Portfolio
Experience
- Informatica ETL - 20 years
- Oracle Database - 20 years
- Data Warehouse Design - 20 years
- SQL - 20 years
- Data Warehousing - 20 years
- Python - 9 years
- Amazon Web Services (AWS) - 4 years
- Snowflake - 3 years
Availability
Preferred Environment
Amazon Web Services (AWS), Informatica, Python, Snowflake, Azure
The most amazing...
...Informatica ecosystem I built was for NBCUniversal. It had 18 environments across three regions and supported 30 business groups and nearly 1,000 users.
Work Experience
Senior BI Data Engineer
PepsiCo
- Built modern eCommerce Infomart as a cleansed, scalable, secure, and easy-to-navigate data layer that presents a Unified customer view of executive KPIs at a common grain.
- Used ELT for data loaded from Data Vault to reporting star schema data mart.
- Integrated data for marketing budget tracker, sales, share, out-of-stock, fill-rate, and in-store for eCommerce customers.
- Developed Digital Commerce Supply Chain pipelines, established a data foundation, and created data quality trackers and a reporting suite.
Cloud Integration Architect
Crowd Consulting
- Served as the subject area expert on heterogeneous cloud ETL practices across Azure and AWS clouds.
- Led Crowd Consulting’s integration tools and operations (Informatica/Talend) practice.
- Developed proprietary Crowd Consulting’s ETL integration framework combining custom Python and Bash scripting with Fivetran and Airflow.
Integration Solutions Architect
Comcast NBCUniversal
- Architected, installed, and supported six Informatica environments on-premises and same on the cloud. Each environment consists of multiple versions, e.g., 18 environments total across three geographic regions.
- Supported 30 business groups and up to 1,000 individual users.
- Developed ETL standards and best practices, manuals and documentation, sampler methodologies, and code. Conducted multiple POC’s and pilot projects on a cloud-based platform (Python, Airflow).
- Developed impact analysis reports and provided ad-hoc querying of Informatica metadata repositories in Oracle PL/SQL. Wrote shell scripts for system administration purposes.
Senior ETL Developer and Consultant
Cedars-Sinai Medical Center
- Oversaw the EPIC 2014 upgrade: Estimated impact, planned, coded, coordinated, and implemented changes across EDW tiers.
- Designed and developed ETL pipelines for the claims, eligibility, members, pharmacy, and lab data extracts from multiple source systems, shell scripts for secure data transmission, data analysis, and exceptions detection in Oracle.
- Tuned and optimized ETL pipelines and DB loads. Developed extract, encrypted, and secured files delivery to the third-party vendors.
Data Warehouse Engineer, EA Mobile
Electronic Arts
- Designed and developed ETL and DW processes in a multi-platform environment (Windows/Linux, Teradata/Oracle/SQL Server/MySQL/HiveQL, shell scripting, Perl, Python, and Informatica).
- Supported, enhanced, and optimized existing 30+ ETL processes; performed full integration cycle from different wireless carriers, partners, third-party websites, and internal company systems into sales, ranking, ads, and telemetry.
- Developed Informatica mappings/workflows/sessions, performed PowerCenter installations, configurations, and upgrades, managed access and security, and managed metadata repository.
Senior ETL Engineer and Consultant
Hewlett-Packard, HealthNet Government Account
- Developed projects supporting different data marts (medical management, claims, patient enrollment) within the health care program for Department of Defense retirees, active duty service members, and their families.
- Enhanced, developed, and implemented existing and new ETL processes (Informatica/Oracle/Unix).
- Participated in production support and data quality initiatives.
Senior ETL Developer
Universal Music Group
- Managed the ETL full life cycle development enhancement to existing DW pipelines. Data quality analysis, reload/updated strategies, and pipeline development. ETL performance tuning, production support 24x7 on a rotation basis, and 20+ ETL processes.
- Oversaw the migration of music mart data warehouse from DB2/AIX to Windows/SQL server project.
- Developed and communicated specifications for core DW processes to the offshore team and ensured implementation timelines. Coordinated the offshore team’s development effort.
- Participated in data models development and design; full life cycle development for migrated DW processes. Collaborated in the development efforts for the new ETL infrastructure.
Programmer Analyst and Developer
Computer Sciences Corporation (CSC), Raytheon Account
- Oversaw data warehouse development and enhancements and ETL to Oracle data warehouse from IBM mainframe sources.
- Converted Raytheon legacy financial systems to SAP. Worked with a team to design, develop, and deploy a migration application using Informatica Power Center and UNIX scripts.
- Migrated the Raytheon Approval Authority System from HR data warehouse (Sybase) to Oracle. Analyzed legacy data to capture possible exception cases and wrote SQL to generate exception reports.
Experience
Informatica Platform for Crowd Consulting's Tools ETL Practice
Education
Bachelor of Science Degree in Computer Science & Economics
Kiev National Economic University - Kiev, Ukraine
Certifications
Certified Oracle DBA
Oracle Corporation
Skills
Tools
Informatica ETL, Apache Airflow, Terraform, Informatica PowerCenter, dbt Cloud
Languages
SQL, Snowflake, Python, Bash, Perl
Paradigms
ETL, DevOps
Platforms
Azure, Amazon Web Services (AWS), Oracle Database, Talend, Oracle
Storage
PostgreSQL, Redshift, Teradata, PL/SQL, Microsoft SQL Server, MySQL, Apache Hive, IBM Db2, Sybase
Frameworks
Hadoop
Other
Data Warehousing, Data Warehouse Design, Informatica, Fivetran, iDQ, User Stories, Shell Scripting, Azure Data Lake, MicroStrategy, SAP, Data Engineering, Data Vaults, Data Build Tool (dbt), Computer Science, Programming, Economics, Operations, Development
How to Work with Toptal
Toptal matches you directly with global industry experts from our network in hours—not weeks or months.
Share your needs
Choose your talent
Start your risk-free talent trial
Top talent is in high demand.
Start hiring