Svetlana Karpilovskiy, Developer in Los Angeles, CA, United States

Svetlana Karpilovskiy

Verified Expert in Engineering

Data Warehouse Design Developer

Location
Los Angeles, CA, United States
Toptal Member Since
July 14, 2020

Svetlana is a top data engineer and architect with over 20 years of experience specializing in tool-based and script-based ETL/ELT for data warehouses, data lakes, and data analytical systems. She spearheaded a complete overhaul at Cedars-Sinai Medical Center that included ETL pipelines, secure data extracts and transmissions, and encryption. Her main skill set is Informatica Suite and Cloud, as well as Talend, and she is proficient in the development and administration of ETL ecosystems and DevOps.

Portfolio

Crowd Consulting
Amazon Web Services (AWS), Terraform, Apache Airflow, Python, SQL, Informatica...
Comcast NBCUniversal
Amazon Web Services (AWS), DevOps, Python, Bash, Azure, Snowflake, Teradata...
Cedars-Sinai Medical Center
Shell Scripting, User Stories, PL/SQL, Oracle, Informatica PowerCenter...

Experience

Availability

Part-time

Preferred Environment

Amazon Web Services (AWS), Informatica, Python, Snowflake, Azure

The most amazing...

...Informatica ecosystem I built was for NBCUniversal. It had 18 environments across three regions and supported 30 business groups and nearly 1,000 users.

Work Experience

Cloud Integration Architect

2020 - PRESENT
Crowd Consulting
  • Served as the subject area expert on heterogeneous cloud ETL practices across Azure and AWS clouds.
  • Led Crowd Consulting’s integration tools and operations (Informatica/Talend) practice.
  • Developed Crowd Consulting’s proprietary ETL integration framework, combining custom Python and Bash scripting with Fivetran and Airflow.
Technologies: Amazon Web Services (AWS), Terraform, Apache Airflow, Python, SQL, Informatica, Fivetran, PostgreSQL, Redshift, Snowflake, Azure

Integration Solutions Architect

2015 - 2020
Comcast NBCUniversal
  • Architected, installed, and supported six Informatica environments on-premises and the same number in the cloud. Each environment consisted of multiple versions, for 18 environments in total across three geographic regions.
  • Supported 30 business groups and up to 1,000 individual users.
  • Developed ETL standards and best practices, manuals and documentation, sampler methodologies, and code. Conducted multiple POCs and pilot projects on a cloud-based platform (Python, Airflow).
  • Developed impact analysis reports and provided ad-hoc querying of Informatica metadata repositories in Oracle PL/SQL. Wrote shell scripts for system administration purposes.
Technologies: Amazon Web Services (AWS), DevOps, Python, Bash, Azure, Snowflake, Teradata, Talend, iDQ, Informatica

Senior ETL Developer and Consultant

2014 - 2015
Cedars-Sinai Medical Center
  • Oversaw the EPIC 2014 upgrade: estimated impact and planned, coded, coordinated, and implemented changes across EDW tiers.
  • Designed and developed ETL pipelines for claims, eligibility, member, pharmacy, and lab data extracts from multiple source systems, along with shell scripts for secure data transmission. Performed data analysis and exception detection in Oracle.
  • Tuned and optimized ETL pipelines and DB loads. Developed the extraction, encryption, and secure delivery of files to third-party vendors.
Technologies: Shell Scripting, User Stories, PL/SQL, Oracle, Informatica PowerCenter, Data Warehousing, Data Warehouse Design

Data Warehouse Engineer, EA Mobile

2011 - 2014
Electronic Arts
  • Designed and developed ETL and DW processes in a multi-platform environment (Windows/Linux, Teradata/Oracle/SQL Server/MySQL/HiveQL, shell scripting, Perl, Python, and Informatica).
  • Supported, enhanced, and optimized 30+ existing ETL processes; performed the full integration cycle for data from different wireless carriers, partners, third-party websites, and internal company systems into sales, ranking, ads, and telemetry.
  • Developed Informatica mappings, workflows, and sessions; performed PowerCenter installations, configurations, and upgrades; and managed access, security, and the metadata repository.
Technologies: Azure Data Lake, Bash, Python, Perl, Apache Hive, Hadoop, MySQL, Microsoft SQL Server, Teradata, Oracle, Informatica

Senior ETL Engineer and Consultant

2010 - 2011
Hewlett-Packard, HealthNet Government Account
  • Developed projects supporting different data marts (medical management, claims, patient enrollment) within the health care program for Department of Defense retirees, active duty service members, and their families.
  • Enhanced, developed, and implemented existing and new ETL processes (Informatica/Oracle/Unix).
  • Participated in production support and data quality initiatives.
Technologies: Bash, Oracle, Informatica

Senior ETL Developer

2005 - 2010
Universal Music Group
  • Managed full life cycle ETL development and enhancements to existing DW pipelines, including data quality analysis, reload/update strategies, and pipeline development. Handled ETL performance tuning and 24x7 production support on a rotation basis for 20+ ETL processes.
  • Oversaw the project to migrate the music mart data warehouse from DB2/AIX to Windows/SQL Server.
  • Developed and communicated specifications for core DW processes to the offshore team and ensured implementation timelines were met. Coordinated the offshore team’s development effort.
  • Participated in data model design and development and in full life cycle development for migrated DW processes. Collaborated on the development of the new ETL infrastructure.
Technologies: Bash, MicroStrategy, IBM Db2, Oracle, Informatica

Programmer Analyst and Developer

1997 - 2005
Computer Sciences Corporation (CSC), Raytheon Account
  • Oversaw data warehouse development and enhancements, including ETL from IBM mainframe sources to an Oracle data warehouse.
  • Converted Raytheon legacy financial systems to SAP. Worked with a team to design, develop, and deploy a migration application using Informatica PowerCenter and UNIX scripts.
  • Migrated the Raytheon Approval Authority System from the HR data warehouse (Sybase) to Oracle. Analyzed legacy data to capture possible exception cases and wrote SQL to generate exception reports.
Technologies: Sybase, SAP, Bash, Oracle, Informatica

Informatica Platform for Crowd Consulting's Tools ETL Practice

http://www.crowdc.io
I developed the frameworks and standards for Crowd Consulting's new tools and ETL (Informatica/Talend) practice. Crowd Consulting, LLC is a boutique consulting firm specializing in cloud data engineering, data warehousing, data lake architecture, development, and operations.

Languages

SQL, Snowflake, Python, Bash, Perl

Tools

Informatica ETL, Apache Airflow, Terraform, Informatica PowerCenter

Paradigms

ETL, DevOps

Other

Data Warehousing, Data Warehouse Design, Informatica, Fivetran, iDQ, User Stories, Shell Scripting, Azure Data Lake, MicroStrategy, SAP

Platforms

Azure, Amazon Web Services (AWS), Oracle Database, Talend, Oracle

Storage

PostgreSQL, Redshift, Teradata, PL/SQL, Microsoft SQL Server, MySQL, Apache Hive, IBM Db2, Sybase

Frameworks

Hadoop

1987 - 1992

Bachelor of Science Degree in Computer Science & Economics

Kiev National Economic University - Kiev, Ukraine

NOVEMBER 2001 - PRESENT

Certified Oracle DBA

Oracle Corporation
