Art Vancil, Developer in Charlottesville, VA, United States

Art Vancil

Verified Expert in Engineering

Data Architect and Developer

Location
Charlottesville, VA, United States
Toptal Member Since
March 9, 2020

Art has 25 years of data architecture and cloud-computing consulting experience—mostly in building enterprise data warehouses. Art is an end-to-end solution architect and chief problem solver with a long history of focused execution—according to a statement of work—and successful delivery in a team setting.

Portfolio

AT&T
Synapse, BigQuery, Snowflake, Redshift, Actian, Azure Synapse, SQL...
Lio Insurance
Profisee MDM, Snowflake, Matillion ETL for Redshift, SQL Server 2015...
American Associated Pharmacies
Data Modeling, Azure SQL, Microsoft Power BI, Azure Data Factory...

Availability

Part-time

Preferred Environment

Azure, T-SQL (Transact-SQL), Microsoft Power BI, PostgreSQL, PL/SQL, Erwin, Azure Logic Apps, Redshift, Amazon Web Services (AWS), Data Engineering, Architecture, Python, Azure Analysis Services, Databricks, Salesforce, SQL, Data Science, Business Intelligence (BI), Data Visualization, Data Loading, Database Design, Database Schema Design, Reporting, Integration, Amazon S3 (AWS S3), Data Analytics, Technical Leadership, Distributed Systems, Cloud, Algorithms, Leadership, Data Warehouse Design, Consumer Packaged Goods (CPG), Cloud Storage, Data Architecture, Logical Database Design, Database Architecture, Excel 2016, Azure Databricks, Azure Blobs, Azure Queue Storage, Event-driven Architecture

The most amazing...

...software I've developed is a hash join algorithm for joining many tables. This high-volume solution outperformed Db2's table joins by 66%.
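For readers unfamiliar with the technique, the following is a minimal sketch of a generic, textbook multi-table hash join. It is not Art's algorithm, whose details are not described here. It assumes a star-shaped join in which every join key lives on one large "probe" table, and the names `build_index`, `hash_join_many`, and the sample tables are hypothetical.

```python
from collections import defaultdict
from typing import Dict, Iterable, Iterator, List, Tuple

Row = Dict[str, object]


def build_index(rows: Iterable[Row], key: str) -> Dict[object, List[Row]]:
    """Build phase: bucket each row of a lookup table by its join key."""
    index: Dict[object, List[Row]] = defaultdict(list)
    for row in rows:
        index[row[key]].append(row)
    return index


def hash_join_many(
    probe_rows: Iterable[Row],
    lookups: List[Tuple[str, Dict[object, List[Row]]]],
) -> Iterator[Row]:
    """Probe phase: stream the probe table once and inner-join it against
    every pre-built index. Each entry in `lookups` is (probe_key, index)."""
    for row in probe_rows:
        partial = [row]
        for probe_key, index in lookups:
            # .get avoids inserting empty buckets into the defaultdict.
            matches = index.get(row[probe_key], [])
            partial = [{**left, **right} for left in partial for right in matches]
            if not partial:
                break  # inner join: no match in one table drops the row
        yield from partial


# Hypothetical sample data, for illustration only.
orders = [
    {"order_id": 1, "customer_id": 10, "product_id": 100},
    {"order_id": 2, "customer_id": 11, "product_id": 101},
    {"order_id": 3, "customer_id": 99, "product_id": 100},  # no matching customer
]
customers = [{"customer_id": 10, "name": "Acme"}, {"customer_id": 11, "name": "Globex"}]
products = [{"product_id": 100, "sku": "A-1"}, {"product_id": 101, "sku": "B-2"}]

joined = list(hash_join_many(orders, [
    ("customer_id", build_index(customers, "customer_id")),
    ("product_id", build_index(products, "product_id")),
]))
```

In this sketch each lookup table is hashed once and the large table is scanned a single time, which is the usual reason a hash join can beat repeated pairwise joins when the lookup tables fit in memory.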

Work Experience

Snowflake Data Architect (Contract)

2022 - 2023
AT&T
  • Designed the Snowflake data hub using data modeling, SQL, and Databricks.
  • Created ETL specifications for offshore developers and supported them with guidance and testing.
  • Performed data quality testing and validation for Teradata source data and Snowflake target data.
  • Optimized and rewrote queries to tune them to the highest level of performance.
Technologies: Synapse, BigQuery, Snowflake, Redshift, Actian, Azure Synapse, SQL, Business Intelligence (BI), Data Loading, Cloud, Data Engineering, IT Strategy, Big Data Architecture, Data Management, Delivery Management, Engineering, Cloud Architecture, PL/SQL Tuning, Relational Databases, Cloud Infrastructure, Performance Tuning, Teradata, Business Requirements

MDM Data Architect

2022 - 2022
Lio Insurance
  • Implemented the customer master subject area in Profisee.
  • Designed data load specifications based on requirements.
  • Extracted Snowflake source data for Profisee loads.
  • Attained strong customer feedback for team leadership and problem-solving.
Technologies: Profisee MDM, Snowflake, Matillion ETL for Redshift, SQL Server 2015, Master Data, Data Management, Data Pipelines, Agile Project Management, Engineering, Cloud Architecture, PL/SQL Tuning, Database Transactions, Data Migration, ETL Tools, Insurance, Insurance Technology (Insurtech), Business Requirements

Azure Data Warehouse Architect

2021 - 2022
American Associated Pharmacies
  • Designed and developed an Azure data warehouse using Azure SQL, Azure Data Factory, and Power BI to produce sales and profitability analysis and customer-facing reports embedded on the RXAAP website.
  • Created relational data models using IDERA ER/Studio; deployed data models to physical Azure SQL databases.
  • Developed ETL to load the data warehouse tables using Azure Data Factory.
  • Created sales and rebate reports embedded in the RXAAP website using Power BI.
Technologies: Data Modeling, Azure SQL, Microsoft Power BI, Azure Data Factory, ER/Studio Data Architecture, SQL, Business Intelligence (BI), Data Visualization, Data Loading, Database Design, Database Schema Design, Reporting, Integration, Data Analytics, Technical Leadership, Distributed Systems, Cloud, Data Engineering, Master Data Management (MDM), MDM, Data Structures, ETL, Data Pipelines, Data Lakes, Microsoft SQL Server, Data Warehouse Design, File Systems, Data Architecture, Logical Database Design, Database Architecture, Key Performance Indicators (KPIs), Microsoft Excel, Data Transformation, Data Cleansing, Data Profiling, Agile Project Management, Engineering, Cloud Architecture, PL/SQL Tuning, XML, GitHub, Relational Databases, Database Structure, Dedicated SQL Pool (formerly SQL DW), Azure SQL Data Warehouse, Azure Data Lake, OLAP, OLTP, Application Architecture, Cloud Infrastructure, ETL Tools, Git, Unstructured Data Analysis, Business Requirements, Machine Learning Operations (MLOps), Dashboards, Relational Database Design, Data Marts, Technical Design, SQL Architecture

Cloud Data Engineer (Contract)

2020 - 2021
McKnight Consulting Group
  • Performed Cloud Big Data benchmark tests to compare the performance of five big data tools.
  • Loaded industry-standard TPC-H test data of 30TB into five different database platforms.
  • Tuned database storage and indexing features to optimize storage and performance.
  • Executed a suite of standard SQL queries and tuned the performance of those queries.
Technologies: Actian, Redshift, Snowflake, Synapse, BigQuery, Big Data

AWS Cloud Architect (Contract)

2020 - 2020
Anthem Wellpoint
  • Compared Atlas and MongoDB versus DocumentDB versus DynamoDB to recommend the best-performing solution for a real-time data streaming solution. Identified limitations and advantages of each tool.
  • Conducted the AWS Well-Architected review, recommending reliability and performance upgrades to the cloud environment.
  • Created a new AWS data-streaming architecture to combine batch and real-time data updates, transaction logging, and JSON document handling.
  • Evaluated and optimized a real-time data streaming application in AWS by introducing GraphQL and DocumentDB.
Technologies: Amazon DynamoDB, Atlas, DocumentDB, MongoDB, GraphQL, Apache Kafka, Amazon Web Services (AWS), SQL, Business Intelligence (BI), Amazon S3 (AWS S3), Cloud, Data Engineering, Cloud Architecture, Data Pipelines, Engineering, Big Data, AWS Lambda, Amazon EC2, Relational Databases, Message Queues, Amazon RDS, Database Transactions, Transactions, Cloud Infrastructure, Oracle Cerner, Insurance, Insurance Technology (Insurtech), Business Requirements, Healthcare Management Systems, Healthcare Effectiveness Data and Information Set (HEDIS), AWS Cloud Architecture, Healthcare

Azure Data Architect (Contract)

2020 - 2020
BioTE Medical (through a development agency)
  • Created data models and cloud architecture models to dramatically restructure the enterprise databases for conversion to Azure cloud microservices. Transformed monolithic MDM into domain-based data stores.
  • Selected a data vault data design pattern. Implemented a GraphQL middleware for data virtualization.
  • Led the C++ .NET Core team to minimize production support impact upon project delivery.
  • Created Power BI dashboards and an OLAP data design, supporting sales and project performance.
Technologies: Microsoft Power BI, Auth0, .NET Core, GraphQL, Azure Application Insights, Azure Logic Apps, Azure Cosmos DB, Redis, Azure Event Hubs, Apache Kafka, Azure SQL, Cloud Architecture, Azure Functions, Engineering, Database Structure, Bioinformatics, OLAP, OLTP, DevOps, Cloud Infrastructure, Data Analysis, DAX, Dashboards, Cloud Monitoring, Healthcare Management Systems, Healthcare Effectiveness Data and Information Set (HEDIS), Data Marts, SQL Architecture, Healthcare

Software Release Manager

2018 - 2020
TAMKO Building Products
  • Resolved developer issues with embedded modules.
  • Defined the software distribution process.
  • Monitored software distribution schedules and successful upgrades for each customer.
Technologies: Power BI Embedded, Browsers, Software Development Management, People Management, Agile Project Management, Engineering Management, Delivery Management, Engineering, Cloud Architecture, Program Management, Data Marts, Software Design, Troubleshooting, Technical Design, System Integration, IT Project Management, SQL Architecture, Executive Presentations, Organization

Data Science Team Leader (Freelance)

2018 - 2020
TAMKO Building Products
  • Created the vision and strategy for a manufacturing data warehouse using relational star-schema storage, ETL, and Power BI dashboards.
  • Created Power BI interactive analytics dashboards with a Java front end to identify millions of dollars in cost savings and control the manufacturing process. Designed an OLAP data structure for reporting.
  • Proposed the data governance program including the IT, business, and PMO roles.
  • Supervised 12 developers and DBAs to enable self-service analytics through team leadership, data strategy, and an execution roadmap.
Technologies: Microsoft Power BI, SAP HANA, Microsoft SQL Server, Azure, SQL Server Management Studio (SSMS), Statistical Modeling, Dashboard Development, Team Management, Engineering Management, Delivery Management, Engineering, Cloud Architecture, OLAP, DAX, Machine Learning Operations (MLOps), Bill of Material

Global Center for Innovative Analytics Director

1999 - 2018
Hitachi Consulting
  • Prepared business cases and prepped data for the data science team. Delivered dozens of predictive analytics solutions for the manufacturing, mining, automotive, and transportation industries.
  • Defined the predictive maintenance solution offering, including solution architecture, software, and services components. Performed POCs and client engagements to implement the solutions.
  • Delivered large-scale global cloud migrations to AWS and Azure for financial services, pharmaceutical, and manufacturing companies, with work spanning Hadoop, Redshift, DevOps, Impala, and Power BI.
  • Defined the big data product offering, including Hadoop hardware specifications, IoT machine data collection, and analysis.
Technologies: Pentaho Data Integration (Kettle), Redshift, Microsoft SQL Server, Hadoop, Machine Learning, Management Systems, Transportation & Logistics, Data Pipelines, Software Development Management, People Management, Team Management, Engineering Management, Delivery Management, Engineering, Cloud Architecture, Oracle PL/SQL, Unix Shell Scripting, XML, Program Management, Relational Databases, Genomics, OLAP, ETL Tools, SQL Server Integration Services (SSIS), SQL Server Analysis Services (SSAS), Machine Learning Operations (MLOps), Bill of Material, Feasibility, Technical Consulting, Information Gathering, Data Marts, System Integration, Consulting, IT Project Management, IT Consulting, Solution Architecture, Executive Presentations, Organization, Due Diligence, Banking & Finance, Automotive, Netezza

Principal Data Architect

1994 - 1999
Kaiser Permanente
  • Led Kaiser Permanente HEDIS reporting to fourth-place national recognition for clinical quality.
  • Defined business requirements to meet the reporting/analysis needs of the physicians, claims, medical quality, and finance departments.
  • Delivered database design and ETL data load processes.
Technologies: Biometrics, Bioinformatics, Data Analysis, Dynamic SQL, Oracle Cerner, Insurance, Healthcare Management Systems, Healthcare Effectiveness Data and Information Set (HEDIS), Relational Database Design, IT Consulting, Solution Architecture, SQL Architecture, Executive Presentations, Healthcare

Senior Software Engineer

1976 - 1979
General Dynamics
  • Designed and delivered custom Fortran solutions on DEC PDP minicomputers to collect real-time flight instrumentation data.
  • Designed and delivered custom Fortran solutions on DEC PDP minicomputers to create real-time test flight reports and to provide real-time controls for data collection storage devices.
  • Designed and delivered custom Fortran solutions on DEC PDP minicomputers to control the execution of CNC inspection equipment.
Technologies: Technical Reports, Software Development Lifecycle (SDLC), IT Infrastructure, IT Operations Management (ITOM), Software Architecture, Engineering, Fortran, Troubleshooting, Technical Design, System Integration

Implementation of Cloudera on AWS Platform for a Leading Semiconductor Manufacturing Company

I undertook a pilot implementation of Cloudera on the AWS platform, providing advice on technology tools, architecture, and ETL design. I also managed the project tasks and deliverables and designed and developed a supply chain traceability solution.

The technology stack included AWS, Cloudera Hadoop, Hue, Impala, Hive, Sqoop, Superset, StreamSets, Tableau, Neo4J, SQL Server, and Oracle.

Salesforce.com Data Extraction for an Internet Banking Company

I designed a strategy for high-volume data warehouse extracts, developing a daily metrics subsystem for 140 million accounts. I also delivered 100GB daily feeds from AWS to Marketing Cloud and tuned Redshift data storage and SQL script execution performance.

The technology stack included Redshift and Marketing Cloud.

Asset Optimization Solution

I defined the asset optimization solution strategy among Hitachi Group companies, managing software development of solution artifacts, including the oversight of offshore development and data science teams. I implemented solutions for an equipment health index and for optimizing inspection cycles.

The technology stack included Domo, Ammo, Pentaho, and Oracle Enterprise Asset Management (EAM).

Operations Data Warehouse for a Fortune 100 Technology Services Company

I designed a normalized, historical operations data warehouse using data vault design techniques. I also developed data models and implemented physical databases in Oracle and SQL Server. In addition, I sourced SAP data for loading the data warehouse and reproduced SAP utilization and labor cost calculations for the SQL Server star schema. Finally, I designed Informatica ETL mapping requirements, reviewed deliverables from multiple teams, and instructed the teams in data warehousing best practices.

The technology stack included Erwin, Oracle, Informatica, Microsoft SQL Server, and SAP.

Microservices Enterprise Architecture for a Pharmaceutical Services Company

I redesigned a monolithic .NET application as cloud-native microservices. I also designed an Event Hubs pub/sub messaging strategy and a normalized, historical operations data warehouse using data vault design techniques. I then developed data models and implemented physical databases in Azure DB, relying on rapid, agile development using data patterns and service patterns.

The technology stack included Microsoft Power BI, Microsoft Azure SQL Database, Event Hubs, Logic Apps, Application Insights, and Angular.

Analytics Strategy and Data Warehouse for a Leading Media and Entertainment Company

I delivered a data architecture review and strategy for multiple applications within the national facilities and network engineering department, proposing and assisting the transition to a corporate information factory architecture. In addition, I introduced normalized data modeling, data vault data modeling, and star-schema data modeling. I also proposed new roles to move toward an advanced analytics center of excellence and further SDLC steps to support design sprints and change control.

The technology stack included OpenJDK, PostgreSQL, RabbitMQ, and Pentaho Data Integration (PDI).

Enterprise Data Strategy for a Building Products Manufacturing Company

As a data science team leader, I provided team leadership, a data strategy, and an execution roadmap to enable self-service analytics. I also created a vision and strategy for a manufacturing data warehouse, leading two different analytics development teams with 12 members. Finally, I delivered Power BI control charts, innovative analytics, and visualizations, saving millions of dollars.

The technology stack included Microsoft SQL Server and SAP HANA environments, Power BI, and SAP Analytics Cloud.

Languages

T-SQL (Transact-SQL), SQL, Fortran, Snowflake, Python, R, GraphQL, Python 2, XML

Tools

Microsoft Power BI, Informatica ETL, Erwin, Hue, Impala, Azure Logic Apps, Microsoft Excel, Excel 2016, Lucidchart, STATA, Pentaho Data Integration (Kettle), Azure Application Insights, Auth0, Actian, BigQuery, Synapse, Tableau, Cloudera, RabbitMQ, Matillion ETL for Redshift, Power BI Embedded, GitHub, Git

Paradigms

Database Design, Data Science, ETL, Business Intelligence (BI), Dimensional Modeling, OLAP, Agile Project Management, Application Architecture, Event-driven Architecture, DevOps

Platforms

Amazon Web Services (AWS), Azure, Databricks, Azure SQL Data Warehouse, Dedicated SQL Pool (formerly SQL DW), Azure Event Hubs, Azure Synapse, Azure Functions, Oracle Cerner, Oracle, SAP HANA, Apache Kafka, Amazon EC2, Salesforce, Pentaho, AWS Lambda

Storage

Microsoft SQL Server, SQL Server Management Studio (SSMS), PostgreSQL, Azure SQL, Databases, Relational Databases, Data Pipelines, Data Lakes, Database Architecture, Oracle PL/SQL, Database Structure, Dynamic SQL, SQL Architecture, Redshift, PL/SQL, Apache Hive, HDFS, Amazon S3 (AWS S3), Master Data Management (MDM), Database Transactions, OLTP, DB, Redis, Azure Cosmos DB, MongoDB, Amazon DynamoDB, Netezza, Column-oriented DBMS, Azure SQL Databases, MySQL, ER/Studio Data Architecture, JSON, Teradata, Azure Blobs, Azure Queue Storage, SQL Server Integration Services (SSIS), SQL Server Analysis Services (SSAS)

Other

Data Modeling, Data Management, Solution Architecture, IT Consulting, IT Project Management, Consulting, Data Warehouse Design, Leadership, Technical Design, Architecture, Troubleshooting, Data Architecture, Big Data Architecture, Data Analysis, Data Queries, Big Data, Data, Data Engineering, Data Marts, Relational Database Design, Cloud Architecture, Healthcare Effectiveness Data and Information Set (HEDIS), Data Warehousing, Data Loading, Database Schema Design, Reporting, Integration, Data Analytics, Cloud, Data Structures, Logical Database Design, Information Gathering, Data Transformation, Data Cleansing, Delivery Management, Engineering, PL/SQL Tuning, Cloud Infrastructure, Data Migration, ETL Tools, Business Requirements, Informatica, System Integration, AWS Cloud Architecture, Software Design, Analytics, Software Development, Predictive Analytics, Performance Management, Manufacturing, Healthcare Services, Agile Data Science, Financial Services, Consumer Packaged Goods (CPG), Software, Business Process Analysis, Algorithms, Azure Data Factory, Dashboards, Data Visualization, Healthcare Management Systems, Technical Leadership, Distributed Systems, MDM, Master Data, IT Strategy, Cloud Storage, File Systems, Data Profiling, Technical Consulting, Feasibility, Management Systems, Transportation & Logistics, Software Development Management, People Management, Team Management, eCommerce, Engineering Management, Program Management, Amazon RDS, Transactions, Azure Data Lake, Azure Databricks, Statistics, Performance Tuning, DAX, Unstructured Data Analysis, Insurance Technology (Insurtech), Machine Learning Operations (MLOps), Bill of Material, Data Vaults, Machine Learning, DocumentDB, Due Diligence, Software Architecture, IT Operations Management (ITOM), IT Infrastructure, Organization, Software Development Lifecycle (SDLC), Technical Reports, Executive Presentations, Atlas, Documentation, Oracle R, Government, Data Governance, Data Center Management, Hardware, Claims, Parquet, Benefit Administration, Strategy, Internet of Things (IoT), Computer Science, Marketing Cloud, SAP, Team Leadership, Azure Analysis Services, Profisee MDM, SQL Server 2015, Browsers, Key Performance Indicators (KPIs), Statistical Modeling, Dashboard Development, Unix Shell Scripting, CI/CD Pipelines, Message Queues, Genomics, Biometrics, Security, Delta Lake, Cloud Monitoring

Frameworks

Hadoop, .NET Core

Industry Expertise

Bioinformatics, Insurance, Banking & Finance, Automotive, Healthcare

Education

2012 - 2013

Coursework in Quantitative Methods in Clinical and Public Health Research

Harvard University - Cambridge, MA, United States

1973 - 1976

Bachelor of Science Degree in Computer Science

Louisiana Tech University - Ruston, LA, United States

Certifications

JUNE 2022 - JUNE 2023

Databricks Accredited Lakehouse Fundamentals

Databricks

MAY 2013 - PRESENT

Data Science Essentials

Cloudera

JANUARY 2013 - PRESENT

Quantitative Methods in Clinical and Public Health Research

Harvard Medical School and Harvard School of Public Health

NOVEMBER 2012 - PRESENT

Certified Cloud Security Knowledge (CCSK)

Cloud Security Alliance

APRIL 1995 - PRESENT

Certified Computing Professional

Institute for Certification of Computing Professionals
