Art Vancil, Developer in Charlottesville, VA, United States
Art is available for hire
Hire Art

Art Vancil

Bio

Art has 25 years of data architecture and cloud-computing consulting experience—mostly in building enterprise data warehouses. Art is an end-to-end solution architect and chief problem solver with a long history of focused execution—according to a statement of work—and successful delivery in a team setting.

Portfolio

Healthcare Start-up
Python, SQL, AI Prompts, AI Modeling
Brightly
Data Strategy, ETL, Data Engineering, Agile Software Development, CRM Design...
AT&T
Synapse, BigQuery, Snowflake, Redshift, Actian, Azure Synapse, SQL...

Experience

  • Data Architecture - 20 years
  • SQL Server Management Studio (SSMS) - 12 years
  • Algorithms - 9 years
  • Erwin - 7 years
  • Cloud Architecture - 6 years
  • Predictive Analytics - 6 years
  • PostgreSQL - 3 years
  • Azure SQL - 2 years

Preferred Environment

Azure, Transact-SQL (T-SQL), Microsoft Power BI, PostgreSQL, PL/SQL, Erwin, Azure Logic Apps, Redshift, Amazon Web Services (AWS), Data Engineering, Architecture, Python, Azure Analysis Services, Databricks, Salesforce, SQL, Data Science, Business Intelligence (BI), Data Visualization, Data Loading, Database Design, Database Schema Design, Reporting, Integration, Amazon S3 (AWS S3), Data Analytics, Technical Leadership, Distributed Systems, Cloud, Algorithms, Leadership, Data Warehouse Design, Consumer Packaged Goods (CPG), Cloud Storage, Data Architecture, Logical Database Design, Database Architecture, Excel 2016, Azure Databricks, Azure Blobs, Azure Queue Storage, Event-driven Architecture, AI Prompts

The most amazing...

...software I've developed is a hash join algorithm for joining many tables. This high-volume solution outperformed Db2's table joins by 66%.

Work Experience

Python Developer

2024 - PRESENT
Healthcare Start-up
  • Developed a database and Python ETL framework that collects healthcare clinical diagnoses and treatment protocols.
  • Built data loading jobs from a variety of published clinical sources, including website scrapers and OCR publications.
  • Introduced AI capabilities and made the data available in published form for non-technical users.
Technologies: Python, SQL, AI Prompts, AI Modeling

ETL Data Strategist

2023 - PRESENT
Brightly
  • Conducted data strategy workshops with IT leaders to discover solutions to ETL and business process bottlenecks.
  • Proposed and designed the ETL tool cloud migration to AWS Glue.
  • Designed process workflow automation, including Salesforce, Glue, Snowflake, and Azure SQL, to reduce process duration from weeks to one day.
Technologies: Data Strategy, ETL, Data Engineering, Agile Software Development, CRM Design, CRM Configuration, CRM Implementation (Oracle), CRM Implementation (Salesforce), Talend, Microservices, NoSQL, Database Caching, Strategy, Technical Product Management, Business Process Re-engineering, System Architecture, Infrastructure as Code (IaC), Technical Architecture, TOGAF, Data Conversion, Data Mapping, Lucidchart, Excel 365, Miro, Business Systems Analysis

Snowflake Data Architect (Contract)

2022 - 2023
AT&T
  • Designed the Snowflake data hub using data modeling, SQL, and Databricks.
  • Created ETL specifications for offshore developers and supported them with guidance and testing.
  • Performed data quality testing and validation for Teradata source data and Snowflake target data.
  • Optimized and rewrote queries to tune them to the highest level of performance.
Technologies: Synapse, BigQuery, Snowflake, Redshift, Actian, Azure Synapse, SQL, Business Intelligence (BI), Data Loading, Cloud, Data Engineering, IT Strategy, Big Data Architecture, Data Management, Delivery Management, Engineering, Cloud Architecture, PL/SQL Tuning, Relational Databases, Cloud Infrastructure, Performance Tuning, Teradata, Business Requirements, Customer Relationship Management (CRM), Web, System Implementation, Data Conversion, Data Mapping, Excel 365, ERD, Azure Databricks

MDM Data Architect

2022 - 2022
Lio Insurance
  • Implemented the customer master subject area in Profisee.
  • Designed data load specifications based on requirements.
  • Extracted Snowflake source data for Profisee loads.
  • Attained strong customer feedback for team leadership and problem-solving.
Technologies: Profisee MDM, Snowflake, Matillion ETL for Redshift, SQL Server 2015, Master Data, Data Management, Data Pipelines, Agile Project Management, Engineering, Cloud Architecture, PL/SQL Tuning, Database Transactions, Data Migration, ETL Tools, Insurance, Insurance Technology (Insurtech), Business Requirements, Customer Relationship Management (CRM), CRM Design, Business Process Re-engineering, System Architecture, Technical Architecture, Technical Writing, Excel 365, Visio, Data Governance

Azure Data Warehouse Architect

2021 - 2022
American Associated Pharmacies
  • Designed and developed Azure data warehouse using Azure SQL, Azure Data Factory, and Power BI to product sales and profitability analysis and customer-facing reports embedded on the RXAAP website.
  • Created relational data models using IDERA ER/Studio; deployed data models to physical Azure SQL databases.
  • Developed ETL to load the data warehouse tables using Azure Data Factory.
  • Created sales and rebate reports embedded in the RXAAP website using Power BI.
Technologies: Data Modeling, Azure SQL, Microsoft Power BI, Azure Data Factory (ADF), ER/Studio Data Architecture, SQL, Business Intelligence (BI), Data Visualization, Data Loading, Database Design, Database Schema Design, Reporting, Integration, Data Analytics, Technical Leadership, Distributed Systems, Cloud, Data Engineering, Master Data Management (MDM), MDM, Data Structures, ETL, Data Pipelines, Data Lakes, Microsoft SQL Server, Data Warehouse Design, File Systems, Data Architecture, Logical Database Design, Database Architecture, Key Performance Indicators (KPIs), Microsoft Excel, Data Transformation, Data Cleansing, Data Profiling, Agile Project Management, Engineering, Cloud Architecture, PL/SQL Tuning, XML, GitHub, Relational Databases, Database Structure, Dedicated SQL Pool (formerly SQL DW), Azure SQL Data Warehouse, Azure Data Lake, OLAP, OLTP, Application Architecture, Cloud Infrastructure, ETL Tools, Git, Unstructured Data Analysis, Business Requirements, Machine Learning Operations (MLOps), Dashboards, Relational Database Design, Data Marts, Technical Design, SQL Architecture, Customer Relationship Management (CRM), Web, CRM Design, CRM Implementation (Salesforce), Infrastructure as Code (IaC), Technical Architecture, Data Mapping, Technical Writing, Excel 365, Systems Analysis, ERD, Program Management, Azure, DAX, Analytics, Azure Databricks, Azure Data Lake Storage

Cloud Data Engineer (Contract)

2020 - 2021
McKnight Consulting Group
  • Performed Cloud Big Data benchmark tests to compare the performance of five big data tools.
  • Loaded industry-standard TPC-H test data of 30TB into five different database platforms.
  • Tuned database storage and indexing features to optimize storage and performance.
  • Executed a suite of standard SQL queries and tuned the performance of those queries.
Technologies: Actian, Redshift, Snowflake, Synapse, BigQuery, Big Data, Dell Boomi, Talend, Software as a Service (SaaS), Technical Architecture, Excel 365, Azure Synapse Analytics

AWS Cloud Architect (Contract)

2020 - 2020
Anthem Wellpoint
  • Compared Atlas and MongoDB versus DocumentDB versus DynamoDB to recommend the best-performing solution for a real-time data streaming solution. Identified limitations and advantages of each tool.
  • Conducted the AWS Well-Architected review, recommending reliability and performance upgrades to the cloud environment.
  • Created a new AWS data-streaming architecture to combine batch and real-time data updates, transaction logging, and JSON document handling.
  • Evaluated and optimized a real-time data streaming application in AWS by introducing GraphQL and DocumentDB.
Technologies: Amazon DynamoDB, Atlas, DocumentDB, MongoDB, GraphQL, Apache Kafka, Amazon Web Services (AWS), SQL, Business Intelligence (BI), Amazon S3 (AWS S3), Cloud, Data Engineering, Cloud Architecture, Data Pipelines, Engineering, Big Data, AWS Lambda, Amazon EC2, Relational Databases, Message Queues, Amazon RDS, Database Transactions, Transactions, Cloud Infrastructure, Oracle Cerner, Insurance, Insurance Technology (Insurtech), Business Requirements, Healthcare Management Systems, Healthcare Effectiveness Data and Information Set (HEDIS), AWS Cloud Architecture, Healthcare, Real-time Data, CRM Design, NoSQL, Business Process Re-engineering, System Architecture, Technical Writing, Excel 365, Systems Analysis

Azure Data Architect (Contract)

2020 - 2020
BioTE Medical (through a development agency)
  • Created data models and cloud architecture models to dramatically restructured the enterprise databases for conversion to Azure cloud microservices. Transformed monolithic MDM into domain-based data stores.
  • Selected a data vault data design pattern. Implemented a GraphQL middleware for data virtualization.
  • Led the C++ .NET Core team to minimize production support impact upon project delivery.
  • Created Power BI dashboards and an OLAP data design, supporting sales and project performance.
Technologies: Microsoft Power BI, Auth0, .NET Core, GraphQL, Azure Application Insights, Azure Logic Apps, Azure Cosmos DB, Redis, Azure Event Hubs, Apache Kafka, Azure SQL, Cloud Architecture, Azure Functions, Engineering, Database Structure, Bioinformatics, OLAP, OLTP, DevOps, Cloud Infrastructure, Data Analysis, DAX, Dashboards, Cloud Monitoring, Healthcare Management Systems, Healthcare Effectiveness Data and Information Set (HEDIS), Data Marts, SQL Architecture, Healthcare, Enterprise Architecture, Microservices, Technical Architecture, Data Mapping, Excel 365, Systems Analysis, Domain-driven Design (DDD), Azure, Analytics, Azure Data Factory (ADF)

Software Release Manager

2018 - 2020
TAMKO Building Products
  • Resolved developer issues with embedded modules.
  • Defined the software distribution process.
  • Monitored software distribution schedules and successful upgrades for each customer.
Technologies: Power BI Embedded, Browsers, Software Development Management, People Management, Agile Project Management, Engineering Management, Delivery Management, Engineering, Cloud Architecture, Program Management, Data Marts, Software Design, Troubleshooting, Technical Design, System Integration, IT Project Management, SQL Architecture, Executive Presentations, Organization, Software, Agile Software Development, CRM Configuration, New Products, Technical Architecture, TOGAF, Excel 365, Analytics, IT Management

Data Science Team Leader (Freelance)

2018 - 2020
TAMKO Building Products
  • Created the vision and strategy for a manufacturing data warehouse using relational star-schema storage, ETL, and Power BI dashboards.
  • Created Power BI interactive analytics dashboards with a Java front end to identify $millions cost savings and control the manufacturing process. Designed an OLAP data structure for reporting.
  • Proposed the data governance program including the IT, business, and PMO roles.
  • Supervised 12 developers and DBAs to enable self-service analytics through team leadership, data strategy, and an execution roadmap.
Technologies: Microsoft Power BI, SAP HANA, Microsoft SQL Server, Azure, SQL Server Management Studio (SSMS), Statistical Modeling, Dashboard Development, Team Management, Engineering Management, Delivery Management, Engineering, Cloud Architecture, OLAP, DAX, Machine Learning Operations (MLOps), Bill of Material, Real-time Data, MQTT, Technical Writing, Excel 365, Visio, ERD, Analytics

Global Center for Innovative Analytics Director

1999 - 2018
Hitachi Consulting
  • Prepared business cases and prepped data for the data science team. Delivered dozens of predictive analytics solutions for the manufacturing, mining, automotive, and transportation industries.
  • Defined the predictive maintenance solution offering, including solution architecture, software, and services components. Performed POCs and client engagements to implement the solutions.
  • Delivered large-scale global cloud migrations to AWS and Azure for financial services, pharmaceutical, and manufacturing companies including Hadoop, Redshift, DevOps, Impala, and Power BI.
  • Defined the big data product offering, including Hadoop hardware specifications, IoT machine data collection, and analysis.
Technologies: Pentaho Data Integration (Kettle), Redshift, Microsoft SQL Server, Hadoop, Machine Learning, Transportation & Logistics, Management Systems, Data Pipelines, Software Development Management, People Management, Team Management, Engineering Management, Delivery Management, Engineering, Cloud Architecture, Oracle PL/SQL, Unix Shell Scripting, XML, Program Management, Relational Databases, Genomics, OLAP, ETL Tools, SQL Server Integration Services (SSIS), SQL Server Analysis Services (SSAS), Machine Learning Operations (MLOps), Bill of Material, Feasibility, Technical Consulting, Information Gathering, Data Marts, System Integration, Consulting, IT Project Management, IT Consulting, Solution Architecture, Executive Presentations, Organization, Due Diligence, Banking & Finance, Automotive, Netezza, Real-time Data, MQTT, CRM Implementation (Oracle), CRM Implementation (Salesforce), HRIS, Database Caching, Strategy, New Products, Oracle ERP, Technical Architecture, Data Mapping, SharePoint, Technical Writing, Visio, Business Systems Analysis, ERD, Amazon Redshift, Azure, Analytics, Azure Data Factory (ADF), IT Management

Principal Data Architect

1994 - 1999
Kaiser Permanente and Neighborhood Health Plan
  • Led Kaiser Permanente HEDIS reporting to achieve 4th-place national recognition for clinical quality.
  • Defined business requirements to meet the reporting and analysis needs of the physicians, claims, medical quality, and finance departments.
  • Delivered database design and ETL data load processes, including a Cognos conceptual layer for analytic elements in user terminology.
Technologies: Biometrics, Bioinformatics, Data Analysis, Dynamic SQL, Oracle Cerner, Insurance, Healthcare Management Systems, Healthcare Effectiveness Data and Information Set (HEDIS), Relational Database Design, IT Consulting, Solution Architecture, SQL Architecture, Executive Presentations, Healthcare, IBM Cognos, CRM Configuration, CRM Implementation (Salesforce), Business Systems Analysis, ERD, Analytics

Senior Software Engineer

1976 - 1979
General Dynamics
  • Designed and delivered custom Fortran solutions on DEC PDP minicomputers to collect real-time flight instrumentation data.
  • Designed and delivered custom Fortran solutions on DEC PDP minicomputers to create real-time test flight reports and to provide real-time controls for data collection storage devices.
  • Designed and delivered custom Fortran solutions on DEC PDP minicomputers to control the execution of CNC inspection equipment.
Technologies: Technical Reports, Software Development Lifecycle (SDLC), IT Infrastructure, IT Operations Management (ITOM), Software Architecture, Engineering, Fortran, Troubleshooting, Technical Design, System Integration, System Implementation

Experience

Implementation of Cloudera on AWS Platform for a Leading Semiconductor Manufacturing Company

I undertook a pilot implementation of Cloudera on the AWS platform, providing advice on technology tools, architecture, and ETL design. I also managed the project tasks and deliverables and designed and developed a supply chain traceability solution.

The technology stack included ​AWS, Cloudera Hadoop, Hue, Impala, Hive, Sqoop, Superset, StreamSets, Tableau, Neo4J, SQL Server, and Oracle.

SalesForce.com Data Extraction for an Internet Banking Company

I designed a strategy for high-volume data warehouse extracts, developing daily metrics subsystem for 140 million accounts. I also delivered 100GB daily feeds from AWS to Marketing Cloud and tuned Redshift data storage and SQL script execution performance.

The technology stack included Redshift and Marketing Cloud.

Asset Optimization Solution

I defined the asset optimization solution strategy among Hitachi Group companies, managing software development of solution artifacts, including the oversight of offshore development and data science teams. I implemented solutions for equipment health index and optimizing inspection cycles.

The technology stack included Domo, Ammo, Pentaho, and Oracle Enterprise Asset Management (EAM).

Operations Data Warehouse for a Fortune 100 Technology Services Company

I designed a normalized, historical operations data warehouse using data vault design techniques. I also developed data models and implemented physical databases in Oracle and SQL Server. In addition, I sourced SAP data for loading the data warehouse and reproduced SAP utilization and labor cost calculations for the SQL Server star schema. Finally, I designed Informatica ETL mapping requirements, reviewed deliverables from multiple teams, and instructed the teams in data warehousing best practices.

The technology stack included Erwin, Oracle, Informatica, Microsoft SQL Server, and SAP.

Microservices Enterprise Architecture for a Pharmaceutical Services Company

I redesigned a monolithic .NET application for native cloud microservices. I also designed an event hubs pub/sub messaging strategy and a normalized, historical operations data warehouse using data vault design techniques. Further on, I developed data models and implemented physical databases in Azure DB, relying on rapid, agile development using data patterns and service patterns.
The technology stack included Microsoft Power BI, Microsoft Azure SQL Database, Event Hubs, Logic Apps, Application Insights, and Angular.

Enterprise Data Strategy for a Building Products Manufacturing Company

As a data science team leader, I provided team leadership, data strategy, and execution roadmap to enable self-service analytics. I also created a vision and strategy for a manufacturing data warehouse, leading two different analytics development teams with 12 members. Finally, I delivered Power BI control charts, innovative analytics, and visualizations, saving millions of dollars.

The technology stack included Microsoft SQL Server and SAP HANA environments, Power BI, and SAP Analytics Cloud.

Analytics Strategy and Data Warehouse for a Leading Media and Entertainment Company

I delivered data architecture review and strategy for multiple applications within the national facilities and network engineering department, proposing and assisting the transition to a corporate information factory architecture. In addition, I introduced normalized data modeling, data vault data modeling, and star-schema data modeling. I also proposed new roles to move toward an advanced analytics center of excellence and further SDLC steps to support design sprints and change control.

The technology stack included OpenJDK, PostgreSQL, RabbitMQ, and Pentaho Data Integration (PDI).

Education

2012 - 2013

Coursework in Quantitative Methods in Clinical and Public Health Research

Harvard University - Cambridge, MA, United States

1973 - 1976

Bachelor of Science Degree in Computer Science

Louisiana Tech University - Ruston, LA, United States

Certifications

JUNE 2022 - JUNE 2023

Databricks Accredited Lakehouse Fundamentals

Databricks

MAY 2013 - PRESENT

Data Science Essentials

Cloudera

JANUARY 2013 - PRESENT

Quantitative Methods in Clinical and Public Health Research

Harvard Medical School and Harvard School of Public Health

NOVEMBER 2012 - PRESENT

Certified Cloud Security Knowledge (CCSK)

Cloud Security Alliance

APRIL 1995 - PRESENT

Certified Computing Professional

Institute for Certification of Computing Professionals

Skills

Tools

Microsoft Power BI, Lucidchart, Informatica ETL, Erwin, Hue, Impala, Azure Logic Apps, Microsoft Excel, Excel 2016, Miro, Visio, STATA, Pentaho Data Integration (Kettle), Azure Application Insights, Auth0, Actian, BigQuery, Synapse, Tableau, Cloudera, RabbitMQ, Matillion ETL for Redshift, Power BI Embedded, GitHub, Git, IBM Cognos, MQTT, AI Prompts

Languages

Transact-SQL (T-SQL), SQL, Python, Fortran, Snowflake, R, GraphQL, Python 2, XML

Paradigms

Database Design, ETL, Business Intelligence (BI), Dimensional Modeling, OLAP, Agile Project Management, Application Architecture, Event-driven Architecture, Agile Software Development, Microservices, DevOps

Platforms

Amazon Web Services (AWS), Azure, Databricks, Azure SQL Data Warehouse, Dedicated SQL Pool (formerly SQL DW), SharePoint, Azure Data Lake Storage, Azure Event Hubs, Azure Synapse, Azure Functions, Oracle Cerner, Web, Talend, Azure Synapse Analytics, Oracle, SAP HANA, Apache Kafka, Amazon EC2, Salesforce, Pentaho, AWS Lambda

Storage

Microsoft SQL Server, SQL Server Management Studio (SSMS), PostgreSQL, Azure SQL, Databases, Relational Databases, Data Pipelines, Data Lakes, Database Architecture, Oracle PL/SQL, Database Structure, Dynamic SQL, SQL Architecture, Redshift, PL/SQL, Apache Hive, HDFS, Amazon S3 (AWS S3), Master Data Management (MDM), Database Transactions, OLTP, NoSQL, Database Caching, DB, Redis, Azure Cosmos DB, MongoDB, Amazon DynamoDB, Netezza, Column-oriented DBMS, Azure SQL Databases, MySQL, ER/Studio Data Architecture, JSON, Teradata, Azure Blobs, Azure Queue Storage, SQL Server Integration Services (SSIS), SQL Server Analysis Services (SSAS), Dell Boomi

Frameworks

Hadoop, TOGAF, .NET Core

Industry Expertise

Bioinformatics, Insurance, Banking & Finance, Automotive, Healthcare

Other

Data Modeling, Data Management, Solution Architecture, IT Consulting, IT Project Management, Consulting, Data Warehouse Design, Leadership, Technical Design, Architecture, Troubleshooting, Data Architecture, Analytics, Data Governance, Big Data Architecture, Software, Data Science, Data Analysis, Data Queries, Big Data, Data, Data Engineering, Azure Data Factory (ADF), Data Marts, Relational Database Design, Cloud Architecture, Healthcare Effectiveness Data and Information Set (HEDIS), Data Warehousing, Data Loading, Database Schema Design, Reporting, Integration, Data Analytics, Cloud, Data Structures, Logical Database Design, Information Gathering, Data Transformation, Data Cleansing, Delivery Management, Engineering, PL/SQL Tuning, Program Management, Cloud Infrastructure, Data Migration, ETL Tools, Business Requirements, HRIS, New Products, Technical Architecture, Data Conversion, Data Mapping, Technical Writing, Excel 365, Systems Analysis, Business Systems Analysis, ERD, IT Management, Informatica, System Integration, AWS Cloud Architecture, Software Design, Software Development, Predictive Analytics, Performance Management, Manufacturing, Healthcare Services, Agile Data Science, Financial Services, Consumer Packaged Goods (CPG), Business Process Analysis, Algorithms, Strategy, Dashboards, Data Visualization, Healthcare Management Systems, Technical Leadership, Distributed Systems, MDM, Master Data, IT Strategy, Cloud Storage, File Systems, Data Profiling, Technical Consulting, Feasibility, Management Systems, Transportation & Logistics, Software Development Management, People Management, Team Management, eCommerce, Engineering Management, Amazon RDS, Transactions, Azure Data Lake, Azure Databricks, Statistics, Performance Tuning, DAX, Unstructured Data Analysis, Insurance Technology (Insurtech), Machine Learning Operations (MLOps), Bill of Material, Enterprise Architecture, Customer Relationship Management (CRM), Real-time Data, System Implementation, CRM Design, CRM Configuration, CRM Implementation (Oracle), CRM Implementation (Salesforce), Software as a Service (SaaS), Oracle ERP, Technical Product Management, Business Process Re-engineering, System Architecture, Infrastructure as Code (IaC), Domain-driven Design (DDD), Amazon Redshift, Data Vaults, Machine Learning, DocumentDB, Due Diligence, Software Architecture, IT Operations Management (ITOM), IT Infrastructure, Organization, Software Development Lifecycle (SDLC), Technical Reports, Executive Presentations, Atlas, Documentation, Oracle R, Government, Data Center Management, Hardware, Claims, Parquet, Benefit Administration, Internet of Things (IoT), Computer Science, Marketing Cloud, SAP, Team Leadership, Azure Analysis Services, Profisee MDM, SQL Server 2015, Browsers, Key Performance Indicators (KPIs), Statistical Modeling, Dashboard Development, Unix Shell Scripting, CI/CD Pipelines, Message Queues, Genomics, Biometrics, Security, Delta Lake, Cloud Monitoring, Bill of Materials, Data Strategy, AI Modeling

Collaboration That Works

How to Work with Toptal

Toptal matches you directly with global industry experts from our network in hours—not weeks or months.

1

Share your needs

Discuss your requirements and refine your scope in a call with a Toptal domain expert.
2

Choose your talent

Get a short list of expertly matched talent within 24 hours to review, interview, and choose from.
3

Start your risk-free talent trial

Work with your chosen talent on a trial basis for up to two weeks. Pay only if you decide to hire them.

Top talent is in high demand.

Start hiring