Daphne Liu, Developer in Jacksonville, FL, United States
Daphne is available for hire
Hire Daphne

Daphne Liu

Bio

Daphne is a highly motivated big data analytics architect and SQL/Tableau/Power BI, business intelligence developer with strong business analytics solution delivery skills and 20 years of progressively responsible OLTP/OLAP database development/architecture experience. She is a frequent seminar speaker and workshop trainer in business intelligence and analytic solutions. Daphne is experienced in collaborating with business users in data modeling and business analytics solutions.

Portfolio

Delta Dental
Snowflake, Python 3, SQL, PyTorch, Machine Learning, LLM Reasoning...
CEVA Logistics
Neural Networks, Performance Tuning, Time Series Analysis, Time Series...

Experience

  • Data Modeling - 15 years
  • SQL - 15 years
  • Data Warehouse Design - 8 years
  • Tableau - 7 years
  • Data Virtualization - 5 years
  • Big Data - 5 years
  • Apache Cassandra - 4 years
  • Machine Learning - 3 years

Preferred Environment

Amazon Web Services (AWS), Azure, Google Cloud, Big Data, Linux, SQL, Tableau, Power BI Desktop, Python, Machine Learning

The most amazing...

...thing about me is that I am a data prodigy. I am an expert in SQL development, data modeling, data warehouse development, data analytics, and visualization.

Work Experience

Principal AI ML Architect/Engineer

2023 - 2026
Delta Dental
  • Led a team of data scientists/engineers in building scalable ML models for churn prediction, lifetime value forecasting, and personalized recommendations.
  • Built scalable enterprise data and AI architectures, along with MLOps systems, to reliably run models in production.
  • Contributed to ML, AI, and agentic AI to design and deliver practical ML solutions—including time series forecasting, regression, classification, neural networks, and RAG-powered LLMs (ChatGPT-4)—focused on real-world impact and scalable deployment.
  • Designed scalable AI/ML system architectures, integrating with data platforms (e.g., Snowflake, Azure ML Studio, Azure AI Foundry, Dataiku, etc.). Defined model lifecycle processes: training, deployment, monitoring, and retraining.
  • Standardized feature engineering pipelines, model registries, and reproducibility frameworks. Defined best practices for MLOps, automation, and CI/CD pipeline integration. Led development of complex ML models and generative AI solutions.
  • Mentored data scientists and business stakeholders on Tableau best practices and advanced analytics. Integrated machine learning outputs, including prediction scores, anomaly detection, and forecasting insights, into reporting solutions.
  • Developed Tableau dashboards to monitor data quality, testing results, and operational KPIs. Leveraged SQL and Python to automate data pipelines, improving efficiency and enabling scalable, self-service analytics across business functions.
Technologies: Snowflake, Python 3, SQL, PyTorch, Machine Learning, LLM Reasoning, OpenAI GPT-4 API, Microsoft Power BI, Tableau, Amazon QuickSight, Codex, Azure Data Factory (ADF), Amazon S3 (AWS S3), AWS Glue

Business Intelligence Architect

2018 - 2023
CEVA Logistics
  • Built Power BI and Tableau dashboards using data warehouse and ML outputs, including prediction scores, anomaly detection, and forecasts. Delivered actionable insights that turned complex analytics into business value.
  • Mentored data scientists and business stakeholders on Power BI and Tableau best practices, demonstrating advanced analytics capabilities and helping teams maximize the value of ML-driven reporting and decision-making.
  • Designed and built scalable data warehouse solutions in Snowflake, integrating data from multiple sources. Developed curated data models and pipelines that enabled reliable self-service BI, improving data accessibility and reporting efficiency.
  • Engineered batch and real-time data pipelines using Python, SQL, and ETL/ELT frameworks. Integrated data from SQL Server, PostgreSQL, S3, and other sources into Snowflake, delivering scalable, reliable, and analytics-ready datasets.
Technologies: Tableau

Principal Advanced Analytic Architect (AI/ML)

2014 - 2023
CEVA Logistics
  • Led a team of data scientists and data engineers in building scalable ML models for route optimization, carrier ranking, and logistics best practice recommendations.
  • Contributed to the enterprise AWS Databricks data warehouse, Symantec self-service data model, and ETL/ELT data pipeline design.
  • Used Informatica PowerCenter for ETL workflows, transforming MS SQL OLTP to OLAP, refreshing cloud data pipelines, and moving data into Cassandra data lakes. Also worked with AWS S3 and Redshift for scalable analytics and warehousing.
  • Implemented both Tableau and Power BI analytic visualization for supply chain management and a freight management system.
  • Implemented and completed data modeling of an enterprise data warehouse, data lakes, and ML/AI forecast models.
  • Used Facebook Prophet (a time series algorithm), XGBoost, LightGBM, classification, and PyTorch to deliver an ML and AI solution to a logistics ground TMS system.
  • Built and deployed a range of AI/ML models—including time series forecasting, regression, classification, neural networks, LLMs, and RAG—to solve real business problems at scale.
  • Delivered a data quality solution using PostgreSQL fuzzy string matching and Python FuzzyWuzzy libraries, cleaning data, and creating mapping groups for the machine learning model.
  • Designed and architected a supply chain carrier advisor ML solution that includes data labeling, feature selection, hyperparameter optimization, algorithm training, and carrier selection smart choices advisory to the supply chain management team.
  • Deployed the supply chain carrier advisor ML model based on TensorFlow TF-Ranking, AutoKeras, and Neural Network algorithms. Over a million records were trained in this model, providing a training result API and batch forecast result references.
Technologies: Neural Networks, Performance Tuning, Time Series Analysis, Time Series, AutoKeras, Pandas, Python, Feature Selection, Data, Machine Learning, Data Architecture, OLAP, NoSQL, SQL, PostgreSQL, Microsoft SQL Server, Tableau, Hortonworks Data Platform (HDP), Grafana, Elasticsearch, Cassandra, Pentaho, Data Analysis, Big Data, Snowflake, Microsoft Excel, Amazon S3 (AWS S3), Database Design, Database Schema Design, Business Intelligence (BI), Integration, Amazon QuickSight, Fabric, Informatica

BI Architect

2013 - 2015
City of Jacksonville
  • Architected Microsoft Business Intelligence Solutions using SQL Server, SSIS, and SSAS.
  • Built SSAS Cube and MDX.
  • Developed SSIS and designed a data warehouse.
  • Designed and developed a Microsoft Power BI Solution.
Technologies: MDX, Microsoft SQL Server, SQL Server Integration Services (SSIS), SSAS, Microsoft Power BI, Microsoft Excel, Amazon S3 (AWS S3), Database Design, Database Schema Design, Reporting, Business Intelligence (BI), Integration, Fabric

BI Architect

2012 - 2013
Crowley Marinetime
  • Built a Microsoft Business Intelligence Solution SSAS Cube for budget and actual.
  • Implemented an SSIS ETL from Oracle and DB2.
  • Created a TSQL for Crowley Vessel Captain log dimensional data model.
  • Developed an SSRS report.
  • Implemented SVN source version control.
Technologies: Subversion (SVN), SQL Server Reporting Services (SSRS), Transact-SQL (T-SQL), IBM Db2, Oracle, SQL Server Integration Services (SSIS), SSAS, Microsoft Power BI, Microsoft Excel, Database Design, Database Schema Design, Reporting, Business Intelligence (BI), Integration, Fabric

Experience

Tableau Dashboard Development

https://public.tableau.com/profile/daphne.liu#!/
Tableau dashboard design for a supply chain carrier KPI, financial management KPI (AP vs AR), and shipment on time performance. Implemented Tableau actions, KPI calculated columns, LOD calculations, dynamic slicers, and performance tuning.

Big Data Cassandra & Solr Document Search

Solr cloud free text search engine design for vendor EDI documents using Solr data import module with Cassandra cluster data stored. A Hadoop HDFS file system was implemented for Solr document index storage. Six Solr collections with shards and replicas. Deployed in March 2016

Big Data Cassandra & Elasticsearch Data Warehouse

Big Data NoSQL Cassandra and Elasticsearch cluster solution design and implementation. Elasticsearch search engine was built on top of Cassandra cluster, Using Pentaho PDI ETL tool moving data from relational databases to Cassandra NoSQL clusters for enterprise data warehouse. Started in 2017 and deployed in July 2018.

Dimensional Data Model for Supply Chain Management and Financial Management

SCM and FM Dimensional models built on top of current SQL server data store. These models provide internal or external customers data sources for business analytics. The solution was developed in TSQL, SSIS, and SQL server 2016.

Supply Chain Carrier Advisor — Machine Learning Model

Carrier Advisor is a machine learning project that advises carriers for operators in a supply chain management system.
I built the AI and ML model from OLAP by labeling data, selecting features and algorithms, POC using AutoML algorithms, and performed the final production deployment using AutoKeras and TensorFlow TF-Ranking. Data was transformed from OLAP to prediction models using Python and Pentaho PDI.

Education

1993 - 1995

Master's Degree in Computer Information Science & Engineering

University of Florida - Florida

Certifications

MARCH 2014 - MARCH 2016

Tableau

Tableau

Skills

Libraries/APIs

Pandas, Fabric, AutoKeras, TensorFlow Deep Learning Library (TFLearn), PyTorch

Tools

AutoML, Microsoft Power BI, Tableau, Grafana, Pentaho Data Integration (Kettle), Microsoft Excel, Amazon QuickSight, Microsoft Access, H2O AutoML, Apache Solr, ARIMA, Prophet ERP, AWS Glue, Amazon SageMaker, dbt Cloud, Superset, SSAS, Subversion (SVN), Power BI Desktop, Codex

Languages

Python, Transact-SQL (T-SQL), SQL, Snowflake, Python 3, MDX

Frameworks

Data Lakehouse, Apache Spark, Hadoop, AWS HA

Paradigms

OLAP, Database Design, Business Intelligence (BI), ETL, HIPAA Compliance

Platforms

Dataiku, Linux, Amazon EC2, Databricks, Microsoft Fabric, Azure, SolrCloud, Apache Kafka, Pentaho, Hortonworks Data Platform (HDP), Oracle, Amazon Web Services (AWS)

Storage

Microsoft SQL Server, OLTP, Data Pipelines, Data Lakes, MySQL, NoSQL, Elasticsearch, Amazon S3 (AWS S3), Redshift, Google Cloud, Cassandra, Druid.io, SQL Server Integration Services (SSIS), IBM Db2, SQL Server Reporting Services (SSRS), PostgreSQL

Other

Data Analysis, Apache Cassandra, Big Data Architecture, Data Virtualization, Data Warehouse Design, Data Modeling, Data Architecture, Big Data, Forecasting, Time Series, AWS Database Migration Service (DMS), Database Schema Design, Integration, Data Engineering, Machine Learning Operations (MLOps), Supervised Machine Learning, Data Build Tool (dbt), ELT, Amazon Redshift, Data Taxonomy, Metadata, Data Warehousing, Data Analytics, Data Visualization, DAX, Dashboard Design, Key Performance Indicators (KPIs), Reports, Dashboards, Informatica, Artificial Intelligence (AI), Classification Algorithms, Data Science, Machine Learning, Neural Networks, Agile Data Science, Linear Regression, Logistic Regression, Reporting, Amazon Bedrock, Cloud, Feature Selection, Performance Tuning, Classification, Data, Time Series Analysis, LLM Reasoning, OpenAI GPT-4 API, Azure Data Factory (ADF)

Collaboration That Works

How to Work with Toptal

Toptal matches you directly with global industry experts from our network in hours—not weeks or months.

1

Share your needs

Discuss your requirements and refine your scope in a call with a Toptal domain expert.
2

Choose your talent

Get a short list of expertly matched talent within 24 hours to review, interview, and choose from.
3

Start your risk-free talent trial

Work with your chosen talent on a trial basis for up to two weeks. Pay only if you decide to hire them.

Top talent is in high demand.

Start hiring