Collin Youngerman, Developer in Fenton, MO, United States
Collin is available for hire
Hire Collin

Collin Youngerman

Bio

Collin is a data engineer, data analyst, and AI engineer with 5 years of experience building production database systems, analytics pipelines, and AI-powered automation tools across energy, telecom, manufacturing, and healthcare. He is a director of technology and analytics, owning data architecture on GCP/BigQuery and managing a $1 million consulting engagement. Collin has built pipelines processing 280,000+ data points and automated 1,000+ hours of manual workflows.

Portfolio

JWestlingco.com
Google Cloud Platform (GCP), Python, Data Engineering, ETL, AI Engineering...
Charter Communications
Amazon Web Services (AWS), Python, Proof of Concept (POC), PostgreSQL, Tableau...
TACONY CORPORATION
Microsoft Power BI, Automation, Generative Artificial Intelligence (GenAI)...

Experience

  • SQL - 5 years
  • Microsoft Power BI - 4 years
  • Data Engineering - 3 years
  • ETL - 3 years
  • Python - 2 years
  • Google Cloud Platform (GCP) - 2 years
  • Google BigQuery - 2 years
  • Agent Development Kit (ADK) - 1 year

Preferred Environment

Microsoft SQL Server, Google Cloud Platform (GCP), Visual Studio Code (VS Code), Git, Microsoft Power BI, Python, Docker

The most amazing...

...thing I've built is a full GCP pipeline with scheduled ingestion, raw API to BigQuery to Power BI, processing 280,000+ data points across 183 hubs in 48 hours.

Work Experience

Director of Technology & Analytics

2026 - PRESENT
JWestlingco.com
  • Built a natural gas pricing analytics pipeline solo in under 2 days, processing 280,000+ data points across 183 NGI pricing hubs on GCP/BigQuery with automated scheduled ingestion.
  • Designed the full data architecture on GCP/BigQuery serving Power BI dashboards for executive decision-making on a $500+ million capital project.
  • Evaluated, secured, and managed around a $1 million consulting engagement for advanced analytics, owning vendor relationship, technical scope, and deliverable review.
  • Served as the sole technology leader reporting directly to the CEO, responsible for all data strategy, vendor evaluations, and platform architecture.
Technologies: Google Cloud Platform (GCP), Python, Data Engineering, ETL, AI Engineering, Agentic AI, Multi Agent Platform, Leadership, Google BigQuery, Data Modeling, Database Design, CI/CD Pipelines, Agent Development Kit (ADK), Data Visualization, Relational Databases, Data Governance, Artificial Intelligence (AI), Data Analysis, Dashboards, MySQL, Analytics, Back-end Architecture, Scalable Architecture, Business, Key Performance Indicators (KPIs), Reporting, Trend Analysis, AI-assisted Development, Business Intelligence (BI), Power Query, Power BI Desktop, Power BI Embedded, Power BI REST APIs, Fabric, Enterprise Data Warehouse (EDW), Data Reporting, Dashboard Design

Network Operations Analyst IV

2024 - 2026
Charter Communications
  • Led the Omni POC, an AI-powered BI platform, meeting with leadership across verticals to define requirements, engineering the data layer, and providing contextual grounding for accurate insights.
  • Built an automated process eliminating 1,000+ hours of manual work, generating insights for executive slide decks.
  • Solved a years-long CRQ problem in a single day by building a log parsing and rule classification system that the team had not been able to crack.
  • Designed and deployed supervised ML models for automated ticket classification, enabling visibility into previously unclassified events.
  • Acted as a de facto technical lead for analytics engineering, mentoring 5 engineers on automation, CI/CD, Python-type safety, and AI tool adoption.
  • Supported contractor hiring and technical vetting for the analytics engineering team.
Technologies: Amazon Web Services (AWS), Python, Proof of Concept (POC), PostgreSQL, Tableau, Data Modeling, Data Warehousing, Data Visualization, Relational Databases, Amazon RDS, ELT, Artificial Intelligence (AI), Data Analysis, Dashboards, MySQL, Analytics, Back-end Architecture, Scalable Architecture, Business, Key Performance Indicators (KPIs), Reporting, Trend Analysis, AI-assisted Development, Business Intelligence (BI), Power Query, Power BI Desktop, Power BI Embedded, Power BI REST APIs, Fabric, Enterprise Data Warehouse (EDW), Data Reporting, Dashboard Design

Business Optimization Analyst

2023 - 2024
TACONY CORPORATION
  • Built ETL pipelines in Azure Data Factory, orchestrating Amazon SP-API ingestion into SQL Server, enabling real-time operational dashboards across business units.
  • Designed SQL Server schemas and stored procedures for automated data transformations, eliminating manual data processing workflows.
  • Delivered Power BI dashboards with DAX measures providing executive visibility into sales, inventory, and operational KPIs.
Technologies: Microsoft Power BI, Automation, Generative Artificial Intelligence (GenAI), Azure Data Factory (ADF), Data Modeling, DAX, Data Visualization, Relational Databases, ELT, Data Governance, Data Analysis, Dashboards, MySQL, Analytics, Business, Key Performance Indicators (KPIs), Reporting, Trend Analysis, Business Intelligence (BI), Power Query, Power BI Desktop, Power BI Embedded, Power BI REST APIs, Fabric, Enterprise Data Warehouse (EDW), Data Reporting, Dashboard Design

Business Intelligence Analyst

2022 - 2022
National Medical Billing Solutions
  • Automated complex reporting workflows using Power BI and SQL, reducing report generation time by 40% and eliminating manual data aggregation.
  • Reviewed an overseas team's work daily to ensure accuracy and completeness.
  • Helped develop the cube for analytics inside of SSAS.
Technologies: Microsoft Power BI, Electronic Health Records (EHR), DAX, Data Visualization, Relational Databases, Data Analysis, Dashboards, MySQL, Analytics, Business, Key Performance Indicators (KPIs), Reporting, Trend Analysis, Business Intelligence (BI), Power Query, Power BI Desktop, Power BI Embedded, Power BI REST APIs, Azure, Enterprise Data Warehouse (EDW), Data Reporting, Dashboard Design

Data Analyst

2021 - 2022
Sydenstricker Nobbe Partners
  • Identified and fixed a critical SQL logic flaw in inventory pricing automation, saving $100,000+ annually.
  • Built a report to be used when interest rates climbed, and they had a harder time selling equipment. It allowed the user to select a type of equipment (including 3rd-party options) and return a list of customers who are 1 purchase cycle away.
  • Developed dashboards and managed the CRM system (Handle). Helped begin the transition from Handle to Microsoft Dynamics.
Technologies: SQL, Transact-SQL (T-SQL), Data Visualization, Relational Databases, Finance, Data Analysis, Dashboards, MySQL, Analytics, Business, Key Performance Indicators (KPIs), Reporting, Trend Analysis, Business Intelligence (BI), Power Query, Power BI Desktop, Power BI Embedded, Power BI REST APIs, Enterprise Data Warehouse (EDW), Data Reporting, Dashboard Design

High School Math Teacher & Football Coach

2016 - 2021
Montgomery County R-II School District
  • Built a high-accountability culture, turning a struggling football program into a consistent winner.
  • Recognized across multiple graduating classes as a highly impactful educator, demonstrating strong mentorship and talent development ability.
  • Translated complex mathematical concepts into clear instructions for diverse audiences, directly applicable to communicating technical solutions to non-technical business partners.
Technologies: Leadership, Competitive Strategy, Relationship Building, Presentations

Experience

NGI Natural Gas Analytics Platform

A fully automated, end-to-end analytics platform built on GCP, Python, Docker, and Power BI that delivers daily natural gas market intelligence from raw API data to executive-ready dashboards, processing 280,000+ data points across 183 pricing hubs. Prior to this, pricing data was manually retrieved and lacked a centralized reporting infrastructure.

HIGHLIGHTS
• Containerized Python ingestion jobs deployed on Cloud Run pull from the NGI API daily, covering 183 pricing points across all major hubs.
• Raw data staged in Cloud Storage, loaded into BigQuery with watermark-based incremental logic and automatic historical catchup.
• Mart views in BigQuery serve a clean, report-ready layer decoupled from raw ingestion.
• Five-page Power BI dashboard covering spot prices, forward curves, regional comparisons, historical trends, and basis analysis.
• Fully scheduled and automated via Cloud Scheduler. The pipeline completes, and the dashboard refreshes by market open each weekday.
• Modular architecture allows new data sources and dashboards to be stood up quickly on the same infrastructure.

Flux Analysis Agent

An autonomous AI agent that performs end-to-end financial variance analysis, replacing manual "Big 4" style audit workflows. The system ingests financial data, identifies material anomalies, autonomously investigates root causes by emailing stakeholders, parses their responses, and generates audit-ready Word reports, all without human intervention.

HIGHLIGHTS
• LangGraph state machine orchestrates a 12-node agentic workflow with conditional routing and loops: data ingestion, anomaly detection, stakeholder outreach, response parsing, and report generation.
• Autonomously composes and sends investigation emails via Gmail API (with Outlook and SMTP alternatives), monitors inbox for replies, and extracts root-cause explanations.
• Clarity scoring evaluates stakeholder response quality using LLM-based confidence extraction with heuristic fallback, determining whether explanations are sufficient or require follow-up.
• Generates formatted, audit-ready variance reports as Word documents (.docx) with findings, root causes, and recommended actions using python-docx.
• Full autonomous loop with graceful degradation: every LLM call has a deterministic template fallback, and all external integrations support mock mode for demos.

NexusLoop – Autonomous Supply Chain Agent

An agentic procurement system built on Google ADK powered by Gemini. Inventory burns in real time, reorder points trigger automatically, purchase orders are created, high-value POs are routed to a human for approval, and orders are tracked through delivery. It works as a digital twin, a process automation layer, or a testing environment for procurement strategies. Procurement teams spend a lot of time monitoring stock and manually starting orders. This eliminates that while keeping human sign-off where it matters.

HIGHLIGHTS
• Sequential Agent made up of 5 individual agents with specific tasks, wrapped inside a Loop Agent. Execution order is enforced by structure, not by the model's judgment.
• Discrete structured steps shift the system toward deterministic behavior. In a demo, that difference is interesting. In production, it's everything.
• Lean reorder math calculates order quantities dynamically, covering supplier MOQ plus lead-time demand to prevent stockouts.
• Human-in-the-loop approval for high-value orders via tokenized webhook.
• Shares a SQLite database between simulation and agent, bridged by a restock_requests table acting as an event queue.
• Scales from 1 SKU to a full 20-part drone BOM with no core logic changes.

Education

2013 - 2016

Bachelor's Degree in Mathematics

Southern Illinois University - Carbondale, Illinois

Certifications

SEPTEMBER 2025 - PRESENT

Generative AI Engineering Professional Certification

IBM

JUNE 2025 - PRESENT

IBM Data Science Professional Certificate

IBM

Skills

Libraries/APIs

Power BI REST APIs, REST APIs, Pandas, NumPy, Beautiful Soup, Claude API, Gmail API, Pydantic, Fabric

Tools

Microsoft Power BI, Power Query, Power BI Desktop, Power BI Embedded, BigQuery, Cloud Scheduler, Git, Algorithm Design, Tableau

Languages

Python, SQL, Transact-SQL (T-SQL), PL/pgSQL

Paradigms

Business Intelligence (BI), ETL, Automation, Database Design, Back-end Architecture

Storage

Microsoft SQL Server, Relational Databases, MySQL, PostgreSQL, Data Pipelines, SQLite

Platforms

Google Cloud Platform (GCP), Amazon Web Services (AWS), Cloud Run, Azure, Visual Studio Code (VS Code), Docker

Frameworks

LangGraph

Other

Data Engineering, Data Modeling, DAX, Data Visualization, Data Analysis, Dashboards, Analytics, Business, Key Performance Indicators (KPIs), Reporting, Trend Analysis, AI-assisted Development, Data Reporting, Dashboard Design, AI Engineering, Agentic AI, Multi Agent Platform, Leadership, Relationship Building, Google BigQuery, Data Warehousing, Agent Development Kit (ADK), Cloud Storage, Multi-agent Systems, AI Agent Orchestration, Amazon RDS, ELT, Artificial Intelligence (AI), Scalable Architecture, Enterprise Data Warehouse (EDW), Logical Reasoning, Analytical Thinking, Problem Solving, Statistical Analysis, Proof of Concept (POC), Generative Artificial Intelligence (GenAI), Electronic Health Records (EHR), Competitive Strategy, Presentations, Azure Data Factory (ADF), CI/CD Pipelines, LangChain, RAG Architecture, RAG Pipelines, Fine-tuning, Hugging Face, Vector Databases, Deep Learning, Neural Networks, Data Science, Data Collection, Data Wrangling, Exploratory Data Analysis, Predictive Modeling, Web Scraping, Model Evaluation, Supervised Learning, Geospatial Analytics, Predictive Analytics, Logistic Regression, Support Vector Machines (SVM), Decision Trees, Machine Learning, Architecture, Gemini, SMTP, Webhooks, APIs, Digital Twin Simulation, Supply Chain, Python-docx, AI Architecture, State Machines, Document Design, Finance, Data Governance

Collaboration That Works

How to Work with Toptal

Toptal matches you directly with global industry experts from our network in hours—not weeks or months.

1

Share your needs

Discuss your requirements and refine your scope in a call with a Toptal domain expert.
2

Choose your talent

Get a short list of expertly matched talent within 24 hours to review, interview, and choose from.
3

Start your risk-free talent trial

Work with your chosen talent on a trial basis for up to two weeks. Pay only if you decide to hire them.

Top talent is in high demand.

Start hiring