
Collin Youngerman
Verified Expert in Engineering
Data and AI Engineer and Developer
Fenton, MO, United States
Toptal member since April 2, 2026
Collin is a data engineer, data analyst, and AI engineer with 5 years of experience building production database systems, analytics pipelines, and AI-powered automation tools across energy, telecom, manufacturing, and healthcare. He is a director of technology and analytics, owning data architecture on GCP/BigQuery and managing a $1 million consulting engagement. Collin has built pipelines processing 280,000+ data points and automated 1,000+ hours of manual workflows.
Portfolio
Experience
- SQL - 5 years
- Microsoft Power BI - 4 years
- Data Engineering - 3 years
- ETL - 3 years
- Python - 2 years
- Google Cloud Platform (GCP) - 2 years
- Google BigQuery - 2 years
- Agent Development Kit (ADK) - 1 year
Preferred Environment
Microsoft SQL Server, Google Cloud Platform (GCP), Visual Studio Code (VS Code), Git, Microsoft Power BI, Python, Docker
The most amazing...
...thing I've built is a full GCP pipeline with scheduled ingestion, raw API to BigQuery to Power BI, processing 280,000+ data points across 183 hubs in 48 hours.
Work Experience
Director of Technology & Analytics
JWestlingco.com
- Built a natural gas pricing analytics pipeline solo in under 2 days, processing 280,000+ data points across 183 NGI pricing hubs on GCP/BigQuery with automated scheduled ingestion.
- Designed the full data architecture on GCP/BigQuery serving Power BI dashboards for executive decision-making on a $500+ million capital project.
- Evaluated, secured, and managed around a $1 million consulting engagement for advanced analytics, owning vendor relationship, technical scope, and deliverable review.
- Served as the sole technology leader reporting directly to the CEO, responsible for all data strategy, vendor evaluations, and platform architecture.
Network Operations Analyst IV
Charter Communications
- Led the Omni POC, an AI-powered BI platform, meeting with leadership across verticals to define requirements, engineering the data layer, and providing contextual grounding for accurate insights.
- Built an automated process eliminating 1,000+ hours of manual work, generating insights for executive slide decks.
- Solved a years-long CRQ problem in a single day by building a log parsing and rule classification system that the team had not been able to crack.
- Designed and deployed supervised ML models for automated ticket classification, enabling visibility into previously unclassified events.
- Acted as a de facto technical lead for analytics engineering, mentoring 5 engineers on automation, CI/CD, Python-type safety, and AI tool adoption.
- Supported contractor hiring and technical vetting for the analytics engineering team.
Business Optimization Analyst
TACONY CORPORATION
- Built ETL pipelines in Azure Data Factory, orchestrating Amazon SP-API ingestion into SQL Server, enabling real-time operational dashboards across business units.
- Designed SQL Server schemas and stored procedures for automated data transformations, eliminating manual data processing workflows.
- Delivered Power BI dashboards with DAX measures providing executive visibility into sales, inventory, and operational KPIs.
Business Intelligence Analyst
National Medical Billing Solutions
- Automated complex reporting workflows using Power BI and SQL, reducing report generation time by 40% and eliminating manual data aggregation.
- Reviewed an overseas team's work daily to ensure accuracy and completeness.
- Helped develop the cube for analytics inside of SSAS.
Data Analyst
Sydenstricker Nobbe Partners
- Identified and fixed a critical SQL logic flaw in inventory pricing automation, saving $100,000+ annually.
- Built a report to be used when interest rates climbed, and they had a harder time selling equipment. It allowed the user to select a type of equipment (including 3rd-party options) and return a list of customers who are 1 purchase cycle away.
- Developed dashboards and managed the CRM system (Handle). Helped begin the transition from Handle to Microsoft Dynamics.
High School Math Teacher & Football Coach
Montgomery County R-II School District
- Built a high-accountability culture, turning a struggling football program into a consistent winner.
- Recognized across multiple graduating classes as a highly impactful educator, demonstrating strong mentorship and talent development ability.
- Translated complex mathematical concepts into clear instructions for diverse audiences, directly applicable to communicating technical solutions to non-technical business partners.
Experience
NGI Natural Gas Analytics Platform
HIGHLIGHTS
• Containerized Python ingestion jobs deployed on Cloud Run pull from the NGI API daily, covering 183 pricing points across all major hubs.
• Raw data staged in Cloud Storage, loaded into BigQuery with watermark-based incremental logic and automatic historical catchup.
• Mart views in BigQuery serve a clean, report-ready layer decoupled from raw ingestion.
• Five-page Power BI dashboard covering spot prices, forward curves, regional comparisons, historical trends, and basis analysis.
• Fully scheduled and automated via Cloud Scheduler. The pipeline completes, and the dashboard refreshes by market open each weekday.
• Modular architecture allows new data sources and dashboards to be stood up quickly on the same infrastructure.
Flux Analysis Agent
HIGHLIGHTS
• LangGraph state machine orchestrates a 12-node agentic workflow with conditional routing and loops: data ingestion, anomaly detection, stakeholder outreach, response parsing, and report generation.
• Autonomously composes and sends investigation emails via Gmail API (with Outlook and SMTP alternatives), monitors inbox for replies, and extracts root-cause explanations.
• Clarity scoring evaluates stakeholder response quality using LLM-based confidence extraction with heuristic fallback, determining whether explanations are sufficient or require follow-up.
• Generates formatted, audit-ready variance reports as Word documents (.docx) with findings, root causes, and recommended actions using python-docx.
• Full autonomous loop with graceful degradation: every LLM call has a deterministic template fallback, and all external integrations support mock mode for demos.
NexusLoop – Autonomous Supply Chain Agent
HIGHLIGHTS
• Sequential Agent made up of 5 individual agents with specific tasks, wrapped inside a Loop Agent. Execution order is enforced by structure, not by the model's judgment.
• Discrete structured steps shift the system toward deterministic behavior. In a demo, that difference is interesting. In production, it's everything.
• Lean reorder math calculates order quantities dynamically, covering supplier MOQ plus lead-time demand to prevent stockouts.
• Human-in-the-loop approval for high-value orders via tokenized webhook.
• Shares a SQLite database between simulation and agent, bridged by a restock_requests table acting as an event queue.
• Scales from 1 SKU to a full 20-part drone BOM with no core logic changes.
Education
Bachelor's Degree in Mathematics
Southern Illinois University - Carbondale, Illinois
Certifications
Generative AI Engineering Professional Certification
IBM
IBM Data Science Professional Certificate
IBM
Skills
Libraries/APIs
Power BI REST APIs, REST APIs, Pandas, NumPy, Beautiful Soup, Claude API, Gmail API, Pydantic, Fabric
Tools
Microsoft Power BI, Power Query, Power BI Desktop, Power BI Embedded, BigQuery, Cloud Scheduler, Git, Algorithm Design, Tableau
Languages
Python, SQL, Transact-SQL (T-SQL), PL/pgSQL
Paradigms
Business Intelligence (BI), ETL, Automation, Database Design, Back-end Architecture
Storage
Microsoft SQL Server, Relational Databases, MySQL, PostgreSQL, Data Pipelines, SQLite
Platforms
Google Cloud Platform (GCP), Amazon Web Services (AWS), Cloud Run, Azure, Visual Studio Code (VS Code), Docker
Frameworks
LangGraph
Other
Data Engineering, Data Modeling, DAX, Data Visualization, Data Analysis, Dashboards, Analytics, Business, Key Performance Indicators (KPIs), Reporting, Trend Analysis, AI-assisted Development, Data Reporting, Dashboard Design, AI Engineering, Agentic AI, Multi Agent Platform, Leadership, Relationship Building, Google BigQuery, Data Warehousing, Agent Development Kit (ADK), Cloud Storage, Multi-agent Systems, AI Agent Orchestration, Amazon RDS, ELT, Artificial Intelligence (AI), Scalable Architecture, Enterprise Data Warehouse (EDW), Logical Reasoning, Analytical Thinking, Problem Solving, Statistical Analysis, Proof of Concept (POC), Generative Artificial Intelligence (GenAI), Electronic Health Records (EHR), Competitive Strategy, Presentations, Azure Data Factory (ADF), CI/CD Pipelines, LangChain, RAG Architecture, RAG Pipelines, Fine-tuning, Hugging Face, Vector Databases, Deep Learning, Neural Networks, Data Science, Data Collection, Data Wrangling, Exploratory Data Analysis, Predictive Modeling, Web Scraping, Model Evaluation, Supervised Learning, Geospatial Analytics, Predictive Analytics, Logistic Regression, Support Vector Machines (SVM), Decision Trees, Machine Learning, Architecture, Gemini, SMTP, Webhooks, APIs, Digital Twin Simulation, Supply Chain, Python-docx, AI Architecture, State Machines, Document Design, Finance, Data Governance
How to Work with Toptal
Toptal matches you directly with global industry experts from our network in hours—not weeks or months.
Share your needs
Choose your talent
Start your risk-free talent trial
Top talent is in high demand.
Start hiring