
Shanto Mathew
Verified Expert in Engineering
Artificial Intelligence Engineer and Developer
Dallas, TX, United States
Toptal member since June 1, 2026
Shanto is an AI engineer specializing in agentic and voice AI. He builds production agents using Claude Code, Codex, and custom multi-agent frameworks, as well as low-latency voice agents for retail, support, and sales. Shanto takes frontier models from prototype to reliable, cost-aware production systems—tool use, RAG, orchestration, evaluations, and the integrations that make them real.
Portfolio
Experience
- Automation - 8 years
- Python 3 - 7 years
- AI Voice Agents - 2 years
- LangGraph - 2 years
- LangChain - 2 years
- AI Agents - 2 years
- Codex - 1 year
- Claude Code - 1 year
Preferred Environment
PyCharm
The most amazing...
...thing I've built is a multi-agent AI that runs revenue operations: finds signals, scores accounts, drafts outreach, and writes CRM—human-approved.
Work Experience
Senior SOAR Engineer – Generative AI Security Automation
CDW
- Architected Cortex XSIAM and XSOAR playbooks with Python custom integrations for autonomous threat triage, alert summarization, and IOC extraction across multi-cloud environments.
- Implemented MCP patterns for context-aware incident correlation across multi-cloud environments, enabling intelligent automated remediation.
- Designed scalable Python data pipelines on AWS and Snowflake for security telemetry ingestion and analysis, supporting Fortune 500 SOC operations.
- Engineered prompt templates and LLM integrations within Palo Alto XSIAM for automated phishing analysis, malware classification, and incident summarization.
Senior Generative AI Engineer – Voice AI & Agentic AI
Independent Consulting
- Built Retell Copilot, a voice agent that provisions other Retell voice agents end-to-end in under 60 seconds from a caller's spec; deployed on AWS Lambda, API Gateway, DynamoDB, and Netlify edge.
- Shipped 7 live voice AI demos on Retell AI, OpenAI Realtime, xAI Grok Voice Agent, ElevenLabs, and Twilio across sales, medical, hotel, and enterprise concierge use cases.
- Architected real-time voice agents on browser WebSocket transport with streaming PCM audio, achieving sub-250 milliseconds time-to-first-audio through buffer tuning, prompt caching, and connection pooling.
- Built role-aware RAG pipelines with LangChain and LlamaIndex using semantic chunking, hybrid search, citation enforcement, and versioned prompt templates with an offline eval harness.
- Built LoRA and QLoRA fine-tuning pipelines using HuggingFace Transformers and custom training loops for cybersecurity and customer-service domains, improving task-specific accuracy by 15-30%.
- Built NetDebt AI Voice Agent for Landmark Management Group: live Retell/Twilio intake agent with 6 custom function-calling tools, pre-seeded lead database, and CloudFront-hosted dashboard.
- Implemented production-grade security boundaries for voice AI: server-side ephemeral client-secret minting, CSP and HSTS hardening, secret-safe deploys, and graceful 503 fallbacks when keys are absent.
- Achieved provider portability across Retell AI, OpenAI Realtime, xAI Grok Voice Agent, Anthropic Claude, and Z.ai GLM-5.1 through clean abstraction layers and an offline eval harness AB-tested in production.
Senior SOAR Engineer – AI Security Automation
Bank of America
- Developed 15 Splunk SOAR applications in Python with AI-enhanced custom functions for automated incident response workflows across the global SOC.
- Built LangGraph multi-agent SOAR proof-of-concept for Splunk ES notables (triage, enrich, correlate, decide) with MCP-guarded actions and integrations to Microsoft 365 Defender, CrowdStrike, and ServiceNow.
- Shipped production agentic AI on LangGraph, automating L1/L2 SOC workflows, achieving 60% reduction in manual analyst intervention.
- Engineered RAG pipelines with ChromaDB and FAISS over MITRE ATT&CK and threat intel, enabling agents to retrieve TTP context for notable enrichment.
- Partnered with SOC leadership to define KPIs for AI-assisted triage, reducing mean time to investigate by 45% across high-volume notable categories.
Senior AI & Voice AI Engineer
CDW
- Built production Voice AI agents with Retell AI and OpenAI for outbound customer engagement, handling 10,000+ calls/month with sub-300-millisecond response latency.
- Designed LangGraph-based conversation flows with dynamic function calling, enabling agents to book appointments, qualify leads, and escalate to humans.
- Integrated Twilio telephony, WebSockets, and streaming TTS/STT pipelines, achieving natural barge-in and interruption handling across voice sessions.
- Implemented evaluation harnesses, call analytics, and prompt tuning loops that improved task completion rate by 35% across voice agent campaigns.
- Architected secure XSIAM/XSOAR Python automations integrating 30+ enterprise security tools, reducing alert fatigue for Fortune 500 SOC clients.
Senior Machine Learning Engineer
Mastercard
- Developed ML models in Python and Spark to detect fraudulent transactions across millions of daily payments, improving precision by 22% over baseline.
- Built end-to-end MLOps pipelines for model training, validation, deployment, and monitoring on AWS, reducing model release cycle from weeks to days.
- Engineered large-scale feature pipelines on Spark and SQL over Mastercard transaction data, powering risk scoring and customer segmentation models.
- Collaborated with risk and product teams to translate domain requirements into ML solutions, presenting model results and trade-offs to executive stakeholders.
Experience
Agentic AI SOC Copilot – LangGraph + RAG
The system uses a RAG layer over MITRE ATT&CK, internal runbooks, and historical incidents (ChromaDB + FAISS) so agents can ground responses in real TTP context. MCP-style tool guardrails control actions across Microsoft 365 Defender, CrowdStrike, and ServiceNow, ensuring every action is policy-checked and auditable.
I implemented prompt engineering patterns, structured outputs, evaluation harnesses, and observability hooks to enable analysts to review and override agent decisions. The deployment reduced manual L1/L2 analyst intervention by 60% and cut mean time to investigate by 45% on high-volume notable categories. Built with Python 3, LangGraph, LangChain, OpenAI, ChromaDB, FAISS, Splunk SOAR, and FastAPI.
Agentic Marketing Operations Workbench
https://gp-agentic-revenue-ops.netlify.app/I designed a signals dashboard tracking hiring, funding, expansion, exec-hire, and compliance triggers across global accounts, with approval queues, in-flight run telemetry (p95 22.4s), and live streaming refresh. I implemented multi-agent orchestration with degraded-mode fallbacks, connector health monitoring, and human-in-the-loop approvals before any outbound action. I delivered a production-grade UI with operator-friendly filters, regional segmentation, and audit-ready signal provenance for go-to-market teams.
Enterprise Voice AI Launch Console (VoiceOps)
https://elevenlabs-forward-deployed-engineer.netlify.app/I built launch-room, architecture, simulation, safety, stakeholder, and productize modules covering API integrations, evaluation results, executive updates, and reusable pilot kits. I also implemented launch-gate checks, synthetic-data evaluations, and hold-launch controls to capture risk before go-live, plus repeatable customer feedback loops for an enterprise voice rollout. I shipped a polished operator UI that mirrors how a forward-deployed engineer owns architecture, runs go-live reviews, and turns one customer launch into a productized motion across accounts.
Grok Voice Medical Front Desk
https://grok-medical-frontdesk.netlify.app/I implemented streaming caller mic and receptionist audio with live transcript, caller intake context, open-slot inventory, and console navigation by voice. I enforced no-medical-advice safety boundaries, PHI-free logging, and an automatic 5-minute call disconnect to satisfy compliance guardrails. I delivered Front Desk, Schedule, and Audit views, plus a synthetic demo mode, so prospects and operators can exercise the agent without exposing real patient data.
Y22 Voice AI Sales Roleplay Simulator
https://y22-ai-sales-roleplay.netlify.app/I built 3 preset buyer archetypes (Skeptical Mid-Market CFO, Friendly-but-Firm VP Sales, Procurement Gatekeeper) with difficulty tiering, plus a custom buyer builder so managers can spin up any persona on demand. I implemented a six-tile behavior scorecard that fills in live during the call, a Prompt Lab for tuning persona prompts, and an end-of-call scorecard to coach reps. I streamed real-time voice with synthetic-data-only safety mode, making it safe to ship to enterprise sales organizations as a daily practice tool.
SOC AI Agent – LangGraph Incident Investigator
https://security-ops-playbook-analyzer.netlify.app/The agent streams graph execution, checkpoint snapshots, tool calls, and human approval interrupts while it triages a freshly generated incident—covering enrichment, log analysis, ticketing, and remediation steps. Surfaced operator-grade telemetry: checkpoints, tool-call counts, token usage, and MTTR per run, plus parallel Send() fan-out and map-reduce back-edges to demonstrate real LangGraph patterns. I packaged the demo as a click-to-run experience so SOC managers can see how agentic AI compresses Tier-1 triage without giving up auditability or approval gates.
Education
Bachelor of Technology Degree in Computer Science
Mahatma Gandhi University - Kerala, India
Certifications
Claude Code Certification
Anthropic
Skills
Libraries/APIs
Python API, React
Tools
Claude Code, Codex, Splunk SOAR, PyCharm, Splunk
Languages
Python 3, SQL, Python, Snowflake, TypeScript
Paradigms
Automation, Model Context Protocol (MCP)
Frameworks
LangGraph
Platforms
Twilio, AWS Lambda, Cortex XSOAR, AWS IoT
Storage
Amazon DynamoDB
Other
AI Agents, AI Voice Agents, Splunk Enterprise Security, LangChain, Professional Services, RAG Systems, ElevenLabs, FastAPI, Retrieval-augmented Generation (RAG), FAISS, Chroma, LoRa, QLoRA, Cortex XSIAM, ChromaDB, MITRE ATT&CK, Prompt Engineering, Large Language Models (LLMs), Conversational AI, WebSockets, Machine Learning, Computer Science, Data Structures, Algorithms, LLM Integration, ElevenLabs Solutions, Grok, Speech Recognition
How to Work with Toptal
Toptal matches you directly with global industry experts from our network in hours—not weeks or months.
Share your needs
Choose your talent
Start your risk-free talent trial
Top talent is in high demand.
Start hiring