
Adrian Garcia
Verified Expert in Engineering
Back-end and AI Engineer Developer
Merced, CA, United States
Toptal member since December 30, 2025
Adrian is a senior back-end engineer with 8+ years of experience building cloud-native platforms for AI, ad tech, healthcare, and fintech. He delivers end-to-end solutions—from microservices to full stack—using Python, Go, and TypeScript/Node.js. His expertise spans event-driven REST and gRPC systems and NLP and LLM back ends, including RAG, fine-tuning, and vector search. Adrian has deployed production stacks on AWS, GCP, and Azure, with a focus on reliability and observability.
Portfolio
Experience
- Python - 10 years
- System Design - 8 years
- Node.js - 8 years
- Back-end Development - 8 years
- PostgreSQL - 8 years
- Artificial Intelligence (AI) - 6 years
- Apache Kafka - 6 years
- Go - 5 years
Preferred Environment
MacOS, Windows, GitHub, Google Cloud Platform (GCP), Amazon Web Services (AWS), Large Language Models (LLMs), Python 3, Go, Node.js, BERT
The most amazing...
...thing I've built is a production-grade NLP entity extraction service on Kubernetes with autoscaling and observability, sustaining 10,000+ requests per minute.
Work Experience
Senior Back-end & AI Engineer
Harvest LLC
- Built Go and FastAPI microservices with OAuth 2, JSON Web Tokens (JWT), RBAC, rate limiting, retries, and tracing.
- Delivered REST and gRPC APIs and improved stability under load via pagination, backpressure, and pooling.
- Implemented Redis caching and concurrency tuning to reduce P95 latency during traffic spikes.
- Built ETL/ELT pipelines with data duplication, schema-drift detection, and validation suites to improve data SLAs.
- Deployed to Amazon ECS/EKS with autoscaling and zero-downtime rollouts. Added CI/CD safety checks.
- Implemented internal RAG and semantic search using LangChain and Pinecone with guardrails and cost control.
- Integrated Thrift into microservices for cross-language communication between services written in Go and Python.
- Defined data models and service interfaces using Thrift's IDL, enabling efficient communication and reducing latency in distributed systems.
- Managed delivery execution in Jira and collaborated with back-end, QA, and product teams to define requirements, deliver features end-to-end, and improve reliability through testing, monitoring, and iterative improvements.
Back-end Engineer
Citadel LLC
- Designed high-reliability Node.js and TypeScript services using event-driven architecture for regulated workflows.
- Built fault-tolerant ingestion pipelines with BullMQ/Redis and RabbitMQ worker queues and rate limiting.
- Developed document-intelligence microservices using LLMs plus custom named entity recognition and key information extraction pipelines.
- Implemented MLOps automation using MLflow, GitOps, and CI/Terraform for reproducible deployments.
- Built RAG and semantic search APIs using vector databases—pgvector/Pinecone/Weaviate—and GraphRAG with Neo4j.
- Added service-mesh security and full-stack observability—mTLS, OpenTelemetry, dashboards.
- Designed and developed high-performance Go microservices for data ingestion, processing pipelines, and real-time data analytics, integrating with Cassandra for large-scale, distributed data storage.
- Optimized Cassandra queries and data models to ensure efficient reads and writes in high-velocity systems, supporting high-throughput, low-latency needs in data-driven applications.
Senior AI & GenAI Engineer
Convoy
- Architected an end-to-end ingestion pipeline for legal PDFs/HTML with embeddings and FAISS indexing.
- Built dense retrieval and reranking workflows to increase context precision and retrieval quality.
- Delivered RAG-driven Q&A microservices in FastAPI, packaging retrieval and prompts into low-latency APIs.
- Implemented continuous refresh pipelines via AWS Lambda and S3 events for re-embedding and index updates.
- Designed GPU-accelerated inference environments with Docker, Kubernetes, and TensorRT for throughput and latency.
- Built evaluation and governance frameworks and secured endpoints or storage for sensitive documents.
Senior Back-end and Data Science Engineer
Zillow
- Designed and built Go back-end microservices for adtech bidding systems, optimizing for low latency, high availability, and real-time performance.
- Developed Kafka streaming pipelines to process impressions, clicks, and conversions in real time, supporting advertiser decisioning workflows.
- Implemented Go Kafka consumer groups with partitioning, offset management, and reliable event processing for high-QPS systems.
- Delivered low-latency data access using PostgreSQL, Cassandra, and Redis for high-frequency queries and ad targeting.
- Integrated fraud screening with GeoIP validation and double verifier services for real-time bidding decisions.
- Optimized real-time bidding algorithms and social network enrichment to enhance ad targeting and improve conversion rates.
- Worked on ad delivery optimization with social network data for accurate advertising personalization.
- Containerized Go services with Docker, deployed on Kubernetes, and established CI/CD pipelines using GitHub Actions.
- Leveraged event-driven microservices to scale real-time ad decisioning systems and integrated Thrift into the microservices architecture for cross-language communication between Go, Python, and Java services.
- Gained proficiency in GraphQL, building high-performance APIs for large-scale adtech systems and social network data processing.
Full-stack Engineer
Twills
- Re-architected and scaled an eCommerce back end using Node.js and PHP services, delivering ordering, accounts, and catalog APIs supporting 1,000+ daily users.
- Built and maintained back-end services in Symfony/Laravel and C#/.NET for payments, admin tooling, and internal operations workflows.
- Implemented enterprise security controls, including JWT/OAuth 2, RBAC, idempotent webhooks, audit logs, and rate limiting across services.
- Integrated payments, shipping, and inventory providers via OpenAPI contracts to standardize partner integrations and reduce integration defects.
- Tuned MySQL with indexing, query optimization, and connection pooling to improve P95 latency and reduce system load.
- Introduced Docker/VMware-based environments and CI/CD pipelines for repeatable deployments; added centralized logging, monitoring, alerts, and disaster recovery runbooks.
Experience
AI Data Visualization Platform (MVP)
Company Data Analytics
Multi-agent DAG Orchestration Back End
Education
PhD in Material Science (Focused on Computational Science and Data Science)
University of California, Irvine - Irvine, CA, USA
Master's Degree in Engineering (Focused on Scientific Computing and Analytics)
University of California, Irvine - Irvine, CA, USA
Bachelor's Degree in Mechanical Engineering (Focused on Programming and Modeling)
University of California, Irvine - Irvine, CA, USA
Certifications
TestDome Kubernetes Certification
TestDome
Solution and Software Architecture Certification
TestDome
TestDome Azure Certification
TestDome
TestDome Python Data Science Certification
TestDome
TestDome AWS Certification
TestDome
TestDome REST API Certification
TestDome
TestDome Machine Learning Certification
TestDome
TestDome Go Certification
TestDome
TestDome Python Certification
TestDome
Skills
Libraries/APIs
NumPy, Pandas, Node.js, SpaCy, PyTorch, TensorFlow, OpenAPI, OpenAI API, REST APIs, Asyncio, React, Back-end APIs, Scikit-learn, Claude API, Pydantic
Tools
GoLand, Jira, Slack, Amazon Elastic Container Service (ECS), RabbitMQ, Terraform, Apache Airflow, Go Kit, Celery, Grafana, GitHub, GitLab, Bitbucket, JetBrains Rider, Amazon EKS, Google Kubernetes Engine (GKE), ChatGPT, Claude, AWS Batch, Jupyter, n8n, Azure ML Studio
Languages
Python, Go, TypeScript, JavaScript, Python 3, GraphQL, C++, C#.NET, PHP, C#
Frameworks
gRPC, Express.js, NestJS, Gin, Echo, JSON Web Tokens (JWT), OAuth 2, Swagger, Next.js, LangGraph, Symfony, Laravel
Paradigms
Agile, Kanban, REST, Microservices, API Observability, ETL, Microservices Architecture, Back-end Architecture, Real-time Systems
Platforms
Docker, Visual Studio Code (VS Code), Firebase, Kubernetes, AWS IoT, AWS Lambda, Apache Kafka, Ollama, MacOS, Linux, Windows, Amazon Web Services (AWS), Azure, Vercel, Temporal Cloud, Google Cloud Platform (GCP)
Storage
Redis, PostgreSQL, Amazon S3 (AWS S3), Neo4j, MongoDB, Cassandra, Amazon DynamoDB, Data Validation
Other
Windows Subsystem for Linux (WSL), Repository Management, Waterfall, Machine Learning, Scientific Computing, Data Processing, Software Development, System Design, Computational Modeling, FastAPI, OpenTelemetry, OpenAI, CI/CD Pipelines, LangChain, Pinecone, BullMQ, MLflow, Pgvector, Weaviate, Transformers, FAISS, Hugging Face, Fine-tuning, Redis Streams, Prometheus, GitHub Actions, Vector Databases, Caching, Scraping, ELT, Logistic Regression, Orchestration, Concurrency, API Design, Full-stack Development, Software Architecture, Deep Learning, Large Language Models (LLMs), BERT, Document Parsing, Back-end Development, Back-end Admin Systems, Problem Solving, Algorithms, Artificial Intelligence (AI), Back-end, Retrieval-augmented Generation (RAG), Milvus, WebSockets, Conversational AI, Natural Language Processing (NLP), RESTFul APIs, APIs, Prompt Engineering, AI Agents, Time Series, Time Series Data, Time Series Analysis, Linear Regression, K-means Clustering, AI Chatbots, Data Protection, Mistral AI, API Integration, Data Visualization, Agentic AI, Chatbots, Amazon Bedrock, Large-scale Projects, Anthropic, Numerical Methods, R Programming, Parquet, NVIDIA TensorRT, Monitoring, Gradient Boosting, Quarto, Fault Tolerance, Front-end Development, Data Science, Data Scientist, Web Scraping, Binary Classification Models, Neural Networks, Auto Encoder, Apache Cassandra, Embeddings from Language Models (ELMo)
How to Work with Toptal
Toptal matches you directly with global industry experts from our network in hours—not weeks or months.
Share your needs
Choose your talent
Start your risk-free talent trial
Top talent is in high demand.
Start hiring