Gustavo Franco, Developer in Providence, RI, United States
Gustavo is currently unavailable

Gustavo Franco

AI Architect, Lead Data Scientist, and Developer

Providence, RI, United States

Toptal member since March 19, 2026

Bio

Gustavo is a versatile AI architect, lead data scientist, and prompt engineer with 12+ years of experience. He has designed and delivered scalable, production-grade machine learning (ML) and generative AI (GenAI) solutions for government, retail, and enterprise clients. Gustavo is an expert in large language models (LLMs), agentic systems, NLP, MLOps, and cloud-based infrastructure, including Azure and AWS.

Portfolio

ICF International
Python, Databricks, Large Language Models (LLMs), DevOps...
Atrium
Machine Learning Operations (MLOps), MLflow, Machine Learning...
VMWare
Python, Machine Learning Operations (MLOps), MLflow, Spark, PySpark, Dataiku...

Experience

  • DevOps - 12 years
  • Machine Learning - 12 years
  • Python - 12 years
  • Natural Language Processing (NLP) - 11 years
  • Databricks - 7 years
  • Large Language Models (LLMs) - 6 years
  • LangChain - 5 years
  • RAG Architecture - 4 years

Preferred Environment

Python, LangChain, Databricks, Dataiku, FastAPI, MLflow, RAG Architecture, Large Language Models (LLMs), Machine Learning, Natural Language Processing (NLP)

The most amazing...

...system I've built is an end-to-end agentic AI platform that transforms raw, unstructured feedback and structured data into actionable, self-service insights.

Work Experience

Senior AI Engineer

2024 - 2026
ICF International
  • Architected and deployed an agentic LLM system that allowed users to query structured survey data via natural language. The system reformatted user input, queried data sources, and composed accurate responses.
  • Conducted experiments and developed POCs demonstrating the impact of LLM agents and RAG to address key voice of customer (VoC) challenges in online passport services, increasing stakeholder buy-in across departments.
  • Designed, developed, and deployed production-grade LLM and GenAI systems using Llama 3, implementing multi-agent architectures for sentiment analysis, thematic categorization, and summarization.
Technologies: Python, Databricks, Large Language Models (LLMs), DevOps, Natural Language Processing (NLP), Data Science, ETL, Azure Databricks, Data Engineering, CI/CD Pipelines, Apache Spark, PySpark, Machine Learning Operations (MLOps), Data Analysis, Data Pipelines, PostgreSQL, SQL, Docker, Artificial Intelligence (AI), APIs, Azure, AI Integration, Git, Data Integration, AI Development, AI Architecture

Lead Data Scientist

2023 - 2024
Atrium
  • Architected and optimized a customer retention model using NLP and GenAI techniques, increasing ROI from high-value repeat customers by over 15%.
  • Led the project management for customer retention strategies and customer-facing initiatives, ensuring timely delivery and stakeholder satisfaction.
  • Explored advanced LLM-based text-to-SQL solutions, aiming to improve data accessibility for non-technical users.
Technologies: Machine Learning Operations (MLOps), MLflow, Machine Learning, Natural Language Processing (NLP), Large Language Models (LLMs), Python, Data Science, ETL, Data Engineering, CI/CD Pipelines, Apache Spark, PySpark, Data Analysis, Data Pipelines, PostgreSQL, SQL, Amazon Web Services (AWS), Docker, Artificial Intelligence (AI), APIs, Google Cloud Platform (GCP), Amazon S3 (AWS S3), Snowflake, AI Integration, Git, Data Integration, AI Development, AI Architecture

ML Specialist

2022 - 2023
VMWare
  • Developed a client recommendation system using customer segmentation, increasing client acquisition rates by 18%.
  • Built a customer acquisition model using advanced ML techniques, resulting in a 22% increase in customer retention.
  • Created robust production pipelines in Dataiku, including logging, unit tests, and integration tests, ensuring model reliability and scalability.
Technologies: Python, Machine Learning Operations (MLOps), MLflow, Spark, PySpark, Dataiku, Data Science, Data Engineering, CI/CD Pipelines, Apache Spark, Data Analysis, Data Pipelines, PostgreSQL, SQL, Amazon Web Services (AWS), Docker, Artificial Intelligence (AI), APIs, Google Cloud Platform (GCP), Amazon S3 (AWS S3), AI Integration, Git, Data Integration, Time Series, AI Development, AI Architecture

Experience

Voice of the Customer AI Platform

At ICF, I designed and deployed a multi-agent LLM system (Llama 3 70B) that allowed users to interact with complex datasets using natural language. Instead of relying on dashboards or analysts, users could ask questions, and the system would:

• Reformulate the query intelligently (handling ambiguity and intent)
• Route the request to the right tools or data sources
• Query structured databases and analytics pipelines
• Interpret the results contextually
• Generate a clear, business-ready answer

At the same time, the system processed tens of thousands of open-ended feedback entries through a three-stage pipeline (sentiment, themes, and insights), enabling leadership to understand customer pain points quickly.

Education

2019 - 2026

PhD in Informatics and Applied Mathematics

University of Massachusetts Dartmouth - Dartmouth, MA, USA

2019 - 2022

Master's Degree in Data Science

University of Massachusetts Dartmouth - Dartmouth, MA, USA

2011 - 2017

Bachelor's Degree in Mathematics and Computer Science

University of Massachusetts Dartmouth - Dartmouth, MA, USA

Skills

Libraries/APIs

PySpark, D3.js

Tools

Git, Tableau, Microsoft Power BI

Languages

Python, SQL, Snowflake, TypeScript

Frameworks

Spark, Apache Spark

Paradigms

DevOps, ETL

Platforms

Amazon Web Services (AWS), Databricks, Dataiku, Docker, Google Cloud Platform (GCP), Azure

Storage

Data Pipelines, PostgreSQL, Amazon S3 (AWS S3), Data Integration

Other

RAG Architecture, Large Language Models (LLMs), Machine Learning, Natural Language Processing (NLP), Machine Learning Operations (MLOps), CI/CD Pipelines, Data Engineering, Azure Databricks, Data Science, Data Analysis, Artificial Intelligence (AI), APIs, AI Integration, Time Series, AI Development, AI Architecture, LangChain, MLflow, FastAPI

Collaboration That Works

How to Work with Toptal

Toptal matches you directly with global industry experts from our network in hours—not weeks or months.

1

Share your needs

Discuss your requirements and refine your scope in a call with a Toptal domain expert.
2

Choose your talent

Get a short list of expertly matched talent within 24 hours to review, interview, and choose from.
3

Start your risk-free talent trial

Work with your chosen talent on a trial basis for up to two weeks. Pay only if you decide to hire them.

Top talent is in high demand.

Start hiring