Joao Diogo de Oliveira, Developer in Fortaleza - State of Ceará, Brazil
Joao is available for hire
Hire Joao

Joao Diogo de Oliveira

Bio

Joao is an AI/ML architect and hands-on AI coach with 15+ years driving change across Fortune 100s (P&G, Hearst) and high-impact startups in healthcare, energy, and media. He holds a master's in computer engineering with multiple ML certifications. He's led 15+ GenAI automations and coached diverse engineering teams—from developers to senior architects—on integrating AI tools and agents across the SDLC, shipping systems that transform how teams work.

Portfolio

Hearst - Technology
Python, Artificial Intelligence (AI), Gemini...
RealWear - Main
Large Language Models (LLMs), Artificial Intelligence (AI), LangChain...
Vasilis K. Pozios, M.D.
Artificial Intelligence (AI), Large Language Models (LLMs), AI Automation...

Experience

  • Artificial Intelligence (AI) - 8 years
  • Machine Learning - 7 years
  • Data Analytics - 6 years
  • Computer Vision - 6 years
  • PyTorch - 4 years
  • Deep Learning - 4 years
  • Generative Artificial Intelligence (GenAI) - 3 years
  • AI Agents - 3 years

Preferred Environment

PyTorch, Machine Learning, Amazon Web Services (AWS), Generative Artificial Intelligence (GenAI), Computer Vision, Deep Learning, Data Analysis, Agentic AI, AI Architecture

The most amazing...

...thing was building 3D AI models, architecting voice AI agents for smart glasses, and leading 15+ GenAI automations at a Fortune 100, from zero to production.

Work Experience

AI Engineering Lead

2023 - PRESENT
Hearst - Technology
  • Grew from a single AI project to technical lead of a 10+ project portfolio across 7+ Hearst companies over 2.5 years, coaching diverse engineering teams—from developers to architects—on transforming workflows with AI tools and agents.
  • Achieved 93% accuracy in translating human language requests about financial data into complex SQL BigQuery queries, enabling users without SQL knowledge to access financial data such as loans and bonds.
  • Designed and delivered multiple Hearst automation projects—using AI Agents, RPAs, scripts, etc. that transformed manual operational workflows into end-to-end automation.
  • Replaced a 10+-year legacy healthcare system with a GenAI MVP in 3-4 weeks and replicated text-to-SQL (93% accuracy) across four companies, establishing reusable evaluation and deployment patterns adopted enterprise-wide.
  • Leveraged GenAI to extract features and analyze 1+ million archaeological images, significantly restoring and preserving past knowledge in a cost-efficient manner.
Technologies: Python, Artificial Intelligence (AI), Gemini, Generative Artificial Intelligence (GenAI), Google Cloud Platform (GCP), Azure, AI Agents, Information Extraction, Large Language Models (LLMs), Data Science, Natural Language Processing (NLP), Amazon Web Services (AWS), OpenAI, Multimodal Models, Multimodal GenAI, Agentic AI, Model Context Protocol (MCP), Pydantic, AI Architecture, AI Automation, AI Programming, AI Design, Finance, Microsoft Copilot, GitHub Copilot Chat, Claude Code, Cursor AI, Cline, Gemini API, Google Antigravity, AI Tools, Architecture, AI-assisted Development, AI Enablement, Software Development Lifecycle (SDLC), System Development Life Cycle (SDLC)

AI Solutions Architect

2025 - 2026
RealWear - Main
  • Architected a multi-agent voice AI back end (LangGraph and Pipecat) that evolved a beta LLM prototype into a production-ready platform for industrial smart glasses.
  • Designed and shipped MCP integration with OAuth 2.1 and Auth0 Token Vault, enabling secure hands-free access to Email, Teams, and Calendar on wearable devices.
  • Implemented end-to-end observability (Langfuse, OpenTelemetry, Azure Insights) covering traces, token costs, and audio diagnostics, improving debugging speed and release confidence.
  • Ran startup-performance experiments optimizing cold/warm boot latency for STT/TTS init, memory store, and skill loading across cloud and edge.
Technologies: Large Language Models (LLMs), Artificial Intelligence (AI), LangChain, LangGraph, Computer Vision, Natural Language Processing (NLP), Machine Learning, Prompt Engineering, AI Automation, AI Agents, Claude Code, Cursor AI, AI Tools, Architecture, AI-assisted Development, Software Development Lifecycle (SDLC), System Development Life Cycle (SDLC)

AI Technical Lead

2025 - 2025
Vasilis K. Pozios, M.D.
  • Enabled forensic psychiatrists to ingest thousands of pages of records, auto-extract and de-identify PHI, and generate structured forensic reports in a fraction of the manual time.
  • Led the full product delivery of a forensic psychiatry SaaS from zero to production, coordinating front-end, back-end, and AWS infrastructure into a unified release cadence.
  • Delivered HIPAA-aligned PHI masking and de-identification, enabling safe processing of sensitive medical and criminal records across a multi-tenant platform.
  • Built an interview transcription pipeline with hierarchical summarization and adaptive rate limiting for forensic psychiatric evaluations.
Technologies: Artificial Intelligence (AI), Large Language Models (LLMs), AI Automation, AI Agents, Claude Code, Cursor AI, AI Tools, AI-assisted Development, Software Development Lifecycle (SDLC), System Development Life Cycle (SDLC), Gemini API

AI Facilitator & Internal Consultant

2023 - 2025
Toptal
  • Designed and facilitated hands-on technical workshops — including a 3-day LLM coaching sprint for 100+ engineers — combining live labs with applied exercises to build practitioner-level fluency.
  • Designed and facilitated a 2-day hands-on AI-assisted coding workshop for FranklinCovey, building live demos and labs covering IDE assistants, prompt patterns, and AI code review — with adoption metrics targeting 20–30% faster iteration.
  • Facilitated advanced sessions on Quantum Computing and Reinforcement Learning, adapting content and pacing to mixed-proficiency audiences of experienced engineers.
  • Provided vision and strategic guidance for internal AI projects, including infrastructure, techniques, and models to achieve project goals.
Technologies: Artificial Intelligence (AI), Large Language Models (LLMs), Generative Pre-trained Transformers (GPT), Generative Artificial Intelligence (GenAI), AI Architecture, AI Automation, AI Tools, Gemini API

Machine Learning Developer (via Toptal)

2023 - 2025
EIS - Main
  • Conducted a feasibility study and implemented a POC for capturing, counting, and geo-locating valves in oil and gas plant scans.
  • Developed an AI model to identify valves in image batches from plant scans, improving detection accuracy and efficiency.
  • Implemented a method to automatically process and slice cloud point data, extracting images and transforming them into 2D representations.
  • Labeled 3D data to train deep learning models for 3D segmentation, successfully applying models such as PointNet and PointNet++ to real data.
  • Developed an inference pipeline to label unseen data and output labeled point clouds, enhancing data processing capabilities.
Technologies: Machine Learning, Computer Vision, Deep Learning, Convolutional Neural Networks (CNNs), Artificial Intelligence (AI), Point Clouds, Point Cloud Data, Image Processing, Natural Language Processing (NLP), Python, TensorFlow, PyTorch, You Only Look Once (YOLO), Generative Artificial Intelligence (GenAI), Pydantic, AI Architecture, AI Automation, Software Development Lifecycle (SDLC), System Development Life Cycle (SDLC)

AI Developer (via Toptal)

2022 - 2024
Peyton & Greyson Solutions Inc,
  • Developed an AI application for automatic proposal writing, saving 20% of a specialized employee's time and increasing efficiency.
  • Architected the entire IT solution, encompassing database selection, AWS serverless services, a web app back end, API configuration, and AI model deployment.
  • Tracked team development, ensuring milestones were met and successfully delivering from demos to critical project deliverables.
Technologies: Artificial Intelligence (AI), AI Design, Generative Adversarial Networks (GANs), Language Models, OpenAI, APIs, Backendless, Amazon Web Services (AWS), AWS Lambda, Amazon RDS, Python, Large Language Models (LLMs), Models, AI Programming, Natural Language Understanding (NLU), Matplotlib, Natural Language Processing (NLP), Generative Pre-trained Transformers (GPT), Information Extraction, GitHub, Cloud Platforms, Data Pipelines, Early-stage Startups, Data Processing, Data Transformation, Back-end, ChatGPT, OpenAI GPT-3 API, Generative Pre-trained Transformer 3 (GPT-3), DevOps, Amazon SageMaker, Jupyter Notebook, OpenAI GPT-4 API, Kubernetes, Scraping, Analytics, Keras, Sentiment Analysis, Generative Artificial Intelligence (GenAI), Data Structures, DeepSeek, AI Automation, Architecture, Software Development Lifecycle (SDLC), System Development Life Cycle (SDLC)

IT Engineer | Artificial Intelligence Engineer

2019 - 2024
Freelance Clients
  • Developed an AI project for energy prediction of solar and wind farms, totaling 2.6 GW of installed power, optimizing energy output and management.
  • Built a computer vision model for face recognition, enhancing security and identification processes.
  • Created a computer vision model to assist in pneumonia detection through X-rays, improving diagnostic accuracy.
  • Provided consulting services for wind certification of two offshore projects, predicting a combined installed power of 2GW.
  • Managed and maintained over 20 distributed Linux servers, ensuring their security, updating, and creating key performance indicators (KPIs) for performance tracking.
Technologies: Python 2, Python 3, Deep Learning, Statistics, Data Analytics, Python, Data Science, Deep Neural Networks (DNNs), Big Data Architecture, Linux, Datasets, Pandas, Machine Learning Operations (MLOps), Image Processing, Large Language Models (LLMs), Models, AI Programming, Generative Pre-trained Transformers (GPT), Natural Language Processing (NLP), Data Processing Automation, Artificial Intelligence (AI), Image Generation, ARIMA, LSTM, SARIMA, R, Matplotlib, Information Extraction, GitHub, Cloud Platforms, Data Pipelines, Energy, Neural Networks, Regression Modeling, Data Processing, Data Transformation, CSV, Data Analysis, Back-end, DevOps, Amazon SageMaker, Jupyter Notebook, Speech Recognition, Scraping, Analytics, FFmpeg, Keras, Sentiment Analysis, Image Recognition, TensorFlow, PyTorch, Computer Vision, Generative Artificial Intelligence (GenAI), OpenAI, Speech-to-Text (STT), Speech to Intent, Pydantic, Architecture, Software Development Lifecycle (SDLC), System Development Life Cycle (SDLC)

Product Owner | Country Manager

2017 - 2024
Prewind
  • Developed AI models for deep learning, weather forecasting, and energy prediction across multiple markets, enhancing predictive capabilities.
  • Conducted comprehensive business and data analytics for customers, providing actionable insights.
  • Established a European institute in Brazil successfully, expanding the organization's reach and impact.
  • Managed a portfolio of clients with a combined energy production of 3+ GW, optimizing energy management and client satisfaction.
Technologies: Deep Learning, Artificial Intelligence (AI), Machine Learning, Data Analytics, Data Science, Data Visualization, Linux, Datasets, Pandas, Amazon Web Services (AWS), Python, Models, Matplotlib, Information Extraction, GitHub, Early-stage Startups, Energy, Neural Networks, Data Transformation, CSV, Data Analysis, Back-end, DevOps, Workshop Facilitation, Analytics, Sentiment Analysis, Image Recognition

Team Leader

2023 - 2023
Stop the Traffik
  • Analyzed key tech issues in a volunteer organization and developed a plan to address them, leading a team of 11 volunteers across nine countries.
  • Led a team of ML/AI specialists to develop an AI model for sentiment analysis, automating the classification of trafficking articles and eliminating manual labor.
  • Guided a team of ML/AI specialists to enhance a legacy model, improving the classification of articles into relevant and non-relevant categories.
  • Steered through meetings the project success and engagement to deliver the proposed outcomes to the organization. Participated in all parts of development (AI, DevOps, Python) to make sure that commitments were met and delivered.
Technologies: IBM Cloud, Amazon SageMaker, Kubernetes, Data Science, Python, Artificial Intelligence (AI), IBM Cloud Platform

NLP Engineer (via Toptal)

2023 - 2023
Mercatus Center at George Mason University - Main
  • Developed a text classification model for documents within 96 labels, using various NLP techniques for NAICS code probabilities.
  • Explored and combined advanced text classification techniques, improving F1 score by 15%.
  • Used Amazon SageMaker to provide an effective and insightful training and inference pipeline.
  • Achieved F1 scores in some categories up to 0.95 and 0.98 (from 0 – 1) in others using different techniques, which increased from 0.4 to 0.7.
Technologies: Natural Language Processing (NLP), Python, Generative Pre-trained Transformers (GPT), NLPP, Deep Neural Networks (DNNs), Amazon SageMaker, Transformers, Data Science, Artificial Intelligence (AI), TensorFlow

Managing Director

2013 - 2021
Niway Group
  • Managed daily investment operations, including a shopping mall and business towers, and represented the group before government bodies.
  • Reversed a seven-year financial loss into profit through significant operational changes.
  • Oversaw the financial management of constructing three 12-floor towers, with a total cost of R$43 million.
Technologies: Team Leadership, Finance, Data Science, Data Visualization, Python, Real Estate, CSV, Data Analysis, CTO, Workshop Facilitation, Analytics

Engineering Manager

2012 - 2013
Procter & Gamble
  • Implemented multiple line update projects across plants in France, Italy, and Spain, enhancing operational efficiency.
  • Developed and deployed cost-saving solutions across multiple factories, resulting in significant savings.
  • Led technical discussions with suppliers to ensure compliance with project requirements and specifications.
Technologies: Agile, Project Design & Management, Process Management, APIs, Linux, Supply Chain Management (SCM), Supply Chain Optimization, SARIMA, Data Processing, Data Analysis, Workshop Facilitation

Supply Chain Leader

2009 - 2012
Procter & Gamble
  • Led the design and implementation of a global pilot project to remodel the company's logistics sector, improving efficiency and reducing costs.
  • Addressed inventory cost issues, achieving a reduction from $12 million to $7 million.
  • Created a cross-docking supply chain prototype, resulting in annual savings of $2 million.
  • Coached and guided team members, ensuring coordinated efforts and successful project outcomes.
Technologies: Project Design & Management, Logistics, Agile, Forecasting, Data Science, Datasets, Supply Chain Management (SCM), Supply Chain Optimization, Data Processing, Data Analysis, Workshop Facilitation

Experience

CV: X-ray Pneumonia Detection

https://github.com/joao-d-oliveira/X-Ray_PneumoniaDetection
Developed a computer vision model to detect pneumonia from X-ray images with an 86% precision, comparable to a trained physician. The model processes X-ray images, detects foreign tissue, and predicts whether the image indicates pneumonia.

Power Generation Forecast for Wind and Solar Farms

http://www.ren.pt
Conducted data analysis and developed an ensemble of models with deep learning to forecast power generation for over 300 wind and solar farms in Portugal, enhancing energy management and prediction accuracy.

Surgery Assistance Software

Developed AI software for voice recognition and command interpretation in surgical settings, predicting tool usage based on historical data. I successfully designed and implemented the software architecture, achieving an MVP.

NLP in Healthcare | Score Clinical Patient Notes

https://www.kaggle.com/c/nbme-score-clinical-patient-notes
A project to classify each patient's probable disease according to actual notes taken from clinical trials by doctors. I developed a natural language processing (NLP) model using RoBERTa to classify each patient's disease based on clinical notes from trials.

CV: Image Captioning | Identifying Objects and Writing Caption

Developed a machine learning model that, through deep learning networks, analyses images, identifies objects, and captions the images accordingly. The project got a BLUE-1 score of 0.679 for an image caption—a score of 0.6 – 0.7 is considered best in class.

Computer Vision | Face Detection

A computer vision model built with ML techniques that uses video-based facial recognition. I developed a video-based facial recognition model with a false acceptance rate (FAR) of approximately 10^-5, suitable for security applications.

Email NLP/NLU/NER Analysis

Utilized advanced NLP techniques to extract insights from emails, achieving over 83% accuracy. I conducted data analysis, summarization, and classification of important information from the text.

Education

2003 - 2009

Master's Degree in Computer Science

University of Porto - Porto, Portugal

2007 - 2008

Exchange Program Coursework Toward Master's Degree in Computer Science

Delft University of Technology - Delft, Netherlands

Certifications

AUGUST 2022 - PRESENT

Quantum Excellence Certificate

IBM | Qiskit Global Summer School 2022

JULY 2022 - PRESENT

AI for Healthcare

Udacity

JULY 2021 - PRESENT

Machine Learning

Stanford University

JULY 2021 - PRESENT

Deep Reinforcement Learning

Udacity

JUNE 2021 - PRESENT

Advanced Computer Vision - Machine Learning

Udacity

Skills

Libraries/APIs

PyTorch, TensorFlow, Scikit-learn, Pandas, LSTM, Matplotlib, Keras, Pydantic, OpenCV, PyTorch Lightning, FFmpeg

Tools

You Only Look Once (YOLO), ARIMA, GitHub, Amazon SageMaker, Claude Code, SARIMA, ChatGPT, DeepSeek, NLPP, Oracle Demantra, Microsoft Copilot

Languages

Python 3, SQL, Python, R, Python 2, C++

Platforms

Linux, Amazon Web Services (AWS), Jupyter Notebook, Azure, Google Cloud Platform (GCP), Kubernetes, Docker, Backendless, AWS Lambda, IBM Cloud Platform

Storage

Data Pipelines, PostgreSQL, MySQL

Industry Expertise

System Development Life Cycle (SDLC)

Paradigms

Agile, DevOps, Model Context Protocol (MCP), Anomaly Detection

Frameworks

LangGraph

Other

Machine Learning, Deep Learning, Data Structures, Artificial Intelligence (AI), Algorithms, Team Leadership, Project Design & Management, Computer Vision, BERT, Natural Language Processing (NLP), APIs, Data Science, Deep Neural Networks (DNNs), Datasets, Language Models, OpenAI, Image Processing, Large Language Models (LLMs), Models, AI Programming, Data Processing Automation, Real Estate, Supply Chain Management (SCM), Supply Chain Optimization, Forecasting, Information Extraction, Energy, Generative Artificial Intelligence (GenAI), Neural Networks, Regression Modeling, Data Processing, Data Transformation, CSV, Data Analysis, Generative Pre-trained Transformers (GPT), Back-end, Generative Pre-trained Transformer 3 (GPT-3), OpenAI GPT-4 API, Workshop Facilitation, Analytics, Convolutional Neural Networks (CNNs), Sentiment Analysis, Point Clouds, Point Cloud Data, Gemini, AI Agents, Multimodal Models, Multimodal GenAI, Agentic AI, AI Architecture, AI Automation, Cursor AI, Gemini API, AI Tools, Architecture, AI-assisted Development, Software Development Lifecycle (SDLC), Data Analytics, Finance, Process Management, Logistics, Statistics, Computer Vision Algorithms, Data Visualization, Big Data Architecture, Machine Learning Operations (MLOps), Generative Adversarial Networks (GANs), Natural Language Understanding (NLU), Hugging Face, Cloud Platforms, Early-stage Startups, Web Development, Word Embedding, OpenAI GPT-3 API, API Integration, Speech Recognition, Scraping, Facial Recognition, Image Recognition, Speech-to-Text (STT), AI Enablement, Quantum Computing, Healthcare IT, Deep Reinforcement Learning, Object Detection, Generative Models, AI Design, Amazon RDS, Image Generation, CTO, Transformers, IBM Cloud, Qiskit, AgentGPT, Speech to Intent, LangChain, Prompt Engineering, GitHub Copilot Chat, Cline, Google Antigravity

Collaboration That Works

How to Work with Toptal

Toptal matches you directly with global industry experts from our network in hours—not weeks or months.

1

Share your needs

Discuss your requirements and refine your scope in a call with a Toptal domain expert.
2

Choose your talent

Get a short list of expertly matched talent within 24 hours to review, interview, and choose from.
3

Start your risk-free talent trial

Work with your chosen talent on a trial basis for up to two weeks. Pay only if you decide to hire them.

Top talent is in high demand.

Start hiring