Joao Diogo de Oliveira, Developer in Fortaleza - State of Ceará, Brazil

Joao Diogo de Oliveira

Verified Expert in Engineering

Machine Learning Engineer and Developer

Location
Fortaleza - State of Ceará, Brazil
Toptal Member Since
October 20, 2022

Joao is an accomplished AI/ML engineer with over 14 years of experience working with Fortune 100 companies such as Procter & Gamble and Hearst, as well as innovative startups in the healthcare, energy, and finance sectors. He holds a master's degree in computer engineering from the University of Porto and has earned multiple certifications in machine learning and deep learning. Joao's diverse expertise across various industries underscores his versatility and high skill level.

Portfolio

Hearst - Technology
Python, Artificial Intelligence (AI), Generative Pre-trained Transformers (GPT)...
Toptal
Artificial Intelligence (AI), Large Language Models (LLMs)...
EIS - Main
Machine Learning, Computer Vision, Deep Learning...

Availability

Part-time

Preferred Environment

Python 3, PyTorch, TensorFlow, Machine Learning, Google Cloud Platform (GCP), Amazon Web Services (AWS), Generative Artificial Intelligence (GenAI), Computer Vision, Deep Learning, Data Analysis

The most amazing...

...things I've led are two AI projects: one predicting energy for 300+ farms, delivered in 1.5 months, and one detecting pneumonia with 86% precision.

Work Experience

MVP Developer (via Toptal)

2023 - PRESENT
Hearst - Technology
  • Developed an MVP, replacing a 10-year-old legacy system in the healthcare industry within three to four weeks, resulting in improved cost, reliability, speed, interpretability, and accuracy using generative AI.
  • Achieved 93% accuracy in translating human language requests about financial data into complex SQL BigQuery queries, enabling users without SQL knowledge to access financial data such as loans and bonds.
  • Leveraged GenAI to extract features and analyze 1+ million archaeological images, significantly restoring and preserving past knowledge in a cost-efficient manner.
  • Researched and implemented advanced generative AI models and frameworks, including GPT-4, GPT-4 Turbo, Gemini, Claude 3, LlamaIndex, LangChain, and Auto-GPT, to enhance accessibility and usability for diverse user groups within the organization.
Technologies: Python, Artificial Intelligence (AI), Generative Pre-trained Transformers (GPT), Generative Pre-trained Transformer 3 (GPT-3), AgentGPT, Gemini, Generative Artificial Intelligence (GenAI), Google Cloud Platform (GCP), Azure, AI Agents, Information Extraction, Generative AI, Large Language Models (LLMs), Data Science, Natural Language Processing (NLP), Amazon Web Services (AWS), OpenAI, Multimodal Models, Multimodal GenAI

Internal Consultant

2023 - PRESENT
Toptal
  • Provided vision and strategic guidance for internal AI projects, including infrastructure, techniques, and models to achieve project goals.
  • Delivered a comprehensive 3-day workshop on large language models (LLMs) to an audience of over 100 attendees, enhancing their understanding and application of LLMs.
  • Conducted advanced technical workshops on topics such as "Introduction to Quantum Computing" and "Reinforcement Learning," equipping participants with cutting-edge knowledge and skills.
Technologies: Artificial Intelligence (AI), Large Language Models (LLMs), Generative Pre-trained Transformers (GPT)

Machine Learning Developer (via Toptal)

2023 - PRESENT
EIS - Main
  • Conducted a feasibility study and implemented a POC for capturing, counting, and geo-locating valves in oil and gas plant scans.
  • Developed an AI model to identify valves in image batches from plant scans, improving detection accuracy and efficiency.
  • Implemented a method to automatically process and slice point cloud data, extracting images and transforming them into 2D representations.
  • Labeled 3D data to train deep learning models for 3D segmentation, successfully applying models such as PointNet and PointNet++ to real data.
  • Developed an inference pipeline to label unseen data and output labeled point clouds, enhancing data processing capabilities.
Technologies: Machine Learning, Computer Vision, Deep Learning, Convolutional Neural Networks (CNN), Artificial Intelligence (AI), Point Clouds, Point Cloud Data, Image Processing, Natural Language Processing (NLP), Python, TensorFlow, PyTorch

IT Engineer | Artificial Intelligence Engineer

2019 - PRESENT
Freelance Clients
  • Developed an AI project for energy prediction of solar and wind farms, totaling 2.6 GW of installed power, optimizing energy output and management.
  • Built a computer vision model for face recognition, enhancing security and identification processes.
  • Created a computer vision model to assist in pneumonia detection through X-rays, improving diagnostic accuracy.
  • Provided consulting services for wind certification of two offshore projects, predicting a combined installed power of 2 GW.
  • Managed and maintained over 20 distributed Linux servers, keeping them secure and up to date and creating key performance indicators (KPIs) for performance tracking.
Technologies: Python 2, Python 3, Deep Learning, Statistics, Data Analytics, Python, Data Science, Deep Neural Networks, Big Data Architecture, Linux, Datasets, Pandas, Machine Learning Operations (MLOps), Image Processing, Hardware, Large Language Models (LLMs), Models, AI Programming, Generative Pre-trained Transformers (GPT), Natural Language Processing (NLP), Data Processing Automation, Artificial Intelligence (AI), Image Generation, ARIMA, ARIMA Models, LSTM, SARIMA, R, Matplotlib, Information Extraction, GitHub, Cloud Platforms, Data Pipelines, Energy, Neural Networks, Regression Modeling, Data Processing, Data Transformation, CSV, Data Analysis, Back-end, DevOps, Amazon SageMaker, Jupyter Notebook, Speech Recognition, Scraping, Analytics, FFmpeg, Keras, Sentiment Analysis, Image Recognition, TensorFlow, PyTorch, Computer Vision, Generative AI, OpenAI, Speech to Text, Speech to Intent

AI Developer (via Toptal)

2022 - 2024
Peyton & Greyson Solutions Inc.
  • Developed an AI application for automatic proposal writing, saving 20% of a specialized employee's time and increasing efficiency.
  • Architected the entire IT solution, encompassing database selection, AWS serverless services, a web app back end, API configuration, and AI model deployment.
  • Tracked team development, ensuring milestones were met and successfully delivering from demos to critical project deliverables.
Technologies: Artificial Intelligence (AI), AI Design, Generative Adversarial Networks (GANs), Language Models, OpenAI, APIs, Backendless, Amazon Web Services (AWS), AWS Lambda, Amazon RDS, Python, DaVinci, Large Language Models (LLMs), Models, AI Programming, Natural Language Understanding (NLU), Matplotlib, Natural Language Processing (NLP), Generative Pre-trained Transformers (GPT), Information Extraction, GitHub, Cloud Platforms, Data Pipelines, Early-stage Startups, Data Processing, Data Transformation, Back-end, ChatGPT, OpenAI GPT-3 API, Generative Pre-trained Transformer 3 (GPT-3), DevOps, Amazon SageMaker, Jupyter Notebook, OpenAI GPT-4 API, Kubernetes, Scraping, Analytics, Keras, Sentiment Analysis, Generative AI, Data Structures

Product Owner | Country Manager

2017 - 2024
Prewind
  • Developed AI models for deep learning, weather forecasting, and energy prediction across multiple markets, enhancing predictive capabilities.
  • Conducted comprehensive business and data analytics for customers, providing actionable insights.
  • Successfully established a European institute in Brazil, expanding the organization's reach and impact.
  • Managed a portfolio of clients with a combined energy production of 3+ GW, optimizing energy management and client satisfaction.
Technologies: Deep Learning, Artificial Intelligence (AI), Machine Learning, Data Analytics, Data Science, Data Visualization, Linux, Datasets, Pandas, Amazon Web Services (AWS), Python, Hardware, Models, Matplotlib, Information Extraction, GitHub, Early-stage Startups, Energy, Neural Networks, Data Transformation, CSV, Data Analysis, Back-end, DevOps, Workshop Facilitation, Analytics, Sentiment Analysis, Image Recognition

Team Leader

2023 - 2023
Stop the Traffik
  • Analyzed key tech issues in a volunteer organization and developed a plan to address them, leading a team of 11 volunteers across nine countries.
  • Led a team of ML/AI specialists to develop an AI model for sentiment analysis, automating the classification of trafficking articles and eliminating manual labor.
  • Guided a team of ML/AI specialists to enhance a legacy model, improving the classification of articles into relevant and non-relevant categories.
  • Steered the project through regular meetings to keep the team engaged and deliver the proposed outcomes to the organization. Participated in all parts of development (AI, DevOps, Python) to ensure that commitments were met.
Technologies: IBM Cloud, Amazon SageMaker, Kubernetes, Data Science, Python, Artificial Intelligence (AI), IBM Cloud Platform

NLP Engineer (via Toptal)

2023 - 2023
Mercatus Center at George Mason University - Main
  • Developed a text classification model that assigns documents to 96 labels, using various NLP techniques to estimate NAICS code probabilities.
  • Explored and combined advanced text classification techniques, improving F1 score by 15%.
  • Used Amazon SageMaker to provide an effective and insightful training and inference pipeline.
  • Achieved F1 scores (on a scale of 0 to 1) of up to 0.95 in some categories and 0.98 in others using different techniques, up from earlier scores of 0.4 to 0.7.
Technologies: Natural Language Processing (NLP), Python, Generative Pre-trained Transformers (GPT), NLPP, Deep Neural Networks, Amazon SageMaker, Transformers, Data Science, Artificial Intelligence (AI), TensorFlow

Managing Director

2013 - 2021
Niway Group
  • Managed daily investment operations, including a shopping mall and business towers, and represented the group before government bodies.
  • Reversed a seven-year financial loss into profit through significant operational changes.
  • Oversaw the financial management of constructing three 12-floor towers, with a total cost of R$43 million.
Technologies: Team Leadership, Finance, Data Science, Data Visualization, Python, Real Estate, CSV, Data Analysis, CTO, Workshop Facilitation, Analytics

Engineering Manager

2012 - 2013
Procter & Gamble
  • Implemented multiple line update projects across plants in France, Italy, and Spain, enhancing operational efficiency.
  • Developed and deployed cost-saving solutions across multiple factories, resulting in significant savings.
  • Led technical discussions with suppliers to ensure compliance with project requirements and specifications.
Technologies: Agile, Project Design & Management, Process Management, APIs, Linux, Hardware, Supply Chain Management (SCM), Supply Chain Optimization, SARIMA, Data Processing, Data Analysis, Workshop Facilitation

Supply Chain Leader

2009 - 2012
Procter & Gamble
  • Led the design and implementation of a global pilot project to remodel the company's logistics sector, improving efficiency and reducing costs.
  • Addressed inventory cost issues, achieving a reduction from $12 million to $7 million.
  • Created a cross-docking supply chain prototype, resulting in annual savings of $2 million.
  • Coached and guided team members, ensuring coordinated efforts and successful project outcomes.
Technologies: Project Design & Management, Logistics, Agile, Forecasting, Data Science, Datasets, Supply Chain Management (SCM), Supply Chain Optimization, Data Processing, Data Analysis, Workshop Facilitation

CV: X-ray Pneumonia Detection

https://github.com/joao-d-oliveira/X-Ray_PneumoniaDetection
Developed a computer vision model to detect pneumonia from X-ray images with 86% precision, comparable to a trained physician. The model processes X-ray images, detects foreign tissue, and predicts whether the image indicates pneumonia.
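For readers unfamiliar with the metric, precision is the share of positive predictions that are actually positive. The sketch below is illustrative only: the function name and the counts are hypothetical and are not taken from the linked repository.

```python
def precision(true_positives: int, false_positives: int) -> float:
    """Fraction of predicted-positive cases that are truly positive."""
    predicted_positive = true_positives + false_positives
    if predicted_positive == 0:
        # No positive predictions were made; report 0.0 by convention.
        return 0.0
    return true_positives / predicted_positive


# Hypothetical counts chosen only to illustrate an 86% precision:
# of 100 X-rays flagged as pneumonia, 86 are confirmed cases.
example_precision = precision(true_positives=86, false_positives=14)
```

Precision alone says nothing about missed cases, so it is usually read alongside recall.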

Power Generation Forecast for Wind and Solar Farms

http://www.ren.pt
Conducted data analysis and developed an ensemble of models with deep learning to forecast power generation for over 300 wind and solar farms in Portugal, enhancing energy management and prediction accuracy.

Surgery Assistance Software

Developed AI software for voice recognition and command interpretation in surgical settings, predicting tool usage based on historical data. I successfully designed and implemented the software architecture, achieving an MVP.

NLP in Healthcare | Score Clinical Patient Notes

https://www.kaggle.com/c/nbme-score-clinical-patient-notes
I developed a natural language processing (NLP) model using RoBERTa to classify each patient's probable disease from actual notes taken by doctors during clinical trials.

CV: Image Captioning | Identifying Objects and Writing Caption

Developed a machine learning model that, through deep learning networks, analyzes images, identifies objects, and captions the images accordingly. The project got a BLEU-1 score of 0.679 for image captioning; a score of 0.6 to 0.7 is considered best in class.
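As background on the metric, BLEU-1 is clipped unigram precision multiplied by a brevity penalty. The following is a minimal sketch under that standard definition; the function name and the example captions are hypothetical, and this is not the project's evaluation code.

```python
import math
from collections import Counter


def bleu_1(candidate: list[str], references: list[list[str]]) -> float:
    """Clipped unigram precision times the brevity penalty."""
    if not candidate or not references:
        return 0.0
    cand_counts = Counter(candidate)
    # For each token, the highest count seen in any single reference.
    max_ref_counts: Counter = Counter()
    for ref in references:
        for token, count in Counter(ref).items():
            max_ref_counts[token] = max(max_ref_counts[token], count)
    # Clip candidate counts so repeated tokens cannot inflate the score.
    clipped = sum(min(count, max_ref_counts[token])
                  for token, count in cand_counts.items())
    unigram_precision = clipped / len(candidate)
    # Brevity penalty uses the reference length closest to the candidate's
    # (ties resolved toward the shorter reference).
    c_len = len(candidate)
    r_len = min((len(ref) for ref in references),
                key=lambda length: (abs(length - c_len), length))
    brevity_penalty = 1.0 if c_len > r_len else math.exp(1 - r_len / c_len)
    return brevity_penalty * unigram_precision


# Hypothetical caption pair: 5 of 6 candidate tokens appear in the reference.
score = bleu_1("a dog runs on the grass".split(),
               ["a dog runs across the grass".split()])
```

Because BLEU-1 only counts single-word overlap, it rewards naming the right objects but not word order.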

Computer Vision | Face Detection

I developed a video-based facial recognition model using ML techniques, with a false acceptance rate (FAR) of approximately 10^-5, suitable for security applications.
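To put the FAR figure in context, the rate is the number of impostor attempts wrongly accepted divided by the total number of impostor attempts. The sketch below uses hypothetical counts and a hypothetical function name; it is not drawn from the project itself.

```python
def false_acceptance_rate(false_accepts: int, impostor_attempts: int) -> float:
    """Share of impostor attempts that the system wrongly accepts."""
    if impostor_attempts <= 0:
        raise ValueError("impostor_attempts must be positive")
    return false_accepts / impostor_attempts


# Hypothetical counts: one wrongly accepted impostor in 100,000 attempts
# corresponds to a FAR of 10^-5.
example_far = false_acceptance_rate(false_accepts=1, impostor_attempts=100_000)
```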

Email NLP/NLU/NER Analysis

Utilized advanced NLP techniques to extract insights from emails, achieving over 83% accuracy. I conducted data analysis, summarization, and classification of important information from the text.

Education

2003 - 2009

Master's Degree in Computer Science

University of Porto - Porto, Portugal

2007 - 2008

Exchange Program Coursework Toward Master's Degree in Computer Science

Delft University of Technology - Delft, Netherlands

AUGUST 2022 - PRESENT

Quantum Excellence Certificate

IBM | Qiskit Global Summer School 2022

JULY 2022 - PRESENT

AI for Healthcare

Udacity

JULY 2021 - PRESENT

Machine Learning

Stanford University

JULY 2021 - PRESENT

Deep Reinforcement Learning

Udacity

JUNE 2021 - PRESENT

Advanced Computer Vision - Machine Learning

Udacity

Libraries/APIs

PyTorch, TensorFlow, Scikit-learn, Pandas, LSTM, Matplotlib, Keras, OpenCV, PyTorch Lightning, FFmpeg

Tools

GitHub, Amazon SageMaker, ChatGPT, You Only Look Once (YOLO), NLPP, Oracle Demantra

Languages

Python 3, SQL, Python, R, Python 2, C++

Paradigms

Data Science, Agile, DevOps, Anomaly Detection

Platforms

Linux, Amazon Web Services (AWS), Jupyter Notebook, Google Cloud Platform (GCP), Kubernetes, Docker, Azure, Backendless, AWS Lambda, IBM Cloud Platform

Storage

Data Pipelines, PostgreSQL, MySQL

Other

Machine Learning, Deep Learning, Data Structures, Artificial Intelligence (AI), Algorithms, Team Leadership, Project Design & Management, Computer Vision, BERT, Natural Language Processing (NLP), Deep Neural Networks, Datasets, Language Models, OpenAI, Image Processing, Hardware, Large Language Models (LLMs), Models, AI Programming, Data Processing Automation, Real Estate, ARIMA, ARIMA Models, Supply Chain Management (SCM), Supply Chain Optimization, Forecasting, Information Extraction, Energy, Neural Networks, Regression Modeling, Data Processing, Data Transformation, CSV, Data Analysis, Generative Pre-trained Transformers (GPT), Back-end, Generative Pre-trained Transformer 3 (GPT-3), OpenAI GPT-4 API, Workshop Facilitation, Analytics, Convolutional Neural Networks (CNN), Sentiment Analysis, Point Clouds, Point Cloud Data, Gemini, Generative AI, Multimodal Models, Multimodal GenAI, Data Analytics, Process Management, Logistics, Statistics, Computer Vision Algorithms, Data Visualization, Big Data Architecture, Machine Learning Operations (MLOps), Generative Adversarial Networks (GANs), DaVinci, SARIMA, Natural Language Understanding (NLU), Hugging Face, Cloud Platforms, Early-stage Startups, Generative Artificial Intelligence (GenAI), Web Development, Word Embedding, OpenAI GPT-3 API, API Integration, Speech Recognition, Scraping, Facial Recognition, Image Recognition, AI Agents, Speech to Text, Finance, Quantum Computing, Healthcare IT, Deep Reinforcement Learning, APIs, Object Detection, Generative Models, AI Design, Amazon RDS, Image Generation, CTO, Transformers, IBM Cloud, Qiskit, AgentGPT, Speech to Intent
