Joao Diogo de Oliveira, Developer in Fortaleza, Brazil

Joao Diogo de Oliveira

Machine Learning Engineer and Developer

Location
Fortaleza, Brazil
Toptal Member Since
October 20, 2022

Joao is a machine learning engineer experienced in developing models using deep learning, stacking, and ensemble. He is motivated by complex challenges, such as computer vision, quantum computing, AI in healthcare, and reinforcement learning. After successfully performing in multinationals like Procter & Gamble and startups in various industries such as energy, finance, logistics, consumer goods, real estate, and the wood industry. Joao is looking forward to the next big challenge.

Joao is available for hire
Hire Joao

Portfolio

Peyton & Greyson Solutions Inc,
Artificial Intelligence (AI), AI Design, Generative Adversarial Networks (GANs)...
Freelance Clients
Python 2, Python 3, Deep Learning, Gradient Boosting, Statistics...
Prewind
Deep Learning, Artificial Intelligence (AI), Machine Learning, Data Analytics...

Location

Fortaleza, Brazil

Availability

Full-time

Preferred Environment

Python 3, PyTorch, TensorFlow, R, Machine Learning, Google Cloud Platform (GCP), Amazon Web Services (AWS)

The most amazing...

...project I've led is predicting over 300 wind and solar farms in a record time of 1.5 months.

Work Experience

2022 - PRESENT

AI Developer

Peyton & Greyson Solutions Inc,
  • Developed an AI application for writing automatic proposals, saving at least 20% of the time from a specialized employee.
  • Designed and architected the entire IT solution: a) database choice and detail; b) AWS Serverless Services; b) chose and set up the web app back-end implementation; c) API configuration; d) to complete AI model development and deployment.
  • Tracked team members' development and ensured that milestones were met, from demos to critical development deliverables.
  • Tailored the GPT-3 model to a specific business case successfully.
Technologies: Artificial Intelligence (AI), AI Design, Generative Adversarial Networks (GANs), Language Models, OpenAI, APIs, Backendless, Amazon Web Services (AWS), AWS Lambda, AWS Amplify, Amazon RDS, Python, DaVinci, Architecture, Large Language Model (LLM), Models, AI Programming, Natural Language Understanding (NLU), Matplotlib, Natural Language Processing (NLP), Generative Pre-trained Transformers (GPT), GPT, Information Extraction, GitHub, Cloud Platforms, Data Pipelines, Early-stage Startups, Data Processing, Data Transformation, Back-end, ChatGPT, OpenAI GPT-3 API, Generative Pre-trained Transformer 3 (GPT-3), DevOps, Amazon SageMaker, Jupyter Notebook
2019 - PRESENT

IT Engineer | Artificial Intelligence Engineer

Freelance Clients
  • Developed an artificial intelligence AI project for energy prediction of solar and wind farms, summing up 2.6 GW of installed power.
  • Built a model for computer vision that did face recognition.
  • Created a model using computer vision to ease pneumonia detection through X-ray.
  • Provided consulting services to deliver wind certification for two offshore projects with a combined predicted installed power of 2GW.
  • Maintained over 20 distributed Linux servers, updating, securing, and creating key performance indicators KPIs.
Technologies: Python 2, Python 3, Deep Learning, Gradient Boosting, Statistics, Data Analytics, Python, Data Science, Deep Neural Networks, Big Data, Big Data Architecture, Linux, Datasets, Pandas, Machine Learning Operations (MLOps), Image Processing, Architecture, Hardware, Large Language Model (LLM), Models, AI Programming, Generative Pre-trained Transformers (GPT), Natural Language Processing (NLP), GPT, Data Processing Automation, Artificial Intelligence (AI), Image Generation, ARIMA, ARIMA Models, LSTM, SARIMA, R, Matplotlib, Information Extraction, GitHub, Cloud Platforms, Data Pipelines, Energy, Neural Networks, Regression Modeling, Data Processing, Data Transformation, CSV, Data Analysis, Back-end, DevOps, Amazon SageMaker, Jupyter Notebook
2017 - PRESENT

Product Owner | Country Manager

Prewind
  • Developed AI models, including deep learning, weather forecast, and energy prediction for multiple markets.
  • Performed business and data analytics for customers.
  • Led the successful establishment of a European institute in Brazil.
  • Managed a portfolio of clients with a combined production of over 3 GW of energy.
Technologies: Deep Learning, Artificial Intelligence (AI), Machine Learning, Data Analytics, Data Science, Data Visualization, Linux, Datasets, Pandas, Amazon Web Services (AWS), Python, Hardware, Models, Matplotlib, Information Extraction, GitHub, Early-stage Startups, Energy, Neural Networks, Data Transformation, CSV, Data Analysis, Back-end, DevOps
2013 - PRESENT

Managing Director

Niway Group
  • Managed daily operations of the group's investments, including a shopping mall, business towers, and representation before official government bodies.
  • Reversed a seven-year loss into profit by applying substantial and stable changes.
  • Supervised the financial control of the construction of three towers, 12 floors each, with a total cost of R$ 43 million.
Technologies: Team Leadership, Finance, Data Science, Data Visualization, Python, Real Estate, CSV, Data Analysis, CTO
2023 - 2023

NLP Engineer

Mercatus Center at George Mason University - Main
  • Developed a long text classification for documents within 96 labels. The purpose was to use different NLP techniques to get probabilities of the three digits NAICS codes.
  • Explored literature on the most advanced techniques of text classification and long text and applied them; Combined the different techniques to achieve a better result, achieving an improvement of 15% on the F1 score.
  • Used AWS SageMaker to provide an effective and insightful training and inference pipeline.
  • Achieved F1 scores on some categories up to 0.95-0.98 on others using different techniques increased from 0.4 to 0.7.
Technologies: Natural Language Processing (NLP), Python, GPT, Generative Pre-trained Transformers (GPT), NLPP, Deep Neural Networks, Amazon SageMaker, Transformers
2012 - 2013

Engineering Manager

Procter & Gamble
  • Implemented multiple line update projects across plants in France, Italy, and Spain.
  • Developed cost-saving solutions and deployed them across multiple factories.
  • Led technical discussions with suppliers to make sure they would meet the requirements.
Technologies: Agile, Project Budget Management, Project Design & Management, Process Management, APIs, Linux, Hardware, Supply Chain Management, Supply Chain Optimization, SARIMA, Data Processing, Data Analysis
2009 - 2012

Supply Chain Leader

Procter & Gamble
  • Led the design and implementation of a global pilot project to remodel the company's logistics sector.
  • Found a solution to complex problems of inventory costs, achieving a reduction from $12 million to $7 million.
  • Participated in creating an internal cross-docking supply chain prototype, resulting in yearly savings of $2 million.
  • Coached, guided, and coordinated the work of multiple team members.
Technologies: Project Design & Management, Logistics, Agile, Forecasting, Data Science, Datasets, Supply Chain Management, Supply Chain Optimization, Data Processing, Data Analysis

Experience

NLP in Healthcare | Score Clinical Patient Notes

https://www.kaggle.com/c/nbme-score-clinical-patient-notes
A project to classify each patient's probable disease according to actual notes taken from clinical trials by doctors and my task was to build a natural language processing (NLP) model on top of the foundation framework RoBERTa.

X-ray Pneumonia Detection

https://github.com/joao-d-oliveira/X-Ray_PneumoniaDetection
A computer vision model, which receives an X-ray image and detects the presence of foreign tissue, and predicts whether the image belongs to a patient with pneumonia or not. The model performed similarly to a trained physician, with a precision of 86% (no pneumonia) and 19% (pneumonia).

Power Generation Forecast for Wind and Solar Farms

A power generation forecast for over 300 wind and solar farms spread across Portugal. I performed the data analysis for the plant's geolocation and wind and solar profile, structuring all the data, building an ensemble of around five models per farm, and training and deploying the models.

Computer Vision - Face detection

A computer vision model, built with ML techniques, that does video-based facial recognition. I was instrumental in making the model and the necessary pipeline from the beginning. Additionally, I've achieved a positive false acceptance rate (FAR) of around 10^-5, meeting clients' needs.

Developing AI Automated Proposal Generation

The application provides automation for Proposal Writing, as the idea was to develop a model and WebApplication to support the model to save the time of specialized employees by at least 20% and I've accomplished developing a working AI Model based on GPT-3. I've also designed and developed the structure and architecture of the web application, making most of the back-end functions and all database architecture.

Image Captioning - Identifying Objects and Writing Caption

Developed a machine learning model that, through deep learning networks, analyses images, identifies objects, and captions the images accordingly; The project got a BLUE-1 score of 0.679 for an image caption, a score of 0.6-0.7 is considered best in class.

Email NLP/NLU/NER Analysis

Through advanced techniques of NLP, extract insights from emails. Classify within a set of pre-defined (achieving an overall score of +83% accuracy overall), extracting important information from the text, doing data analysis, summarisation, and other relevant tasks.

Skills

Languages

Python 3, SQL, Python, R, Python 2, C++

Libraries/APIs

Scikit-learn, Pandas, LSTM, Matplotlib, PyTorch, TensorFlow, OpenCV, PyTorch Lightning

Tools

GitHub, Amazon SageMaker, You Only Look Once (YOLO), NLPP

Paradigms

Data Science, Agile, DevOps, Anomaly Detection

Platforms

Linux, Amazon Web Services (AWS), Jupyter Notebook, Google Cloud Platform (GCP), Backendless, AWS Lambda

Storage

Data Pipelines, PostgreSQL, MySQL

Other

Machine Learning, Deep Learning, Data Structures, Artificial Intelligence (AI), Algorithms, Team Leadership, Project Design & Management, Computer Vision, Natural Language Processing (NLP), Deep Neural Networks, Datasets, Language Models, Image Processing, Hardware, Models, AI Programming, Data Processing Automation, Real Estate, ARIMA, ARIMA Models, Supply Chain Management, Supply Chain Optimization, Forecasting, Information Extraction, Energy, Neural Networks, Regression Modeling, Data Processing, Data Transformation, CSV, Data Analysis, GPT, Generative Pre-trained Transformers (GPT), Back-end, Generative Pre-trained Transformer 3 (GPT-3), Data Analytics, Process Management, Logistics, Statistics, Computer Vision Algorithms, Data Visualization, Big Data Architecture, Machine Learning Operations (MLOps), Generative Adversarial Networks (GANs), OpenAI, DaVinci, Large Language Model (LLM), SARIMA, Natural Language Understanding (NLU), Hugging Face, Cloud Platforms, Early-stage Startups, Generative Artificial Intelligence (AI), Generative Adversarial Networks (GAN), Web Development, Word Embedding, ChatGPT, OpenAI GPT-3 API, API Integration, Finance, Quantum Computing, Healthcare IT, Deep Reinforcement Learning, BERT, APIs, Object Detection, Generative Models, AI Design, Amazon RDS, Image Generation, CTO, Transformers

Education

2003 - 2009

Master's Degree in Computer Science

University of Porto - Porto, Portugal

2007 - 2008

Exchange Program Coursework Toward Master's Degree in Computer Science

Delft University of Technology - Delft, Netherlands

Certifications

AUGUST 2022 - PRESENT

Quantum Excellence Certificate

IBM | Qiskit Global Summer School 2022

JULY 2022 - PRESENT

AI for Healthcare

Udacity

JULY 2021 - PRESENT

Machine Learning

Stanford University

JULY 2021 - PRESENT

Deep Reinforcement Learning

Udacity

JUNE 2021 - PRESENT

Advanced Computer Vision - Machine Learning

Udacity