Guilherme Morais, Developer in Parede, Portugal
Guilherme is available for hire
Hire Guilherme

Guilherme Morais

Verified Expert  in Engineering

Bio

Guilherme is a data scientist motivated by problem solving and an engineer experienced in the Python stack for machine learning. He develops reproducible and sustainable code with comprehensive documentation. Guilherme is also well-versed in project management with a track record of delivering enabling solutions on time. He excels at understanding the needs of cross-functional stakeholders and working collaboratively in international environments.

Portfolio

Loggi
Python, Amazon Web Services (AWS), Streamlit, GitHub, Data Science...
Hospital Albert Einstein
Python, Amazon Web Services (AWS), Docker, Machine Learning, Kubernetes...
University Hospital Tübingen
Technical Documentation, Experimental Design, Data Visualization...

Experience

  • Data Visualization - 8 years
  • Software Development - 6 years
  • Technical Documentation - 6 years
  • Machine Learning - 5 years
  • Data Science - 5 years
  • Visual Studio Code (VS Code) - 5 years
  • Python - 5 years
  • Amazon Web Services (AWS) - 3 years

Availability

Part-time

Preferred Environment

Python, Amazon Web Services (AWS), Visual Studio Code (VS Code), Slack, GitHub

The most amazing...

...model I've created enabled community health workers to identify hypertensive patients with a high risk for severe complications.

Work Experience

Data Specialist

2021 - 2022
Loggi
  • Implemented an end-to-end data pipeline on AWS for external data consolidation.
  • Optimized insurance-related pricing to balance costs and market prices.
  • Created a quasi-experimental design (interrupted time series) to assess customer behavior.
  • Implemented a pipeline for customization of pricing tables based on client segmentation.
Technologies: Python, Amazon Web Services (AWS), Streamlit, GitHub, Data Science, Data Analytics, SQL, Scikit-learn

Data Scientist

2018 - 2020
Hospital Albert Einstein
  • Recruited and mentored several data scientists for a project with the Ministry of Health.
  • Created models for sedentarism, chronic diseases, socioeconomic indexing, and death certificates.
  • Implemented and deployed a Kubernetes-based platform on AWS for big data projects.
  • Developed a Python package with refactored code for data science in healthcare.
Technologies: Python, Amazon Web Services (AWS), Docker, Machine Learning, Kubernetes, Data Visualization, Dashboards, Technical Documentation, Visual Studio Code (VS Code), Data Science, Data Analytics, Predictive Modeling, SQL, TensorFlow, Scikit-learn, Keras

Research Associate

2017 - 2018
University Hospital Tübingen
  • Applied machine learning toward brain-computer interfaces within a clinical environment.
  • Implemented a pipeline for data preprocessing, hyperparameter tuning, and model selection.
  • Analyzed data statistically and translated it to visualizations to ensure outcomes reliability.
Technologies: Technical Documentation, Experimental Design, Data Visualization, Statistical Analysis, Machine Learning, Brain-computer Interface, MATLAB, Data Science

Research Collaborator

2014 - 2017
Federal University of ABC
  • Designed experiments, performed statistical analyses, and generated visualizations.
  • Developed a publicly available toolbox that has 100+ literature citations.
  • Documented and published ten peer-reviewed papers, including two as the first author.
Technologies: Data Analysis, Experimental Design, Statistical Analysis, Technical Documentation, Software Development, MATLAB, Data Visualization, Data Science, Data Analytics

Development Engineer | Territory Manager

2013 - 2017
NIRx Medical Technologies
  • Developed software for data acquisition and real-time visualization.
  • Optimized real-time data streaming, user interface, and measurement stability.
  • Communicated with customers, partners, and local distributors in several countries.
  • Organized workshops and presentations for multi-background audiences.
  • Created technical and business solutions to effectively address user needs and pain points.
  • Managed the project and outlined goals oriented toward customer experience.
Technologies: Software Development, Streaming Data, Data Visualization, Customer Support, Workshops, Real-time Data

Master's Thesis

2012 - 2012
Berlin Institute of Technology
  • Applied machine learning techniques for EEG-brain-computer interfaces.
  • Implemented features for clustering and penalization with a one-class support vector machine.
  • Improved model performance by 80% compared to the original baseline model.
Technologies: Machine Learning, Data Processing, Data Visualization, Data Science

Experience

Prediction of Severe Complications in Patients with Hypertension

A prediction model to classify hypertensive patients according to their risk of developing severe complications, such as stroke and heart failure. The model was trained on data collected by community health workers in Brazil during routine household visits. The goal was to leverage the current structure of primary care and be readily available for scalability in the country. It was endorsed by local physicians and set to be validated with a randomized controlled trial.

Optimization of Pricing Strategies in Logistics Industry

Real-time optimization of pricing strategies based on consolidated market data and observed costs. The algorithm was embedded into a dashboard served on the cloud and presented features for simulating key business metrics within different scenarios to enable stakeholders' decisions.

Pipeline for Data Ingestion, Processing, and Consolidation

A Python-based pipeline communicating with cloud services (Amazon S3, Amazon Athena, and AWS Glue) to consolidate incoming external data into the company's data lake. The pipeline could be triggered by identifying new data available and then run all processes autonomously. For each step performed, relevant logs would also be stored for tracking and debugging.

Package for Data Science Applications in Healthcare

A Python package with refactored code for data science in healthcare. It provided modules to facilitate the processing, visualization, and modeling of data structures commonly used within the healthcare industry. It enabled a team of several data scientists to quickly and reliably run exploratory data analysis, model training, validation, and visualization within hours instead of weeks.

Cloud Platform for Modeling Projects with Big Data

A Kubernetes-based platform implemented on AWS with a common endpoint for the data science team. Users could allocate the necessary resources on a base instance and instantiate on-demand clusters for parallel computing and GPU computing. The instances were created upon containers of a common Docker image to guarantee reproducibility of the environment by team members.

Education

2010 - 2012

Master's Degree in Biomedical Engineering

Politecnico di Milano - Milan, Italy

2007 - 2012

Bachelor's Degree in Electrical Engineering

University of São Paulo - São Paulo, Brazil

Skills

Libraries/APIs

Scikit-learn, Keras, XGBoost, Pandas, Dask, TensorFlow

Tools

GitHub, MATLAB

Languages

Python, SQL

Frameworks

LightGBM, Streamlit

Platforms

Amazon Web Services (AWS), Visual Studio Code (VS Code), Docker, Kubernetes

Storage

Data Pipelines

Other

Machine Learning, Data Visualization, Data Science, Predictive Modeling, Data Analytics, Signal Processing, Data Mining, Software Development, Dashboards, Technical Documentation, Artificial Intelligence (AI), Experimental Design, Statistical Analysis, Brain-computer Interface, Data Analysis, Streaming Data, Customer Support, Workshops, Real-time Data, Data Processing, Linear Algebra, Calculus, Artificial Neural Networks (ANN), Optimization, Graphics Processing Unit (GPU)

Collaboration That Works

How to Work with Toptal

Toptal matches you directly with global industry experts from our network in hours—not weeks or months.

1

Share your needs

Discuss your requirements and refine your scope in a call with a Toptal domain expert.
2

Choose your talent

Get a short list of expertly matched talent within 24 hours to review, interview, and choose from.
3

Start your risk-free talent trial

Work with your chosen talent on a trial basis for up to two weeks. Pay only if you decide to hire them.

Top talent is in high demand.

Start hiring