Felipe Costa Farias
Verified Expert in Engineering
Machine Learning and Software Developer
Recife, Brazil
Toptal member since February 8, 2021
Felipe has a Ph.D. in machine learning and a professional software development background. He has worked with machine learning since 2011 and has the experience and skills necessary to fulfill AI/DS roles, from development to team management. Felipe has applied AI to several areas, such as computer vision, natural language processing, time series, bioinformatics, and pharmacy. He is a quick learner with excellent knowledge of Python and artificial intelligence.
Portfolio
Experience
Availability
Preferred Environment
Slack, Visual Studio Code (VS Code), Vim Text Editor, Linux, Docker, Terminal
The most amazing...
...thing that I've developed was a machine learning model to predict molecular properties to improve drug discovery.
Work Experience
Machine Learning Researcher
Entos
- Developed machine learning models to predict molecular properties.
- Performed machine learning distributed training in 288 GPUs.
- Implemented different graph neural network algorithms.
- Performed statistical analysis on different models.
- Developed a machine learning pipeline with Apache Airflow.
- Participated as the team leader of ML-related projects.
AI Consultant
Elife Brasil
- Created models in Python to perform sentiment analysis on social media texts about companies using scikit-learn and Spacy/Gensim.
- Used state-of-the-art AI models such as transformers and classical word-embedding to perform text understanding with Python and NLTK.
- Developed a model, which uses unsupervised and supervised learning, to automatically suggest responses to clients' emails based on previous human written responses.
- Created PyTorch deep learning models with convolutional neural networks to identify a person's age based on their photo.
- Created PyTorch deep learning models with convolutional neural networks to identify a person's gender based on their photo.
- Deployed models as APIs to a cloud computing platform (Hetzner).
Data Scientist
Confidential Name
- Developed different ML models for a battery lifecycle management startup to detect anomalies in batteries.
- Developed an API to serve different ML battery models.
- Performed exploratory data analysis and cleaned the data to create the ML models.
Researcher/Professor
IFPE Instituto Federal de Pernambuco
- Presented lectures on several topics regarding computer science field, specifically data science and artificial intelligence courses.
- Participated in industrial R&D to develop methods to solve complex problems using AI.
- Led a research group (BRAINS - Brazilian Research in Artificial Intelligence and Systems).
AI Consultant
Elife Brasil
- Developed a machine learning model with biomarkers, social data, and laboratory test results to predict sepsis six hours in advance.
- Preprocessed a very noisy dataset to allow the model to learn with this specific data.
- Benchmarked several machine learning algorithms such as gradient boosting, convolutional, and recurrent neural networks with several architectures.
AI Consultant
Elife Brasil
- Developed a deep learning model using long short-term memory (LSTM) convolutional neural networks (CNN) to classify electrocardiogram (ECG) signals of a patient as normal or a potential anomaly.
- Preprocessed the ECG data using linear and non-linear filters to ease the training phase of the deep learning models.
- Deployed the model as an API to a private cloud computing platform (Hetzner).
- Monitored the model accuracy and time performance.
Software Developer
Stefanini Group
- Maintained and optimized a custom matrix/linear algebra library written in C and C++ to support the development of neural networks.
- Developed a pipeline with document segmentation/categorization, handwritten recognition, and OCR to retrieve textual information from documents and handwritten forms to open banking accounts for one of Brazil's most important banks.
- Developed a distributed system of nodes with specific responsibilities using C# and web services endpoints.
Experience
BuzzMonitor
https://buzzmonitor.com.br/2. A system that performs sentiment analysis using state-of-the-art machine learning algorithms on several social media texts with SpaCy/Gensim. It currently handles thousands of requests per second using a lightweight Flask web service.
3. Module to identify a person (i) age and (ii) gender using social media profile photos applying deep convolutional neural networks with PyTorch.
4. Module for automatic Tagging of social media posts based on previously tagged ones written in Python with scikit-learn.
SafeCardio
NASA Robotic and Engineering Bootcamp
Electroencephalography Classification
Django System for Law Offices
We used Django and PostgreSQL to create both systems.
Molecular Property Prediction System
Education
Ph.D. in Computer Science
Federal University of Pernambuco - Recife, PE, Brazil
Master's Degree in Computer Engineering
University of Pernambuco - Recife, PE, Brazil
Bachelor's Degree in Computer Engineering
University of Pernambuco - Recife, PE, Brazil
Certifications
Microsoft Certified Professional
Microsoft
Certified Scrum Master
Scrum Alliance
Oracle Certified Professional
Oracle
Certified Associate in Project Management (CAPM)
Project Management Institute (PMI)
Microsoft Certified Professional
Microsoft
Skills
Libraries/APIs
PyTorch, NumPy, Pandas, SciPy, OpenCV, TensorFlow, Keras, Scikit-learn, Matplotlib, SpaCy, Natural Language Toolkit (NLTK), Django ORM, PyCaret, JAX
Tools
Git, IPython, Docker Compose, Logging, ChatGPT, Hidden Markov Model, Scikit-image, StatsModels, MATLAB, Apache Airflow, Slack, Vim Text Editor, Terminal, Gensim, Plotly
Languages
Python, C, Java, C#, SQL, C++, XML, C#.NET
Platforms
Docker, Linux, Jupyter Notebook, Azure, Visual Studio Code (VS Code), Amazon, Amazon Web Services (AWS), Kubernetes
Storage
Databases, MySQL, Data Pipelines, JSON, PostgreSQL, SQL Server 2012
Frameworks
Flask, Django
Paradigms
Scrum, REST, Wavelets, ETL
Industry Expertise
Project Management, Healthcare, Social Media
Other
Optimization, Signal Analysis, Artificial Intelligence (AI), Artificial Neural Networks (ANN), Neural Networks, Data Mining, Computer Vision, Pattern Recognition, Deep Learning, Machine Learning, Data Preprocessing, Data Science, Predictive Analytics, Scientific Computing, Document Processing, Text Analytics, Text Processing, Data Visualization, Data, Code Review, Source Code Review, Generative Pre-trained Transformers (GPT), Large Language Models (LLMs), Applied Research, Statistics, Natural Language Processing (NLP), OCR, Containers, Graph Neural Networks, DGL, Scripting, Generative Pre-trained Transformer 3 (GPT-3), OpenAI GPT-3 API, OpenAI, OpenAI GPT-4 API, Fine-tuning, Training, Chatbots, Leadership, Technical Leadership, Probability Theory, Fuzzy Logic, Geometry, Calculus, Physics, Linear Algebra, Discrete Mathematics, Data Structures, Signals, Software Engineering, Circuit Board Design, Electronics, IP Networks, Digital Communication, Genetic Algorithms, Decision Trees, Decision Support Systems, Random Forests, Random Forest Regression, Naive Bayes, Bayesian Inference & Modeling, Evolutionary Algorithms, Convolutional Neural Networks (CNN), Recurrent Neural Networks (RNNs), Deep Neural Networks, Certified ScrumMaster (CSM), Gradient Boosting, Regression Modeling, Clustering, Classification, Text Classification, K-means Clustering, K-nearest Neighbors (KNN), Ensemble Methods, Support Vector Machines (SVM), Support Vector Regression, Normalization, Standardization, Gradient Boosted Trees, Tree Structures, Binary Search Trees, Graphs, Processing & Threading, AdaBoost, Feature Analysis, Metaheuristics, Principal Component Analysis (PCA), Concurrent Computing, Concurrency, Synchronization, Resource Allocation, Information Systems, Image Processing, Information Theory, Microcontrollers, Computer Vision Algorithms, Stochastic Modeling, Markov Model, Hypothesis Testing, Exploratory Data Analysis, Experimental Research, Web Services, Cloud Computing, Long Short-term Memory (LSTM), Sentiment Analysis, Text Mining, Information Retrieval, Search Algorithm Design, Scientific Data Analysis, Computer Science, APIs, Digital Signal Processing, Web Development, EEG, CSV, PMI, Supervised Learning, Unsupervised Learning, Matrix Algebra, Statistical Learning, Signal Processing, Time Series, Simultaneous Localization & Mapping (SLAM), Robot Operating System (ROS), Analytical Geometry, Robotics, DC Motor Drive, Gated Recurrent Unit (GRU), Distributed Systems, Pharmaceuticals, Drug Development, Big Data, Algorithms, Interviewing, Technical Hiring, Team Management
How to Work with Toptal
Toptal matches you directly with global industry experts from our network in hours—not weeks or months.
Share your needs
Choose your talent
Start your risk-free talent trial
Top talent is in high demand.
Start hiring