
Felipe Batista
Verified Expert in Engineering
Machine Learning Engineer and Full-stack Developer
Belo Horizonte - State of Minas Gerais, Brazil
Toptal member since October 17, 2018
Felipe has over 10 years of experience in machine learning and full-stack software development. He is currently focused on leveraging cutting-edge generative AI to build incredible full-stack AI products, ranging from state-of-the-art retrieval-augmented generation (RAG) systems to complex image generation pipelines based on Stable Diffusion. Felipe is proficient with a variety of relevant libraries and technologies, including LlamaIndex, LangChain, PyTorch, Diffusers, and DeepSeek.
Portfolio
Experience
- Machine Learning - 10 years
- Data Science - 10 years
- Artificial Intelligence (AI) - 10 years
- Generative Artificial Intelligence (GenAI) - 5 years
- DeepSeek - 1 year
- Retrieval-augmented Generation (RAG) - 1 year
- ControlNet - 1 year
- ChatGPT - 1 year
Availability
Preferred Environment
GitHub, PyCharm, Linux, MacOS
The most amazing...
...project I've implemented was a GenAI SaaS app that was able to serve 1+ million users, leveraging complex user behavior analytics to improve the core offering.
Work Experience
Founder and Software Engineer
Totem AI
- Developed and scaled an image generation service to 1+ million users and built a sophisticated image evaluation pipeline to improve renders, reduce generation time, and provide high-quality control.
- Built generative AI models for text and image generation, utilizing technologies like Stable Diffusion, GPT (and similar models), retrieval augmented generation (RAG), and diffusion models.
- Gained proficiency in analyzing customer behavior data within the SaaS domain to identify key churn drivers, utilizing advanced analytics and user segmentation techniques. Developed machine learning models to predict churn and identify risky behavior.
- Developed comprehensive business intelligence solutions, focusing on data-driven dashboards and financial modeling for a mid-size agriculture conglomerate with over 60 million in revenue.
- Developed image classification pipelines using convolutional neural networks (CNNs), data augmentation, and transfer learning for real-time object detection, face classification/recognition, and semantic segmentation.
- Architected digital signal processing pipelines for healthcare (freezing of gait detection for Parkinson's disease patients using sensor data).
- Developed real-time machine learning models for fantasy football, including analysis of optimal lineup selections.
- Implemented scientific papers containing state-of-the-art research related to computer vision, digital signal processing, time-series modeling for financial markets, and NLP for data enrichment.
Founder and CTO
AI Interior Design App
- Developed and scaled an image generation service to 1+ million users and built a sophisticated image evaluation pipeline to improve renders, reduce generation time, and provide high-quality control.
- Gained proficiency in analyzing customer behavior data within the SaaS domain to identify key churn drivers, utilizing advanced analytics and user segmentation techniques. Developed machine learning models to predict churn and identify risky behavior.
- Implemented a full-stack application to serve our state-of-the-art fine-tuned diffusion model, which resulted in the creation of 2+ million images.
Machine Learning Engineer
United Nations (via Toptal & Benetech)
- Architected and implemented a video processing pipeline to deduplicate sensitive videos at scale using ad-hoc deep learning models to generate video signatures using TensorFlow and PyTorch pipelines.
- Built a video augmentation pipeline to validate and test deduplication models using OpenCV, MoviePy, and FFmpeg. I implemented several evaluation routines—both visual and quantitative—to evaluate model results.
- Developed multimodal search capabilities that allowed stakeholders to visualize and explore large video datasets in seconds.
- Architected a local-first content management system that made use of fine-tuned video embeddings for analytics, private collaboration, and exploration.
Data Scientist and Full-stack Software Engineer
ShopYak
- Developed a neural network and Q-learning (reinforcement learning) to perform automated A/B testing on different website layouts for eCommerce stores. The objective was to optimize conversions by adjusting the layout, fonts, and colors for each store.
- Designed and developed portions of the front end using AngularJS and SCSS. Set up the build process using Grunt.
- Implemented several PowerBI dashboards to keep track of KPIs and general user behavior (using both DB and Google Analytics integrations).
- Developed a significant portion of the back end using Node.js, Express, and PostgreSQL.
- Integrated Stripe (regular and connect) both in the front and back end.
- Deployed on AWS with HTTPS, Amazon CloudFront, Amazon S3, and Elastic Load Balancing (ELB).
Experience
Custom ChatGPT Application for a Large Construction Company
WORK DONE
• Enabled advanced features such as message streaming, file attachments, and code interpreter usage.
• Architected dynamic vector store selection and implemented an optimized server wrapper using FastAPI and Uvicorn to handle requests at scale.
• Built robust error-handling mechanisms and created comprehensive technical roadmaps for future developments, including dynamic chart plotting, advanced spreadsheet support, and innovative strategies to overcome OpenAI's platform limitations.
• Assisted the development team with several improvements related to the front end, back end, and DevOps.
Full-stack Compliance Application Powered by an LLM Information Extraction Pipeline
A large-scale evaluation pipeline was implemented, which included acquiring data from multiple sources, tracking experiments, and rapidly benchmarking information extraction using LangChain and Instructor.
I developed cost-effective solutions that employed reasoning and function calling to utilize more affordable models for simpler documents while reserving more expensive models for longer and more complex documents.
A full-stack application was developed using Next.js and PostgreSQL and designs were provided by the client.
State of the Art Chat with Documents Application
Large-scale Deep Learning-based Video Analytics Pipeline for a United Nations Project
DSP Using Multiple Deep Learning Architectures (CNNs, LSTM, GRU)
Over the course of the project, I:
1. Reviewed several papers containing the state-of-the-art methods for NILM.
2. Optimized available models to perform an initial POC.
3. Explored different model architectures on a specific setting defined by the client, including:
a. Parallel CNNs
b. Parallel CNNs with LSTMs
c. CNNs with LSTM (bidirectional)
d. CNN with GRU
Models were optimized, and ultimately, the best model was chosen based on the results of a cross-validation routine.
Deliverables were both Jupyter Notebooks and Python Scripts.
Image Classification Pipeline with TensorFlow/Keras
Tested several different model architectures (including multi-input models with both images and bounding boxes).
Settled on a fine-tuned VGG16 network.
The pipeline included data augmentation, cross-validation, visualization of accuracy and loss across epochs (and sample results).
DSP/DL for Freezing of Gait Detection (with POC Mobile App)
Replicated state-of-the-art medical, scientific papers using Python, scikit-learn, TensorFlow, Keras, and Jupyter to obtain well-defined baselines.
After in-depth research of DSP techniques for anomaly detection, I implemented a few deep learning model architectures that had never before been applied to this specific domain.
Performed cross-validation to assess model performance, focusing on the model's generalization potential. The model was trained using data from 80% of the patients and tested on the data from the remaining patients.
I used several models in this project, including CNNs, deep ConvLSTM, and parallel CNNs with LSTM.
In order to test the model, I implemented a simple Android application that used the trained model to make a live Freezing of Gait inference based on the phone's accelerometer data. The model was converted to CoreML for future iOS app development.
Other relevant work includes handling class imbalance by adjusting the loss function of the deep learning models, hyperparameter optimization using GridSearch, and simulation of the effect of different window sizes on model performance.
Education
Bachelor of Science Degree in Economics with a Focus on Econometrics and Computational Methods
UFMG - Belo Horizonte, Brazil
Certifications
AWS Certified Solutions Architect Associate
AWS
Deep Learning Specialization
Coursera
Image and Video Processing
Coursera
Skills
Libraries/APIs
NumPy, Pandas, Keras, TensorFlow, Scikit-learn, OpenCV, PyTorch, Node.js, Amazon Rekognition, SpaCy, OpenAI Assistants API
Tools
ChatGPT, Microsoft Excel, DeepSeek, Amazon SageMaker, Git, Microsoft Power BI, PyCharm, GitHub, Grunt, Azure ML Studio, Azure OpenAI Service, AI Prompts
Languages
Python, JavaScript, SCSS, R, C++
Frameworks
LlamaIndex, Express.js, Flask, Angular, Tailwind CSS, Next.js
Platforms
Jupyter Notebook, Amazon Web Services (AWS), CrewAI, AWS Lambda, Docker, Google Cloud Platform (GCP), MacOS, Linux, PostHog, Azure
Paradigms
Agile Software Development, REST
Storage
Google Cloud, MongoDB, PostgreSQL, Redis
Other
Machine Learning, Computer Vision, Deep Learning, Artificial Intelligence (AI), Data Science, Stable Diffusion, ControlNet, Retrieval-augmented Generation (RAG), Image Generation, Replicate, Team Leadership, Generative Artificial Intelligence (GenAI), Chatbots, OpenAI GPT-4 API, OpenAI GPT-3 API, LangChain, Large Language Models (LLMs), Full-stack Development, Machine Learning Operations (MLOps), Data Analytics, Prompt Engineering, Multi-agent Systems, Analytics, Natural Language Processing (NLP), Econometrics, Computational Finance, Statistics, Generative Pre-trained Transformers (GPT), OAuth, OpenAI, Pinecone, Reinforcement Learning from Human Feedback (RLHF), Deep Neural Networks (DNNs), Supabase, LLaVA, ChatGPT Prompts, CTO, Unstructured Data Analysis
How to Work with Toptal
Toptal matches you directly with global industry experts from our network in hours—not weeks or months.
Share your needs
Choose your talent
Start your risk-free talent trial
Top talent is in high demand.
Start hiring