
Bruno Barbosa Miranda
Verified Expert in Engineering
Data Scientist and Python Developer
Belo Horizonte - State of Minas Gerais, Brazil
Toptal member since September 30, 2021
Bruno trained as a medical doctor and was the first to earn a master's degree in computer science (CS) at one of the top universities in his country without prior formal CS education. He is currently pursuing a Ph.D. in CS at the same university (UFMG). Bruno considers himself a highly motivated and passionate professional who loves his work and constantly learns new things.
Experience
- Machine Learning - 10 years
- Python 3 - 8 years
- Scikit-learn - 6 years
- Deep Reinforcement Learning - 5 years
- Neural Networks - 5 years
- TensorFlow - 5 years
- Amazon Web Services (AWS) - 3 years
- LightGBM - 2 years
Preferred Environment
Spyder, Windows 10, MacOS, Anaconda, TensorFlow, Scikit-learn, NumPy, Python 3, Amazon Web Services (AWS), LightGBM
The most amazing...
...result for me was when I implemented my own custom deep reinforcement learning algorithm, which learned to play video games independently.
Work Experience
Senior ML Engineer
Shop For A Better World
- Classified a database of over 180,000 businesses into multiple categories to generate metadata.
- Configured Azure Cognitive Search for the client's search engine.
- Created a web scraping system that automatically updates database records over time while deduplicating instances obtained from different sources and consolidating the results.
AI/ML Engineer
Pixelcut Inc.
- Deployed a computer vision neural network to estimate soft shadows based on cutout masks, with improvements on top of the reference paper.
- Implemented multi-GPU training with PyTorch-compiled code and multi-node disk access, training with over one billion synthetic image samples that I generated myself.
- Produced code for the deployment of the model into the client's app using FastAPI.
- Aided the development of a cutout masking model using synthetic data generation.
Data Science Specialist Consultant
Faculdade Unimed
- Reduced the computation time of a critical software component from three days to less than five minutes.
- Helped build the entire Snowflake architecture from the ground up to host a software service for hospital stay analysis, similar to HRG and DRG.
- Built core functionalities to process the entire hospital stay analysis algorithm using Python and Snowpark.
Machine Learning Developer
Arthur Haliski De Andrade
- Deployed a genetic algorithm based on genetic programming that learns to generate custom technical variables for trading. The whole algorithm runs on GPU using RAPIDS.
- Delivered a mixed reinforcement learning algorithm, with neural networks written in PyTorch, that learns to trade financial markets.
- Managed a team of three programmers while producing complex trading algorithms myself.
Senior Data Scientist
Microsoft
- Developed an email signature expansion method based on optical character recognition (OCR).
- Worked on clustering and classifying malicious emails.
- Resolved many new bugs on Microsoft services, tracking them with data science and other analysis tools.
NLP Expert
Prepaire Labs Limited
- Developed a drug recommender system based on a knowledge graph of interactions between drugs, genes, proteins, and diseases.
- Built a drug embedding system that can cluster drug classes without labeled data.
- Built a large-scale interaction network to predict molecule interactions between different entities.
Data Analyst | Statistician
Product Tranquility LLC
- Helped the client clean inconsistent data points in their survey data.
- Combined machine learning and data science to generate survey insights.
- Reproduced classical survey analysis techniques to analyze pricing.
Data Scientist
Alaris Acquisitions, LLC
- Delivered an app to estimate the similarity between buying and selling institutions with a front-end presentation using Streamlit.
- Deployed online file synchronization for all remote users of our app using Amazon S3.
- Created a customizable interface to input and change system variables to make the app as flexible as possible while maintaining consistency and scalability.
Senior Data Scientist and ML Engineer
Toptal Client
- Developed and successfully deployed a custom recommender system for one of the client's ventures.
- Worked on automatic metadata generation from users' images using transfer learning.
- Developed a churn detection model integrated with customized emails for the CS team.
- Worked on a semantic search algorithm based on item and user embeddings.
- Developed Power BI dashboards to show relevant data to clients.
- Integrated our solutions with existing architectures using MLflow and email APIs.
- Developed a lead scoring algorithm based on the positive-unlabeled problem framework.
Senior Data Scientist
Tecnium
- Developed the best unique product identification routine available at the time.
- Taught teammates about neural network approaches, such as autoencoders, RankNet, transformers, and embedding spaces.
- Created a product name embedding, using a transformer neural network, usable in multiple problems.
Senior Data Scientist
Unimed
- Developed data-oriented models using NLP, machine learning, and transformers.
- Helped transform my current sector into a data science-oriented enterprise.
- Built useful metrics and scoring systems using embeddings and neural networks.
- Created and deployed a web crawler to generate healthcare-related disease code datasets.
- Proposed, implemented, and deployed an algorithm for future hospitalization prediction and a semantic representation for patient disease data.
- Reached the final three of the company's innovation prize in every year I participated, winning the 2021 prize.
- Used time-series algorithms to predict future healthcare requirements. I used multiple neural network architectures, such as variational autoencoders, residual neural networks, and vision transformers to obtain information from medical images.
Teacher
University of Medical Science
- Developed a digital teaching system using online technologies.
- Taught students about medical research using machine learning and data science methods.
- Oversaw research projects and courses focused on recent machine learning and general technology trends.
Experience
Project Cindy
In this project, I tested many different reinforcement learning technologies, including:
• Deep Q-learning agent (DQN)
• Asynchronous advantage actor-critic agent (A3C)
• Synchronous advantage actor-critic agent (A2C)
• Proximal policy optimization (PPO)
• Convolutional Neural Networks (CNN)
Instagram Bot
Brain CT Image Alteration Detection
Automatic Labeling with Reinforcement Learning
Electrocardiogram (ECG) Automatic Classification
https://www.kaggle.com/competitions/dcc-week-challenge-2023/overview
Education
Ph.D. Degree in Computer Science
Federal University of Minas Gerais - Belo Horizonte
Master's Degree in Computer Science
Federal University of Minas Gerais - Belo Horizonte
Master of Business Administration in Business Administration
Dom Cabral Foundation - Belo Horizonte
Bachelor's Degree in Medicine
Federal University of Minas Gerais - Belo Horizonte
Certifications
AWS Academy Graduate - AWS Academy Cloud Foundations
Amazon Web Services Training and Certification
Cambridge Advanced English Certificate
Cambridge Assessment English
Skills
Libraries/APIs
TensorFlow, Scikit-learn, NumPy, Pandas, PyTorch, PySpark, LSTM, Natural Language Toolkit (NLTK), Spark ML, Amazon API
Tools
Spyder, Atlassian, Jira, Microsoft Excel, Amazon SageMaker, Named-entity Recognition (NER), Azure Machine Learning, ARIMA, Microsoft Power BI, AWS CLI, AWS IAM
Languages
Python 3, Python, SQL, C++, Bash, MQL5, C#, Snowflake, TypeScript
Paradigms
ETL, Business Intelligence (BI), Interoperability, Quantitative Research
Platforms
Anaconda, Jupyter Notebook, Amazon Web Services (AWS), Databricks, Azure, NVIDIA CUDA, MacOS, Docker, Amazon EC2, AWS Lambda
Frameworks
LightGBM, Selenium, Presto, Apache Spark, Streamlit, Spark
Storage
MySQL, MySQLi, JSON, NoSQL, Google Cloud, Google Cloud SQL, Azure Cloud Services, Database Management
Industry Expertise
Bioinformatics, Trading Systems
Other
Business Administration, Innovation, Machine Learning, Neural Networks, Deep Neural Networks (DNNs), Deep Reinforcement Learning, Transformers, Natural Language Processing (NLP), Reinforcement Learning, English, Data Science, Autoencoders, Medical Imaging, Algorithms, Artificial Intelligence (AI), Artificial Neural Networks (ANN), Data Analysis, Analytics, Database Analytics, Data Engineering, Predictive Modeling, Predictive Analytics, Data Mining, Data Modeling, Data Reporting, ETL Tools, Text Mining, Task Analysis, Statistical Modeling, Optical Character Recognition (OCR), Graphics Processing Unit (GPU), Oncology & Cancer Treatment, Generative Pre-trained Transformers (GPT), Sentiment Analysis, Financial Modeling, Regression, Models, Communication, Large Language Models (LLMs), Research, Microsoft Azure, Few-shot Learning, Web Crawlers, Deep Learning, Classification Algorithms, Software Engineering, Computer Vision, Vision Transformer (ViT), Recurrent Neural Networks (RNNs), Long Short-term Memory (LSTM), Web Scraping, Residual Neural Networks (ResNets), Machine Learning Operations (MLOps), Data Visualization, Statistical Data Analysis, Data Analytics, Excel 365, Image Processing, Cloud, Source Code Review, Code Review, Backtesting Trading Strategies, Artificial General Intelligence (AGI), Generative Adversarial Networks (GANs), Leadership, Retrieval-augmented Generation (RAG), Windows 10, BERT, Learning, Rankings, Dedupe.io, Convolutional Neural Networks (CNNs), Meta-learning, 3D Images, Images, Health, Medical Software, SVMs, Support Vector Machines (SVM), Variational Autoencoders, Data Transformation, LSTM Networks, Big Data, Recommendation Systems, B2C Marketing, Analysis, Statistics, MLflow, Finance, Technical Hiring, Interviewing, Linear Regression, Clustering, Surveys, Survey Development & Analysis, Biotechnology, Genetic Algorithms, Mathematics, Algorithmic Trading, GPU Computing, Google Cloud ML, FastAPI, Image Generation, Cognitive Search, Classification, VM, Deduplication, Molecular Docking