Jose Luis Moreira Arruda
Verified Expert in Engineering
Data Scientist and Python Developer
São José dos Campos - State of São Paulo, Brazil
Toptal member since April 19, 2023
Jose is a data scientist/ML engineer who's experienced across multiple sectors, including eCommerce, healthcare, and fintech. He is an expert in developing strategic projects and building AI/data products. He easily translates pain points and business goals into tailored AI products and designs. He deploys machine learning and deep learning models for time series, CV, and NLP and integrates them into company systems on the cloud. Jose has led data science projects while mentoring colleagues.
Experience
- Scikit-learn - 7 years
- Python - 7 years
- Statistics - 7 years
- Machine Learning - 7 years
- PyTorch - 5 years
- Pandas - 5 years
- Deep Learning - 5 years
- Recommendation Systems - 3 years
Preferred Environment
Python, Pandas, Linux, PyTorch, Scikit-learn, NumPy, SQL, Artificial Intelligence (AI), Google Cloud Platform (GCP), Azure
The most amazing...
...algorithm I've worked on is a near real-time ML recommendation system to rank fashion eCommerce products.
Work Experience
Senior Data Scientist and ML Engineer
Flinks
- Designed and developed new data products based on stakeholders' requirements and strategic business goals in the payments and open banking field. Designed and built ML systems to mitigate fraud risks in the payment sector.
- Developed systems using OpenAI LLM capabilities to improve transactional data enrichment, ensuring robustness. Designed and deployed scalable ML systems using cloud resources on GCP and AWS.
- Architected a system to improve web scraping automation using ML and computer vision models.
Senior Data Scientist
Farfetch
- Led the data science development of data-driven initiatives toward company strategic goals, such as increasing profitability and user engagement, while collaborating closely with product and engineering teams.
- Delivered multiple data analyses to better understand customers, products, and their relations. Supplied multiple POCs to provide better clarity on commercial and fashion requirements.
- Implemented deep learning models for information retrieval by extracting good representations from product images, product descriptions, user interactions, and other parameters.
- Monitored live systems, fixed bugs, and implemented production-level features in existing systems, which were tested and released to production.
Head of AI | Senior Data Scientist
J!Quant
- Delivered more than five strategic data science projects as a principal contributor, from conception to all phases of development and delivery.
- Drove research roadmaps to deliver state-of-the-art (SOTA) solutions and build AI products. All AI products that I created are based on computer vision, NLP, time-series forecasting, and multi-modal representation learning.
- Built a data science team, which included recruiting, teaching, and mentoring new data scientists. Taught data science to teams in different companies and individual professionals.
- Assessed different companies' data-driven opportunities, making commercial proposals and selling data science projects to big companies.
Research Intern
Werkzeugmaschinenlabor WZL der RWTH Aachen
- Created software to analyze manufacturing data, process signals, derive insights, and develop systems to improve production quality.
- Developed two industrial research consulting projects to improve the quality and efficiency of hobbing and milling processes.
- Built analytical, geometric, and statistical models to simulate industrial processes.
Experience
SenseAI | Predicting and Monitoring Beer Quality for the World's Largest Brewery
Our system also performs simulations, giving brewers real-time visibility and a way to act during the process to improve product quality. It was deployed in the Azure/Databricks environment and rolled out in Brazilian breweries, where it mainly improved beer quality and reduced waste, saving over $1 million per year.
Self-supervised Computer Vision Model for Fashion
Seventh Place in the International Forecast Competition
https://www.kaggle.com/c/m5-forecasting-uncertainty
That year, the competition task was to forecast the daily demand distribution, expressed as quantiles, for each Walmart product and three Walmart stores. This amounted to over 40,000 time series.
Because we wanted a solution general and practical enough to turn into a product, we developed a single end-to-end deep learning model based on recent advances in transformers for time-series forecasting, instead of relying on multiple ensembles or other approaches that are poorly suited to production.
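For readers unfamiliar with quantile forecasts, the sketch below shows pinball (quantile) loss, a common way to score a forecast of a single quantile. It is an illustration only, not the competition's exact metric or the model described above; the function name `pinball_loss` and the sample numbers are assumptions made for this example.

```python
def pinball_loss(y_true, y_pred, quantile):
    """Average pinball (quantile) loss for one quantile level in (0, 1)."""
    if not 0.0 < quantile < 1.0:
        raise ValueError("quantile must be strictly between 0 and 1")
    if len(y_true) != len(y_pred):
        raise ValueError("y_true and y_pred must have the same length")
    if not y_true:
        raise ValueError("inputs must not be empty")
    total = 0.0
    for actual, forecast in zip(y_true, y_pred):
        diff = actual - forecast
        # Under-forecasts are weighted by q, over-forecasts by (1 - q).
        total += quantile * diff if diff >= 0 else (quantile - 1.0) * diff
    return total / len(y_true)


# Hypothetical daily unit sales and a constant forecast of the 0.9 quantile.
sales = [0, 2, 1, 5, 3]
forecast_q90 = [4, 4, 4, 4, 4]
loss = pinball_loss(sales, forecast_q90, 0.9)
```

A high quantile such as 0.9 penalizes under-forecasting nine times more heavily than over-forecasting, which is why the loss pushes the forecast toward the upper end of the demand distribution.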
Personalized Ranking System for Two-sided Marketplace
AI-powered Gift Recommendations
Education
Bachelor's Degree in Mechanical Engineering
Aeronautics Institute of Technology - São José dos Campos, São Paulo, Brazil
Progress Toward Master's Degree in Deep Learning and Machine Learning
RWTH Aachen University - Aachen, Germany
Certifications
Deep Learning Specialization
DeepLearning.AI | via Coursera
Skills
Libraries/APIs
Pandas, PyTorch, Scikit-learn, Keras, OpenAI API, NumPy, TensorFlow, PySpark
Tools
Jupyter, GitHub, Microsoft Power BI, Azure OpenAI Service, MATLAB
Languages
Python, SQL, C
Paradigms
Rapid Prototyping
Platforms
Azure, Amazon Web Services (AWS), Vertex AI, Docker, Linux, Google Cloud Platform (GCP), Ollama, Kubernetes, Databricks
Storage
PostgreSQL, Google Cloud, Graph Databases, Document Databases, NoSQL, Neo4j
Frameworks
Spark, Django
Other
Deep Learning, Computer Vision, Natural Language Processing (NLP), Machine Learning, Recommendation Systems, Convolutional Neural Networks (CNNs), Explainable Artificial Intelligence (XAI), Artificial Intelligence (AI), Data Scientist, Recurrent Neural Networks (RNNs), eCommerce, Time Series, Data Science, Transformer Models, Time Series Analysis, Data Visualization, Data Analytics, Data Interpretation, Data Analysis, Forecasting, Large Language Models (LLMs), Supervised Learning, Generative Artificial Intelligence (GenAI), Statistical Analysis, Regression Modeling, Data Modeling, Algorithms, DBSCAN, Hierarchical Clustering, Clustering Algorithms, Clustering, K-means Clustering, Unstructured Data Analysis, OpenAI GPT-4 API, A/B Testing, Architecture, Demand Planning, AI Consulting, Machine Learning Operations (MLOps), Optimization Algorithms, AI-enabled Search, Leadership, Cloud, Vector Databases, OpenAI, icr, APIs, Statistics, Sequence Models, Neural Networks, Self-supervised Learning, Generative Pre-trained Transformers (GPT), Optical Character Recognition (OCR), Data Engineering, Supply Chain Management (SCM), Inventory Management, Data Build Tool (dbt), Reinforcement Learning, Open-source LLMs, LangChain, Gemini, Prompt Engineering, Retrieval-augmented Generation (RAG), Llama, Large Language Model Operations (LLMOps), Logistics, Anthropic, Web Scraping, Geospatial Data, GitOps, Big Data, Supply Chain, Programming, Simulations, QA Testing, Fourier Analysis, path optimization