Alexandre Ray da Silva, Developer in São Paulo - State of São Paulo, Brazil
Alexandre is available for hire
Hire Alexandre

Alexandre Ray da Silva

Artificial Intelligence Engineer and Developer

São Paulo - State of São Paulo, Brazil

Toptal member since June 1, 2026

Bio

Alexandre is a hands-on staff AI engineer with 11+ years of experience building production-grade AI and GenAI systems. He specializes in end-to-end AI development, including LLM applications (RAG, agents), ML pipelines, recommendation systems, and MLOps. Alexandre owns the full lifecycle from architecture to production, delivering scalable systems using Python, PyTorch, FastAPI, and cloud platforms. He focuses on solving complex, high-impact real-world problems.

Portfolio

OncoAI
Angular, Django, Python, Artificial Intelligence (AI), Machine Learning...
IpsilonAI
Apache Airflow, Artificial Intelligence (AI), Machine Learning...
Creditas
Ruby, Python, Cloud, PostgreSQL, Machine Learning...

Experience

  • Python - 10 years
  • Artificial Intelligence (AI) - 8 years
  • Scikit-learn - 8 years
  • Machine Learning Operations (MLOps) - 8 years
  • SQL - 8 years
  • Machine Learning - 8 years
  • AI Agents - 3 years
  • RAG Systems - 3 years

Preferred Environment

MacBook, Slack, Google Meet, React

The most amazing...

...thing i've built is OncoAI, a platform designed to help oncologists improve patient outcomes through AI-driven insights.

Work Experience

Co-founder & CTO

2023 - PRESENT
OncoAI
  • Designed recurrence prediction models using structured clinical datasets for patient outcome forecasting.
  • Developed AI-assisted treatment recommendation systems integrating retrieval pipelines and LLM-based reasoning.
  • Architected back-end APIs and data services supporting clinical intelligence workflows.
  • Built cloud-native deployment infrastructure for scalable healthcare AI applications.
  • Led full-stack product development from system architecture through implementation and deployment.
  • Collaborated with healthcare stakeholders to translate clinical requirements into production-ready technical solutions.
Technologies: Angular, Django, Python, Artificial Intelligence (AI), Machine Learning, Machine Learning Operations (MLOps), Healthtech, Flask, Agentic AI, Large Language Models (LLMs), React, Neo4j, AI Architecture, Solution Architecture, Product Development, Amazon Web Services (AWS), Prompt Engineering, ChatGPT, Claude Code, Replit, Agentic Frameworks, Architecture, LangSmith, Monitoring, Telemetry, Industrials, Cursor AI, Pricing, OpenCV, Pricing Models, Natural Language Processing (NLP), eCommerce, Deep Learning, Amazon Bedrock, AI Programming, OWL, Ontologies, Knowledge Graphs, LLM Reasoning, Large Language Model Operations (LLMOps), Machine Learning (ML) APIs, AI Agent Orchestration, Semantic Code, Semantic Search, Google Cloud Platform (GCP), Databricks, PySpark, Time Series Forecasting, life-sciences, Pandas, Bayesian Statistics, NumPy, Agentic RAG Systems, Claude, Anthropic, Convolutional Neural Networks (CNNs), Graph Neural Networks (GNNs), Pattern Recognition, Model Evaluation, Classification, Codex, Fine-tuning, Meta Llama, Observability, Quantization, vLLM, Trading

Staff AI/ML Engineer

2022 - PRESENT
IpsilonAI
  • Designed and implemented production-grade retrieval-augmented generation (RAG) systems for enterprise log intelligence, reducing incident investigation time through automated contextual analysis.
  • Built recommendation engines for store-level product optimization using collaborative filtering and ranking models, improving inventory decision support.
  • Developed speech intelligence pipelines for automated English proficiency scoring using audio embeddings and deep learning inference systems.
  • Architected distributed ML workflows using Apache Airflow and Azure Databricks for scalable model training and deployment.
  • Optimized enterprise search infrastructure using vector retrieval systems and semantic indexing with OpenSearch.
  • Delivered full production lifecycle ownership, including architecture, implementation, deployment, observability, and optimization.
Technologies: Apache Airflow, Artificial Intelligence (AI), Machine Learning, Machine Learning Operations (MLOps), MLflow, Databricks, Azure, Data Architecture, RAG Systems, AI Agents, OpenAI, Retrieval-augmented Generation (RAG), RAG Architecture, Agentic AI, LangChain, RAG Pipelines, Generative Artificial Intelligence (GenAI), Large Language Models (LLMs), GraphRAG, AI Architecture, Data Pipelines, Solution Architecture, Product Development, Amazon Web Services (AWS), API Integration, Prompt Engineering, Vercel, ChatGPT, Claude Code, Microsoft Copilot, Microsoft Copilot Studio, Replit, Agentic Frameworks, Architecture, LangSmith, Agentic Coding, Monitoring, Industrials, Cursor AI, Data Labeling, Pricing, OpenCV, Pricing Models, Natural Language Processing (NLP), eCommerce, Photoroom, Deep Learning, AI Design, Data Science, AI Programming, Azure AI Studio, OWL, Ontologies, Knowledge Graphs, LLM Reasoning, Graph Databases, Large Language Model Operations (LLMOps), Machine Learning (ML) APIs, Energy, Energy Efficiency, Energy Management, Energy Modeling, AI Agent Orchestration, Semantic Code, Semantic Search, Google Cloud Platform (GCP), PySpark, Time Series Forecasting, 3D Pose Estimation, Motion Capture, Pose Estimation, Eye Tracking, HIPAA, Health, Healthcare, eye-tracking, Biomechanics, life-sciences, Optical Character Recognition (OCR), Linear Regression, Pandas, Bayesian Statistics, Statistics, Streamlit, Datadog, NumPy, Vector Databases, Agentic RAG Systems, Claude, Anthropic, Convolutional Neural Networks (CNNs), Graph Neural Networks (GNNs), Pattern Recognition, Vehicle Tracking Systems, Vehicle Routing, Marketing, Classification, GIS, GeoPandas, Codex, Data Security, Fine-tuning, Meta Llama, Vector Search, vLLM, Trading, Kubeflow, Snowflake, AWS Bedrock AgentCore

Senior ML Engineer

2017 - 2021
Creditas
  • Built credit risk scoring models that automated approximately 90% of operational credit desk workflows.
  • Developed predictive systems for lead scoring, customer churn forecasting, and pricing optimization.
  • Designed feature engineering pipelines for large-scale financial datasets.
  • Implemented model monitoring and validation workflows to ensure production reliability.
  • Collaborated with cross-functional business teams to operationalize predictive decision systems.
  • Improved portfolio quality through statistically robust risk modeling.
Technologies: Ruby, Python, Cloud, PostgreSQL, Machine Learning, Machine Learning Operations (MLOps), MLflow, Scikit-learn, Docker, FastAPI, Retrieval-augmented Generation (RAG), RAG Architecture, Agentic AI, LangChain, RAG Pipelines, Azure Kubernetes Service (AKS), Generative Artificial Intelligence (GenAI), Large Language Models (LLMs), GraphRAG, Data Pipelines, Product Development, Amazon Web Services (AWS), API Integration, Prompt Engineering, Vercel, ChatGPT, Replit, Agentic Frameworks, Architecture, LangSmith, Agentic Coding, SDKs, Data Labeling, Pricing, OpenCV, Pricing Models, Natural Language Processing (NLP), eCommerce, Deep Learning, Azure, AI Design, Data Science, AI Programming, Azure AI Studio, OWL, Ontologies, Knowledge Graphs, LLM Reasoning, Graph Databases, Large Language Model Operations (LLMOps), Machine Learning (ML) APIs, AI Agent Orchestration, Semantic Code, Semantic Search, Google Cloud Platform (GCP), PySpark, Time Series Forecasting, Motion Capture, You Only Look Once (YOLO), Health, Healthcare, Optical Character Recognition (OCR), Linear Regression, Pandas, Bayesian Statistics, Pytest, Statistics, Streamlit, Datadog, NumPy, Agentic RAG Systems, Claude, Anthropic, Convolutional Neural Networks (CNNs), Graph Neural Networks (GNNs), Pattern Recognition, Classification, Codex, Fine-tuning, Meta Llama, Observability, Algorithms, Trading, Kubeflow

Software Engineer

2016 - 2016
Service IT
  • Engineered back-end services supporting large-scale digital media and enterprise web platforms.
  • Led migration of high-traffic legacy websites to modern scalable architectures, including WordPress VIP environments.
  • Developed RESTful APIs and database integrations to support content management workflows.
  • Improved platform scalability and reliability through back-end performance optimization and infrastructure refactoring.
  • Collaborated with front-end teams to integrate back-end services into responsive client-facing applications.
  • Implemented database query optimizations that improved response time and system efficiency.
Technologies: MySQL, Git, PHP, JavaScript, OOP Designs, Ruby, SQL, LangChain, Pydantic, Asyncio, Azure Kubernetes Service (AKS), Vector Databases, React, Data Pipelines, Product Development, Amazon Web Services (AWS), API Integration, Vercel, SDKs, Kubernetes, MongoDB, eCommerce, Data Science, Java, RDF, Graph Databases, Redshift, Google Cloud Platform (GCP), Health, Healthcare, Linear Regression, Pandas, Pytest, Statistics, Streamlit, Datadog, Marketing, Data Security, Vector Search, Algorithms, Rust, AWS Bedrock AgentCore

Software Engineer

2015 - 2015
BTG Pactual
  • Developed internal back-end modules supporting fixed-income front-office and back-office operational workflows.
  • Built SQL-based automation scripts for financial data processing and reconciliation tasks.
  • Assisted in developing internal tools that improved operational efficiency for financial reporting pipelines.
  • Implemented database procedures and back-end logic for transactional processing systems.
  • Supported performance tuning of Microsoft SQL Server queries and data workflows.
  • Collaborated with senior engineers to maintain the reliability and accuracy of financial operations infrastructure.
Technologies: C#, SQL Server, SQL, Oracle, Pydantic, Asyncio, React, Product Development, API Integration, SDKs, Kubernetes, MongoDB, eCommerce, RDF, Pytest, Marketing, Observability, Rust

Experience

Legal Engine

https://github.com/alexandrerays/legal-engine
A RAG-based system to answer questions over U.S. Supreme Court opinions with cited sources. The tech stack was Python, LlamaIndex, Claude Code, FAISS, OpenAI GPT-4o-mini, chunking, embeddings, retrieval, and generation.

Medical RAG

https://github.com/alexandrerays/medical-rag
Focused on medical knowledge retrieval and grounded clinical-style responses. The tech stack was Python, FastAPI, MCP Server, Gradio, Supabase, pgvector, Claude Opus 4.6 via AWS Bedrock, embeddings, vector search, and LLM-based generation.

Stock Agent

https://github.com/alexandrerays/stock-agent
An AI agent designed to analyze stock market data, reason over financial information, and support investment research workflows. Tech stack was: Python, AWS Cognito, FastAPI, Terraform, AWS AgentCore, LangGraph, LLMs, APIs, and financial data tools

Speech Flow AI

https://github.com/alexandrerays/speech-flow-ai
A web app for audio processing and speech-to-text transcription, demonstrating applied AI for voice and language workflows. Tech Stack was: TypeScript, Next.js, FastAPI, Whisper AI, Node.js, OpenAI, speech-to-text, and audio processing.

Education

2021 - 2026

Master's Degree in Computer Engineering

University of São Paulo - São Paulo, Brazil

2010 - 2015

Bachelor's Degree in Computational Physics

University of São Paulo - São Carlos, Brazil

Skills

Libraries/APIs

Scikit-learn, Pydantic, Asyncio, React, OpenCV, PySpark, Pandas, NumPy, vLLM, Node.js

Tools

Git, ChatGPT, You Only Look Once (YOLO), Pytest, Claude, Slack, Google Meet, Azure Kubernetes Service (AKS), GraphRAG, Claude Code, Codex, Apache Airflow, Terraform, Whisper, Microsoft Copilot, GIS

Languages

SQL, Python, OWL, JavaScript, Ruby, TypeScript, RDF, C#, PHP, Java, Rust, Snowflake

Frameworks

Agentic Frameworks, Angular, Django, Flask, LangGraph, Streamlit, LlamaIndex, Next.js

Platforms

Docker, Databricks, Amazon Web Services (AWS), LangSmith, Google Cloud Platform (GCP), Azure, Vercel, Replit, Kubernetes, Oracle, Microsoft Copilot Studio, Azure AI Studio, Kubeflow

Storage

PostgreSQL, Data Pipelines, MongoDB, Graph Databases, Neo4j, Datadog, MySQL, Redshift

Industry Expertise

Healthcare, Marketing

Paradigms

Model Context Protocol (MCP)

Other

Physics, Computer Science, Computer Vision, Object Detection, OOP Designs, Machine Learning, Machine Learning Operations (MLOps), MLflow, FastAPI, Artificial Intelligence (AI), LLM Integration, MacBook, Retrieval-augmented Generation (RAG), Agentic AI, LangChain, RAG Pipelines, Large Language Models (LLMs), AI Architecture, Product Development, API Integration, Prompt Engineering, Architecture, SDKs, Cursor AI, Pricing, Pricing Models, Natural Language Processing (NLP), eCommerce, Deep Learning, AI Design, Data Science, AI Programming, Ontologies, Knowledge Graphs, LLM Reasoning, Large Language Model Operations (LLMOps), Machine Learning (ML) APIs, AI Agent Orchestration, Semantic Code, Time Series Forecasting, 3D Pose Estimation, Motion Capture, Pose Estimation, Health, eye-tracking, Optical Character Recognition (OCR), Linear Regression, Bayesian Statistics, Statistics, Agentic RAG Systems, Anthropic, Convolutional Neural Networks (CNNs), Graph Neural Networks (GNNs), Pattern Recognition, Classification, Fine-tuning, Meta Llama, Observability, Algorithms, Trading, Cloud, RAG Systems, AI Agents, OpenAI, Healthtech, FAISS, Supabase, AWS Bedrock AgentCore, RAG Architecture, Vector Databases, Generative Artificial Intelligence (GenAI), Solution Architecture, Agentic Coding, Industrials, Data Labeling, Semantic Search, Eye Tracking, HIPAA, Model Evaluation, Vector Search, SQL Server, Data Architecture, Monitoring, Telemetry, Photoroom, Amazon Bedrock, Energy, Energy Efficiency, Energy Management, Energy Modeling, Biomechanics, life-sciences, Vehicle Tracking Systems, Vehicle Routing, GeoPandas, Data Security, Quantization

Collaboration That Works

How to Work with Toptal

Toptal matches you directly with global industry experts from our network in hours—not weeks or months.

1

Share your needs

Discuss your requirements and refine your scope in a call with a Toptal domain expert.
2

Choose your talent

Get a short list of expertly matched talent within 24 hours to review, interview, and choose from.
3

Start your risk-free talent trial

Work with your chosen talent on a trial basis for up to two weeks. Pay only if you decide to hire them.

Top talent is in high demand.

Start hiring