Antonio Artur de Holanda e Ayres de Moura, Developer in Fortaleza - State of Ceará, Brazil

Antonio Artur de Holanda e Ayres de Moura

Verified Expert in Engineering

Data Scientist and AI Developer

Fortaleza - State of Ceará, Brazil

Toptal member since October 28, 2022

Bio

Antonio is a data science and AI developer specializing in natural language processing and speaker recognition. He has worked in the government and private sectors and is currently a data scientist at a cybersecurity company. Antonio has Nanodegrees in AWS ML, ML DevOps, and NLP, 10+ program certifications in data science and AI, and a master's degree from the University of Fortaleza.

Portfolio

Axur
Python 3, Amazon Web Services (AWS), Machine Learning...
University of Fortaleza
Machine Learning, Artificial Intelligence (AI), Speech Analytics, Python 3...
Dell Lead
Python 3, Machine Learning, Artificial Intelligence (AI)...

Experience

  • Statistics - 4 years
  • Python 3 - 4 years
  • Natural Language Processing (NLP) - 4 years
  • Machine Learning - 4 years
  • Artificial Intelligence (AI) - 4 years
  • Deep Learning - 4 years
  • Generative Pre-trained Transformers (GPT) - 4 years
  • Data Analysis - 4 years

Availability

Part-time

Preferred Environment

Ubuntu, Visual Studio Code (VS Code), Jupyter Notebook, Python 3, Git

The most amazing...

...thing I've created is a speaker verification model, which has been used to identify the speaker of unknown audio from a phone apprehended by the police.

Work Experience

Data Scientist

2022 - PRESENT
Axur
  • Built machine learning (ML) models for detecting fraud cases, such as phishing and other online scams.
  • Performed content classification, threat risk evaluation, threat actor reputation assessment, and data analysis for other teams and projects.
  • Used technologies such as scikit-learn, TensorFlow, Hugging Face Transformers, and others from the AI stack.
  • Developed LLM-based products using technologies such as LangChain, Hugging Face Transformers, and OpenAI.
Technologies: Python 3, Amazon Web Services (AWS), Machine Learning, Natural Language Processing (NLP), Generative Pre-trained Transformers (GPT), Artificial Intelligence (AI), Graph Theory, Python, Large Language Models (LLMs), LangChain, OpenAI, Amazon SageMaker

Data Science Tech Researcher

2020 - PRESENT
University of Fortaleza
  • Contributed to projects in partnership with Ceará's justice department.
  • Developed machine learning models in Python for speaker identification and verification.
  • Created a module that compares audio from the state's prison database with audio from phones apprehended by the police and returns a similarity score to aid investigators' work.
Technologies: Machine Learning, Artificial Intelligence (AI), Speech Analytics, Python 3, Natural Language Processing (NLP), Generative Pre-trained Transformers (GPT), Python
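
As an illustration of the similarity scoring mentioned in the bullets above, speaker verification systems commonly compare fixed-length voice embeddings using cosine similarity. The sketch below assumes the embeddings have already been extracted by some model. The function names, the 0.7 threshold, and the toy vectors are hypothetical and do not describe the actual module.

```python
import math
from typing import Sequence


def cosine_similarity(a: Sequence[float], b: Sequence[float]) -> float:
    """Return the cosine similarity of two equal-length embedding vectors."""
    if len(a) != len(b):
        raise ValueError("embeddings must have the same dimension")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        raise ValueError("embeddings must be non-zero")
    return dot / (norm_a * norm_b)


def same_speaker(a: Sequence[float], b: Sequence[float],
                 threshold: float = 0.7) -> bool:
    """Decide whether two embeddings likely belong to the same speaker.

    The threshold is an assumption; in practice it would be tuned on
    labeled pairs of recordings.
    """
    return cosine_similarity(a, b) >= threshold


# Toy three-dimensional embeddings (real ones have hundreds of dimensions).
reference = [0.2, 0.9, -0.1]     # e.g. a recording from a known speaker
questioned = [0.25, 0.85, -0.05]  # e.g. a recording of unknown origin
score = cosine_similarity(reference, questioned)
```

A score close to 1 indicates very similar voices, while a score near 0 indicates unrelated ones. Such a score is an aid for investigators rather than a final decision.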

Artificial Intelligence Researcher

2021 - 2022
Dell Lead
  • Contributed to creating the voice assistant module for Dell's Matrix app, used internally by employees.
  • Set up the software consisting of two modules: speech recognition and natural language understanding.
  • Developed the speech recognition module using Vosk and the natural language understanding module using Rasa.
  • Configured the application for Dell employees' internal use.
Technologies: Python 3, Machine Learning, Artificial Intelligence (AI), Natural Language Processing (NLP), Generative Pre-trained Transformers (GPT), Natural Language Understanding (NLU), Python

Data Scientist

2020 - 2020
Superintendence of Research and Public Security Strategy (Supesp)
  • Performed extract-transform-load processes on social media data.
  • Built sentiment analysis models on extracted data.
  • Worked with the state government to continuously assess public opinion on policies enacted in the early days of the pandemic.
Technologies: Python 3, Artificial Intelligence (AI), Machine Learning, ETL, Sentiment Analysis, Natural Language Processing (NLP), Generative Pre-trained Transformers (GPT), Python

Machine Learning Intern

2018 - 2019
Lanlink Informática Ltda.
  • Developed machine learning models using Python and scikit-learn for churn prediction, contributing as part of a team.
  • Performed data acquisition and machine learning modeling-related tasks.
  • Trained other employees in AI and machine learning as part of my secondary responsibilities.
Technologies: Python 3, Artificial Intelligence (AI), Machine Learning, Python

Quality Trainee

2018 - 2018
Eletra Energy
  • Contributed to projects aimed at changing manufacturing procedures to improve product quality.
  • Analyzed data with Python and Excel as the primary tools.
  • Used the Lean Six Sigma methodology for project development.
Technologies: Statistics, Probability Theory, Python 3, Lean, Quality Control (QC), Statistical Quality Control (SQC), Python

Experience

Polaris

https://www.axur.com/polaris/cybersecurity-teams
Polaris is an AI-powered cybersecurity solution that provides proactive, actionable insights by analyzing and filtering vast amounts of cyber threat intelligence data. It transforms fragmented information into tailored alerts relevant to an organization's specific attack surface, enhancing threat management efficiency and enabling security teams to focus on strategic actions. With real-time updates, custom notifications, and seamless integration with existing tools, Polaris streamlines cybersecurity operations and boosts overall productivity.

Risk Protection Platform

https://www.axur.com/
A platform that helps several companies tackle digital fraud. As a data science team member, I worked on several artificial intelligence models, such as phishing detection, website content classification, threat risk evaluation, topic models, and threat actor reputation assessment.

PEDE Platform

A platform for AI-assisted criminal investigations that is currently in the final development stages. As a data science researcher, I contributed to creating the platform's speaker recognition module.

Dell Matrix

An internal app used by Dell employees. I contributed to this project as an artificial intelligence researcher at Dell Lead, an institute financed by Dell in Fortaleza, Brazil. I created the app's speech recognition and natural language understanding models. Our team's end product was a module capable of recognizing more than 30 commands and executing actions accordingly.

Smishing Detector

https://github.com/AntonioArtur/AWS-ML-Engineering/
A smishing (SMS phishing) detector that uses a DistilBERT base uncased model. I trained and deployed the model with Amazon SageMaker and served it via an AWS Lambda function. The experiments and complete project report are available in the GitHub repository.
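
To illustrate the serving pattern described above, the following is a minimal sketch of a Lambda handler that forwards an SMS text to a SageMaker endpoint. It is not taken from the repository: the endpoint name, the event shape (a JSON `body` with a `text` field), and the `{"inputs": ...}` request format are assumptions. Only `boto3.client("sagemaker-runtime")` and its `invoke_endpoint` call are standard AWS SDK APIs.

```python
import json
import os

# Hypothetical endpoint name; the real one is not stated in the profile.
ENDPOINT_NAME = os.environ.get("ENDPOINT_NAME", "smishing-distilbert")


def classify(text, client, endpoint_name=ENDPOINT_NAME):
    """Send one message to the endpoint and return the decoded prediction."""
    response = client.invoke_endpoint(
        EndpointName=endpoint_name,
        ContentType="application/json",
        # Assumed request format: a JSON object with an "inputs" field.
        Body=json.dumps({"inputs": text}).encode("utf-8"),
    )
    return json.loads(response["Body"].read())


def lambda_handler(event, context, client=None):
    """Lambda entry point; `client` is injectable so the code can be tested."""
    if client is None:
        # Imported lazily so the module loads without boto3 during local tests.
        import boto3
        client = boto3.client("sagemaker-runtime")

    # Assumed event shape: {"body": "{\"text\": \"...\"}"}
    payload = json.loads(event.get("body") or "{}")
    text = payload.get("text", "")
    if not text:
        return {"statusCode": 400,
                "body": json.dumps({"error": "missing 'text'"})}

    prediction = classify(text, client)
    return {"statusCode": 200, "body": json.dumps(prediction)}
```

Keeping the model behind a SageMaker endpoint and the request handling in Lambda separates the heavy inference container from the lightweight HTTP-facing code.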

Education

2020 - 2023

Master's Degree in Computer Science

University of Fortaleza - Fortaleza, Ceará, Brazil

2012 - 2019

Bachelor's Degree in Mechanical Engineering

University of Fortaleza - Fortaleza, Ceará, Brazil

Certifications

SEPTEMBER 2022 - PRESENT

AWS Machine Learning Engineer Nanodegree

Udacity

MARCH 2022 - PRESENT

Machine Learning DevOps Engineer Nanodegree

Udacity

JANUARY 2022 - PRESENT

Practical Data Science on the AWS Cloud

Coursera

DECEMBER 2021 - PRESENT

Natural Language Processing Specialization

Coursera

SEPTEMBER 2021 - PRESENT

MicroMasters Program in Artificial Intelligence

Columbia University | via edX

SEPTEMBER 2021 - PRESENT

Machine Learning Engineering for Production (MLOps)

Coursera

AUGUST 2021 - PRESENT

MicroMasters Program in Statistics and Data Science

Massachusetts Institute of Technology | via edX

APRIL 2021 - PRESENT

Natural Language Processing Expert Nanodegree

Udacity

MARCH 2021 - PRESENT

TensorFlow: Advanced Techniques

Coursera

MARCH 2020 - PRESENT

Applied Social Network Analysis in Python

University of Michigan | via Coursera

JULY 2019 - PRESENT

Deep Learning Specialization

Coursera

JULY 2019 - PRESENT

TensorFlow in Practice

Coursera

JANUARY 2019 - PRESENT

Machine Learning Specialization

Stanford | via Coursera

DECEMBER 2018 - PRESENT

Data Scientist in Python

Dataquest Labs, Inc.

Skills

Libraries/APIs

Pandas, NumPy, TensorFlow, PyTorch, Rasa NLU, NetworkX

Tools

Amazon SageMaker, Git, AWS CloudFormation

Languages

Python 3, Python, SQL

Platforms

Jupyter Notebook, Ubuntu, Amazon Web Services (AWS), Visual Studio Code (VS Code), AWS Lambda

Paradigms

ETL, Unit Testing

Other

Machine Learning, Artificial Intelligence (AI), Natural Language Processing (NLP), Statistics, Probability Theory, Sentiment Analysis, Deep Learning, Recurrent Neural Networks (RNNs), Data Analysis, BERT, Generative Pre-trained Transformers (GPT), Natural Language Understanding (NLU), Speech Analytics, Convolutional Neural Networks (CNNs), Social Network Analysis, Hypothesis Testing, Machine Learning Operations (MLOps), Lambda Functions, Lean, Computer Vision, Graph Theory, Quality Control (QC), Statistical Quality Control (SQC), Time Series Analysis, Speech Recognition, Large Language Models (LLMs), LangChain, OpenAI, GitHub Actions
