Jaime is available for hire

Jaime Leal

Verified Expert in Engineering

Machine Learning Developer

Location

Monterrey, Mexico

Toptal Member Since

December 25, 2019

Jaime has four years of experience working as a data scientist in all stages of the data science pipeline including data cleaning, feature engineering, model building, and model deployment mainly with structured data. He has developed in R and Python, and he has created automatic reports, visualizations, dashboards, RESTful APIs, and packages. Jaime is continuously working to improve his skills.

Machine Learning Data Analysis Python R RStudio Shiny Git GitHub Jupyter Natural Language Processing (NLP)Amazon API Gateway Deep Learning Amazon Web Services (AWS)Amazon EC2 Linux SQL

Portfolio

Linksbridge

RStudio, Dplyr, GitHub, Git, Web Forms, Data Wrangling, CSS, HTML, Databases...

Strong Analytics

Amazon WorkSpaces, Redshift, Amazon Web Services (AWS), RStudio, GitHub, Dplyr...

Teranalytics

Twitter API, RStudio, Sentiment Analysis, Dplyr, Caret, Data Analysis...

Experience

Machine Learning - 4 years Python - 4 years R - 4 years RStudio Shiny - 4 years Data Analysis - 4 years Data Cleaning - 4 years Generative Pre-trained Transformers (GPT) - 2 years Natural Language Processing (NLP) - 2 years

Availability

Part-time

Preferred Environment

RStudio, Spyder, Visual Studio, Git

The most amazing...

...thing I've coded is a two-phase fluid flow simulator in Python.

Work Experience

R Developer

2020 - 2020

Linksbridge

Developed an RShiny application to search vaccinations campaigns in a database and allow the user to update old entries and create new ones.
Wrote SQL queries to allow the application to interact with the database.
Wrote code documentation and used Git for version control.

Technologies: RStudio, Dplyr, GitHub, Git, Web Forms, Data Wrangling, CSS, HTML, Databases, SQL, RStudio Shiny, R, Web Development

Data Engineer

2020 - 2020

Strong Analytics

Worked with a team of developers to create two R libraries.
Developed a dashboard in RShiny with visualizations in ggplot to interact with a data analytics engine.
Wrote unit tests with the testthat library to test that the R packages that we developed worked as intended.
Used Amazon Redshift with R to store and retrieve GB's of data and perform analysis.

Technologies: Amazon WorkSpaces, Redshift, Amazon Web Services (AWS), RStudio, GitHub, Dplyr, Unit Testing, Data Visualization, HTML, CSS, Git, R

Data Scientist

2017 - 2020

Teranalytics

Worked in all stages of the data science pipeline including data cleaning, feature engineering, model building, and model deployment, mainly with structured data.
Developed several interactive RShiny dashboards and deployed them to shinyapps.io.
Managed and used Amazon Web Services (AWS) S3, EC2, Lambda, API Gateway, Cognito, and RDS.
Contributed to projects in various industries including manufacturing, transportation, clinical trials, and marketing.
Build an app to collect status updates and comments from Twitter and Facebook and perform text analysis and sentiment analysis.

Technologies: Twitter API, RStudio, Sentiment Analysis, Dplyr, Caret, Data Analysis, Data Cleaning, AWS Lambda, Amazon S3 (AWS S3), Amazon EC2, Amazon Web Services (AWS), Amazon API Gateway, GitHub, ETL, Generative Pre-trained Transformers (GPT), Natural Language Processing (NLP), GPT, Data Visualization, Machine Learning, Data Science, SQL, Git, Python, R

Experience

Data Science Certification

https://jaimeleal.github.io/data-science-specialization

Coursera data science specialization projects.

Tex Prediction Application

https://jaimeleal.shinyapps.io/capstone-text-prediction/

A simple text prediction application.

Porous Media Flow Simulator

A porous media flow simulator that I made. It is used to simulate flow in a reservoir. The simulator can handle single-phase and two-phase flow, as well as vertical wells, boundary conditions, and different solvers.

Training a Speech Synthesis Model

To train the speech synthesis models you need a dataset consisting of thousands of pairs of audio clips and their transcriptions. Extracting audio clips from recordings is easy. The difficult part is matching each audio clip to its transcript. Even in cases when you already have an accurate transcript, as it is with audiobooks, the process of manually matching each audio clip to its corresponding text is tedious and time-consuming.
A workaround is to use an automatic speech transcription service to transcribe each audio clip.

Visualizing Mexico's Fishing Industry

A Tableau dashboard to visualize the fishing industry performance in Mexico from 2008 to 2014 with information from CONAPESCA. The dashboard shows production per state and by top species, segmented by its origin, capture, or aquaculture.

Movie Recommender System

Select a movie from the search bar and the app will recommend other movies like the one that you selected.

The app uses a content-based recommender system, trained on movie descriptions to suggest movies that are similar to each other. The dataset for the analysis comes from "The Movies Dataset" and it contains 45,000 movies released on or before July 2017.

Visualizing US Power Plants

A Tableau dashboard to explore the electricity production in the United States segmented by location and energy source with data from the US Energy Information Administration. The dashboard gives an overview of the total electricity production of each state and the individual locations of each power plant.

Skills

Languages

Python, R, Markdown, SQL, HTML, CSS, Excel VBA, JavaScript

Frameworks

RStudio Shiny, Django

Libraries/APIs

Caret, REST APIs, Twitter API, Facebook API, Keras, TensorFlow, NumPy, SciPy, Pandas, Matplotlib

Tools

Jupyter, Git, GitHub, Dplyr, Bitbucket, Spyder, Microsoft Access, Tableau, Visual Studio, Amazon WorkSpaces

Paradigms

Data Science, ETL, Unit Testing, REST

Platforms

Jupyter Notebook, Windows, Visual Studio Code (VS Code), Amazon Web Services (AWS), Amazon EC2, AWS Lambda, Linux, Docker, Heroku, RStudio

Other

Machine Learning, Data Cleaning, Data Analysis, Natural Language Processing (NLP), Sentiment Analysis, Speech to Text, Text to Speech (TTS), Amazon API Gateway, Linear Algebra, Numerical Methods, Deep Learning, GPT, Generative Pre-trained Transformers (GPT), Big Data, Web Development, Data Wrangling, Data Visualization

Storage

Amazon S3 (AWS S3), PostgreSQL, Databases, Web Forms, Redshift, Redis

Education

2015 - 2016

Master's Degree in Petroleum Engineering

Texas A & M University - College Station, Texas, USA

Certifications

JULY 2020 - PRESENT

Big Data Modeling and Management Systems

Coursera

JULY 2020 - PRESENT

Introduction to Big Data

Coursera

OCTOBER 2019 - PRESENT

Deep Learning Specialization

Coursera

MARCH 2019 - PRESENT

Data Science

Coursera

Collaboration That Works

How to Work with Toptal

Toptal matches you directly with global industry experts from our network in hours—not weeks or months.

Share your needs

Discuss your requirements and refine your scope in a call with a Toptal domain expert.

Choose your talent

Get a short list of expertly matched talent within 24 hours to review, interview, and choose from.

Start your risk-free talent trial

Work with your chosen talent on a trial basis for up to two weeks. Pay only if you decide to hire them.

Top talent is in high demand.

Start hiring