Data Science and Databases

How to Implement a Data Quality Process

by Alexander Hauskrecht

Data quality is a crucial element of any successful data warehouse solution. As the complexity of data warehouses increases, so does the need for data quality processes. In this article, Toptal Data Quality Developer Alexander Hauskrecht outlines how you can ensure a high degree of data quality and why this process is so important.

16 minute read

SQL Indexes Explained, Pt. 2

by Mirko Marović

Sorting a table can make some queries faster—but the maintenance cost is untenable. Enter real database indexes and their most common implementation structure: the B-tree.

10 minute read

Serializing Complex Objects in JavaScript

by Luke Wilson

The Tanagra.js library is designed to be simple and lightweight, and it currently supports Node.js and ES6 classes. The main implementation supports JSON, and an experimental version supports Google Protocol Buffers.

7 minute read

Optimizing Retailer Revenue with Sales Forecasting AI

by Ahmed Khaled

Retailers often face supply and demand issues that cause them to miss out on potential sales or tie up a lot of money in overstocked products. In this article, Toptal Data Scientist Ahmed Khaled explains how retailers can boost revenues and cut costs with sales forecasts backed by artificial intelligence.

9 minute read

Embeddings in Machine Learning: Making Complex Data Simple

by Yaroslav Kopotilov

Working with non-numerical data can be challenging, even for seasoned data scientists. Such data needs to be transformed before it can be put to good use. But how? In this article, Toptal Data Scientist Yaroslav Kopotilov introduces embeddings and demonstrates how they can be used to visualize complex data and make it usable.

11 minute read

The Many Applications of Gradient Descent in TensorFlow

by Alan Reiner

TensorFlow is one of the leading tools for training deep learning models. Outside that space, it may seem intimidating and unnecessary, but it has many creative uses—like producing highly effective adversarial input for black-box AI systems.

18 minute read

Modern Web Scraping with Python and Selenium

by Neal Barnett

Web scraping has been around since the early days of the World Wide Web, but scraping modern sites that heavily rely on new technologies is anything but straightforward. In this article, Toptal Software Developer Neal Barnett demonstrates how you can use Python and Selenium to scrape sites that employ a lot of JavaScript, iframes, and certificates.

9 minute read

SQL Indexes Explained, Pt. 1

by Mirko Marović

Properly used, an SQL database index can be so effective that it might seem like magic. But the following series of exercises will show that underneath, the logic of most SQL indexes, and of wielding them correctly, is quite straightforward.

8 minute read
