Data Engineering

Apache Spark Optimization Techniques for High-performance Data Processing

Apache Spark is an analytics engine that can handle very large data sets. This guide reveals strategies to optimize its performance using PySpark.

11 minute read
Necati Demir, PhD

A Data Engineer’s Guide To Nontraditional Data Storages

With the rise of big data and data science, storage and retrieval have become a critical pipeline component for data use and analysis. Recently, new data storage technologies have emerged. But the question is: Which one should you choose? Which one is best suited for data engineering?

In this article, Toptal Data Scientist Ken Hu compares three prominent storage technologies within the context of data engineering.

7 minute read
Ken Hu
