Data Science and Databases

Showing 1-8 of 124 results

Understanding Twitter Dynamics With R and Gephi: Text Analysis and Centrality

by Juan Manuel Ortiz de Zarate

Centrality and text analysis allow users to get more out of their social network data. Here’s how you can leverage them using R and Gephi.

12 minute readContinue Reading

A Deeper Meaning: Topic Modeling in Python

by Federico Albanese

Colloquial language doesn’t lend itself to computation. That’s where natural language processing steps in. Learn how topic modeling helps computers understand human speech.

8 minute readContinue Reading

Social Network Analysis in R and Gephi: Digging Into Twitter

by Juan Manuel Ortiz de Zarate

Thanks to rapid advances in technology, large amounts of data generated on social networks can be analyzed with relative ease, especially for those who use the R programming language and Gephi.

9 minute readContinue Reading

Serve Map Clusters 50x Faster Using Smarter Caching

by Florian Pfisterer

Serving map clusters to a mobile app can cause a significant performance bottleneck. Fortunately, it's a problem that can be solved with this caching strategy.

8 minute readContinue Reading

Ensemble Methods: The Kaggle Machine Learning Champion

by Juan Manuel Ortiz de Zarate

Two heads are better than one. This proverb describes the concept behind ensemble methods in machine learning. Let's examine why ensembles dominate ML competitions and what makes them so powerful.

9 minute readContinue Reading

Graph Data Science With Python/NetworkX

by Federico Albanese

Data inundates us like never before—how can we hope to analyze it? Graphs (networks, not bar graphs) provide an elegant approach. Find out how to start with the Python NetworkX library to describe, visualize, and analyze "graph theory" datasets.

9 minute readContinue Reading

How to Approach Writing an Interpreter From Scratch

by Sakib Hadžiavdić

How source code becomes a running program is often opaque: "Just run the compiler" is all that developers normally need to know. Writing an interpreter from scratch—including its lexer and parser—is an illuminating challenge.

14 minute readContinue Reading

Solving Bottlenecks With SQL Indexes and Partitions

by Mirko Marović

Indexes and partitioning can help with SQL performance, but they're not cure-alls. Through everyday examples of date range and LIKE queries, find out how to "think like an RDBMS" to make yours run faster.

14 minute readContinue Reading

Join the Toptal® community.