DataScience

Showing 1-9 of 9 results

Share

5 Pillars of Responsible Generative AI: A Code of Ethics for the Future

Generative AI advances raise new questions around data ownership, content integrity, algorithmic bias, and more. Here, three experts at the forefront of NLP present recommendations for developing ethical generative AI solutions.

12 minute readContinue Reading
Madelyn Douglas

Madelyn Douglas

In this ask-me-anything-style Q&A, leading Toptal AI developer Joao Diogo de Oliveira fields questions from fellow engineers about resources for pivoting to ML, approaches to large language models, and the most critical future applications of AI.

6 minute readContinue Reading
Joao Diogo de Oliveira

Joao Diogo de Oliveira

Ask an NLP Engineer: From GPT Models to the Ethics of AI

Want to expand your skills amid the current surge of revolutionary language models like GPT-4? In this ask-me-anything-style tutorial, Toptal data scientist and AI engineer Daniel Pérez Rubio fields questions from fellow programmers on a wide range of machine learning, natural language processing, and artificial intelligence topics.

10 minute readContinue Reading
Daniel Pérez Rubio

Daniel Pérez Rubio

Mining for Data Clusters: Social Network Analysis With R and Gephi

Explore X (formerly Twitter) data clusters to uncover user behaviors (e.g., repost and reply patterns) within online communities. This guide focuses on a politically charged data set to illustrate the process of visualizing and analyzing social data.

8 minute readContinue Reading
Juan Manuel Ortiz de Zarate

Juan Manuel Ortiz de Zarate

Python vs. R: Syntactic Sugar Magic

Python and R empower data scientists to solve problems using elegant syntactic sugar, simplifying coding and solution exploration. Each language brings its unique capabilities and approach to bear.

7 minute readContinue Reading
Leandro Roser

Leandro Roser

Graph Data Science With Python/NetworkX

Data inundates us like never before—how can we hope to analyze it? Graphs (networks, not bar graphs) provide an elegant approach. Find out how to start with the Python NetworkX library to describe, visualize, and analyze “graph theory” datasets.

9 minute readContinue Reading
Federico Albanese

Federico Albanese

Stars Realigned: Improving the IMDb Rating System

IMDb ratings have genre bias: For example, dramas tend to score higher. Removing common feature bias and keeping unique characteristics, it’s possible to create a new, refined score based on IMDb information.

10 minute readContinue Reading
Juan Manuel Ortiz de Zarate

Juan Manuel Ortiz de Zarate

A Data Engineer’s Guide To Nontraditional Data Storages

With the rise of big data and data science, storage and retrieval have become a critical pipeline component for data use and analysis. Recently, new data storage technologies have emerged. But the question is: Which one should you choose? Which one is best suited for data engineering?

In this article, Toptal Data Scientist Ken Hu compares three prominent storage technologies within the context of data engineering.

7 minute readContinue Reading
Ken Hu

Ken Hu

Data Mining for Predictive Social Network Analysis

Analysts have come to recognize social network data as a virtual treasure trove of information for sensing public opinion trends and groundswells of support. In this article, Toptal Engineer Elder Santos describes the techniques he employed for a proof-of-concept that effectively analyzed Twitter Trend Topics to predict, as a sample test case, regional voting patterns in the 2014 Brazilian presidential election.

7 minute readContinue Reading
Elder Santos

Elder Santos

World-class articles, delivered weekly.

Subscription implies consent to our privacy policy

Join the Toptal® community.