Data Science and Databases

Showing 1-7 of 138 results

Share

Apache Spark Optimization Techniques for High-performance Data Processing

Apache Spark is an analytics engine that can handle very large data sets. This guide reveals strategies to optimize its performance using PySpark.

11 minute readContinue Reading
Necati Demir, PhD

Necati Demir, PhD

Necati is a software engineer specializing in data science, machine learning, back-end development, and DevOps. He is an AWS Certified Solutions Architect and AWS Certified Machine Learning Specialist with a doctorate in computer engineering. Necati serves as Chief AI Officer and CTO of Datagran, a machine learning automation company that he co-founded.

5 Pillars of Responsible Generative AI: A Code of Ethics for the Future

Generative AI advances raise new questions around data ownership, content integrity, algorithmic bias, and more. Here, three experts at the forefront of NLP present recommendations for developing ethical generative AI solutions.

12 minute readContinue Reading
Madelyn Douglas

Madelyn Douglas

Madelyn is the Lead Editor of Engineering at Toptal and a former software engineer at Meta. She has more than six years of experience researching, writing, and editing for engineering publications, specializing in emerging technologies and AI. She previously served as an editor at USC’s Viterbi School of Engineering and her research on engineering ethics was published at IEEE’s NER 2021 conference.

In this ask-me-anything-style Q&A, leading Toptal AI developer Joao Diogo de Oliveira fields questions from fellow engineers about resources for pivoting to ML, approaches to large language models, and the most critical future applications of AI.

6 minute readContinue Reading
Joao Diogo de Oliveira

Joao Diogo de Oliveira

Joao is an AI developer with more than 10 years of experience at Fortune 100 companies like Procter & Gamble and startups in the healthcare, energy, and finance industries. Joao holds a master’s degree in computer science from the University of Porto and has multiple certifications in ML and deep learning.

Advantages of AI: Using GPT and Diffusion Models for Image Generation

Generative AI is taking the world by storm, with potentially profound impacts on the content we create. Learn the basics of AI image generation and produce sophisticated artistic renderings with this tutorial.

7 minute readContinue Reading
Juan Manuel Ortiz de Zarate

Juan Manuel Ortiz de Zarate

Juan is a developer, data scientist, and doctoral researcher at the University of Buenos Aires where he studies social networks, AI, and NLP. Juan has more than a decade of data science experience and has published papers at ML conferences including SPIRE and ICCS.

Ask an NLP Engineer: From GPT Models to the Ethics of AI

Want to expand your skills amid the current surge of revolutionary language models like GPT-4? In this ask-me-anything-style tutorial, Toptal data scientist and AI engineer Daniel Pérez Rubio fields questions from fellow programmers on a wide range of machine learning, natural language processing, and artificial intelligence topics.

10 minute readContinue Reading
Daniel Pérez Rubio

Daniel Pérez Rubio

Daniel is a data scientist, developer, and former CTO who has specialized in NLP for more than six years, most recently focusing on large language models (LLMs). His experience includes being a senior data scientist at BASF and Daimler.

World-class articles, delivered weekly.

Subscription implies consent to our privacy policy

An Expert Workaround for Executing Complex Entity Framework Core Stored Procedures

Microsoft’s Entity Framework Core is a popular object-relational mapper, but it doesn’t support the return of complex type results from stored procedures. A clever bit of code gets us around this limitation, returning non-database entities with ease.

5 minute readContinue Reading
Pankaj Kansodariya

Pankaj Kansodariya

Pankaj is a back-end developer and Microsoft Certified Professional with more than 18 years of experience within the Microsoft ecosystem, including C#, VB.NET, SQL Server, and cloud computing with Microsoft Azure. He has worked as a .NET developer at companies including Granicus, Gartner, and Jacobs.

Strategic Listening: A Guide to Python Social Media Analysis

Listening is everything—especially when it comes to effective marketing and product design. Gain key market insights from social media data using sentiment analysis and topic modeling in Python.

9 minute readContinue Reading
Federico Albanese

Federico Albanese

Federico is an expert Python developer and data scientist who has worked at Facebook, implementing deep learning models. He is a university lecturer, and his PhD research focuses on natural language processing and machine learning.

Toptal Engineering Expert

Gabriel Courtemanche

Gabriel is a highly efficient and reliable professional who possesses a broad skill set for web application development. He's been working on a range of products and clients—from working on scalability problems in production engineering teams at Shopify and Autodesk to launching new applications for startups. Most of his work consists of leading technical teams, by creating an easy development environment, fixing technical debts, providing best practices code examples, and mentoring devs.
Read more

Previously At

Shopify

World-class articles, delivered weekly.

Subscription implies consent to our privacy policy

Join the Toptal® community.