DuckDB and duckplyr: An in-process database management system in your R script

Abstract

DuckDB is an open-source analytical database management system with clients for several languages, including R. DuckDB offers the functionality of a database system, including high performance, persistence and full SQL support. At the same time, DuckDB has a small footprint with no dependency on an external server it is trivial to deploy using the library(duckdb) call. DuckDB also has deep integrations into client libraries such as dplyr using the duckplyr package. In this talk, I explain how DuckDB achieves its high performance, demonstrate its performance through a live demo and showcase its R integrations.

Type
Publication
Presented at 2024 Conference