Skip to contents

DuckDB can query Parquet files directly without importing. This provides excellent performance with minimal memory usage.

Usage

query_parquet(
  query = NULL,
  data_path = "inst/extdata/parquet/by_year_month",
  stations = NULL,
  date_range = NULL
)

Arguments

query

SQL query or NULL for interactive connection

data_path

Path to Parquet files

stations

Filter for specific stations

date_range

Date range as c(start_date, end_date)

Examples

if (FALSE) { # \dontrun{
# Query recent data
df <- query_parquet(
  "SELECT * FROM buoy_data WHERE wave_height > 5",
  date_range = c(Sys.Date() - 30, Sys.Date())
)
} # }