Upgrade to Pro — share decks privately, control downloads, hide ads and more …

Cubed-xarray lightning talk at SciPy2023

Cubed-xarray lightning talk at SciPy2023

Short 4-minute talk on using the Cubed package as an alternative to dask.array for processing large datasets in Xarray.

Given as a lightning talk at the SciPy Conference 2023 in Austin, TX.

See this blog post for more details (https://xarray.dev/blog/cubed-xarray)

Tom Nicholas

July 17, 2023
Tweet

More Decks by Tom Nicholas

Other Decks in Programming

Transcript

  1. Cubed: Bounded-Memory Serverless Array Processing (in Xarray) *Tom Nicholas Tom

    White *thomas.nicholas@columbia.edu *github.com/TomNicholas
  2. So use dask.array! Dask is great, but it doesn’t always

    succeed… Sometimes unexpectedly exceeds your RAM budget 😕 Q: Can we guarantee distributed array execution respects RAM constraints?
  3. Xarray wraps Cubed OR Dask OR [new things??] Executes via

    Executes via Cubed ?? Tabular data: Array data: