Cubed-xarray lightning talk at SciPy2023

July 17, 2023

Short 4-minute talk on using the Cubed package as an alternative to dask.array for processing large datasets in Xarray.

Given as a lightning talk at the SciPy Conference 2023 in Austin, TX.

See this blog post for more details: https://xarray.dev/blog/cubed-xarray

Tom Nicholas

Transcript

Cubed: Bounded-Memory Serverless Array Processing (in Xarray)
*Tom Nicholas, Tom White
*[email protected] *github.com/TomNicholas
Big science means *Big* arrays 😍 😬 PBs??
So use dask.array!
Dask is great, but it doesn’t always succeed… Sometimes it unexpectedly exceeds your RAM budget 😕
Q: Can we guarantee that distributed array execution respects RAM constraints?
A: Yes! For certain operations… 🤔
Rechunker, ✨Cubed✨ (bounded-memory)
Invented by
Cubed’s Design
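The bounded-memory question above comes down to whether the memory each task needs can be estimated before anything runs. Here is a minimal sketch of that idea in plain Python. It assumes a simplified model in which a task holds all of its input chunks plus one output chunk in RAM at once. The names `projected_task_memory` and `check_plan` are hypothetical, not Cubed's API, and Cubed's real accounting may well differ.

```python
from math import prod


def projected_task_memory(chunk_shape, itemsize, n_inputs):
    """Estimate bytes for one task: n_inputs input chunks plus one output chunk.

    Simplifying assumption: every chunk has the same shape and dtype size.
    """
    chunk_bytes = prod(chunk_shape) * itemsize
    return (n_inputs + 1) * chunk_bytes


def check_plan(chunk_shape, itemsize, n_inputs, allowed_mem):
    """Reject a plan up front if a single task could exceed the RAM budget."""
    needed = projected_task_memory(chunk_shape, itemsize, n_inputs)
    if needed > allowed_mem:
        raise ValueError(
            f"projected memory {needed} bytes exceeds allowed {allowed_mem} bytes"
        )
    return needed


# Adding two float64 arrays chunked as 1000 x 1000, with a 100 MB budget per task.
needed = check_plan(chunk_shape=(1000, 1000), itemsize=8, n_inputs=2,
                    allowed_mem=100_000_000)
```

Because the check happens before execution, an oversized plan fails immediately instead of partway through a long job.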
Serverless execution
Deploy one serverless container per chunk, reading from and writing to Zarr
Coiled Functions …
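The one-container-per-chunk pattern can be illustrated without any cloud infrastructure. In this rough sketch, plain dictionaries stand in for Zarr stores and a thread pool stands in for serverless containers. Both are assumptions made for illustration, and `process_chunk` and `map_chunks` are hypothetical names rather than part of Cubed.

```python
from concurrent.futures import ThreadPoolExecutor


def process_chunk(key, source, target, func):
    """One task: read a single chunk, transform it, write a single chunk."""
    chunk = source[key]                       # read from the input store
    target[key] = [func(v) for v in chunk]    # write to the output store
    return key


def map_chunks(source, target, func, max_workers=4):
    """Launch one independent task per chunk and wait for all of them."""
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        futures = [
            pool.submit(process_chunk, key, source, target, func)
            for key in source
        ]
        for future in futures:
            future.result()  # re-raise any error from a failed task


# Dicts keyed by chunk index stand in for chunked Zarr arrays.
source = {(0,): [1, 2, 3], (1,): [4, 5, 6], (2,): [7, 8]}
target = {}
map_chunks(source, target, lambda v: v * 2)
```

Each task touches only its own chunk and communicates only through storage, so its memory use depends on the chunk size, not on the size of the whole array.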
Xarray wraps Cubed OR Dask OR [new things??]
Executes via Executes via Cubed ?? Tabular data: Array data:
Read the blog post! https://xarray.dev/blog/cubed-xarray
Also, thanks to Tom White for writing Cubed!
Join the discussion!