Issues affecting Pangeo
● Running out of member when rechunking with dask (work around using
xr.to_zarr(append=True) Memory Backpressure issue (D.E Shaw’s graph
manipulation tools!)
● Dask-cloudprovider very attractive to orgs like USGS: FargateCluster “rate
exceeded” issue
● Community understanding of chunking impact on use
● Dask Performance challenges, e.g. pangeo/#194, dask/#3595
○ More work on graph optimization, high-level graphs, task-fusion, etc..
● Dask-deployment: More work on enabling heterogeneous worker pools,
harmonization among systems, etc...