remote systems

Cloud / HPC

Xarray provides data structures and an intuitive interface for interacting with datasets.

Parallel computing system: allows users to deploy clusters of compute nodes for data processing. Dask tells the nodes what to do.

Distributed storage: “Analysis Ready Data” stored on globally available distributed storage.

PANGEO ARCHITECTURE
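To make the "Dask tells the nodes what to do" idea concrete, here is a toy sketch of the underlying pattern: data is split into chunks, each chunk is reduced by a worker, and the partial results are combined. This is not the Dask or Xarray API. It uses only Python's standard library (`concurrent.futures`) as a stand-in, and the function names (`split_into_chunks`, `partial_sum`, `chunked_mean`) and the synthetic data are hypothetical, chosen for illustration.

```python
from concurrent.futures import ThreadPoolExecutor


def split_into_chunks(values, chunk_size):
    """Split a sequence into consecutive chunks, loosely like chunked storage."""
    return [values[i:i + chunk_size] for i in range(0, len(values), chunk_size)]


def partial_sum(chunk):
    """Work done by one worker: reduce its chunk to a (sum, count) pair."""
    return sum(chunk), len(chunk)


def chunked_mean(values, chunk_size=4, n_workers=2):
    """Compute a mean by handing chunks to workers and combining the partials."""
    chunks = split_into_chunks(values, chunk_size)
    # The pool plays the role of the scheduler: it assigns chunks to workers.
    with ThreadPoolExecutor(max_workers=n_workers) as pool:
        partials = list(pool.map(partial_sum, chunks))
    total = sum(s for s, _ in partials)
    count = sum(n for _, n in partials)
    return total / count


# Synthetic stand-in for a dataset variable (values 0.0 through 9.0).
temperatures = [float(t) for t in range(10)]
result = chunked_mean(temperatures)
print(result)
```

In a real Pangeo deployment, Xarray would describe the dataset and its dimensions, and Dask would schedule the per-chunk work across cluster nodes rather than local threads. The sketch only shows the split, map, and combine shape of the computation.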
large part of the CMIP6 archive in public cloud buckets. Check out pangeo-data.github.io/pangeo-cmip6-cloud for more info.

Wondering how to bring your data to the cloud? Check out Pangeo Forge.

Later today (2:30 EST) - Pangeo Forge: Crowdsourcing Analysis-Ready, Cloud Optimized Data for Ocean, Weather, and Climate Science

Ocean Science Meeting [IN08C Open Ocean Science] - Pangeo Forge Mini-Hackathon