Slide 10
Slide 10 text
Analysis-Ready Cloud-Optimized (ARCO) data
Analysis-Ready:
β’ Think in βDatasets/Datacubesβ not
β
f
ilesβ and "folders"
β’ No need for tedious
homogenizing / cleaning steps
β’ Curated and cataloged
Cloud Optimized:
β’ Compatible with object storage
(access via HTTP)
β’ Supports lazy access, intelligent
subsetting, and streaming access
β’ Integrates with high-level analysis
libraries and distributed
frameworks for high parallel
throughput
Abernathey et al., "Cloud-Native Repositories for Big Scienti
fi
c Data," 2021, doi: 10.1109/MCSE.2021.3059437