Upgrade to Pro — share decks privately, control downloads, hide ads and more …

Catalog-driven workflows using CSW

Rich Signell
January 08, 2016

Catalog-driven workflows using CSW

Presentation at ESIP Winter Meeting

Rich Signell

January 08, 2016
Tweet

More Decks by Rich Signell

Other Decks in Science

Transcript

  1. Catalog-driven workflows using CSW Rich Signell , USGS, Woods Hole,

    MA, USA Filipe Fernandes, SECOORA, Brazil Kyle Wilcox, Axiom Data Science, Wickford, RI, USA ESIP Winter Meeting, Washington, DC 2016-01-08
  2. The 4th Network Layer: Data • “We need an end-to-end,

    layer-by-layer, designed information technology … that are composed of no more than a stack of protocols” • “We need open standards… and above all, we need to teach scientists to work in this new layer of data” 2 From the essay: “I have seen the Paradigm Shift, and It Is Us”, byJohn Wilbanks, in the book “The Fourth Paradigm” Data Web TCP/IP Ethernet
  3. US Integrated Ocean Observing System (IOOS® ) • Global Component

    • Coastal Component  17 Federal Agencies  11 Regional Associations
  4. IOOS Core Principles • Adopt open standards & practices •

    Avoid customer-specific stovepipes • Standardized access services implemented at data providers 4 Customer Web access service Data Provider Observations Models
  5. Time Series, Trajectories Meteorology and Wave Buoy in the Gulf

    of Maine. Image courtesy of NOAA. Ocean Glider. Photo by Dave Fratantoni, Woods Hole Oceanographic Institution
  6. IOOS Data Infrastructure Diagram ROMS ADCIRC HYCOM SELFE NCOM NcML

    NcML NcML NcML NcML Common Data Model OPeNDAP NetCDF Subset THREDDS Data Server Standardized (CF-1.6, SGRID-0.1, UGRID-0.9) Virtual Datasets Nonstandard Model Output Data Files Web Services Matlab Panoply IDV Clients NetCDF -Java Library or Broker WMS ncISO ArcGIS NetCDF4 -Python FVCOM Python EDC NetCDF-Java SOS Geoportal Server GeoNetwork CKAN Observed data (buoy, gauge, ADCP, glider) Web Portals pycsw NcML Grid TimeSeries Profile Trajectory TimeSeriesProfile Sgrid Ugrid Nonstandard Data Files Catalog Services Rectilinear ERDDAP WCS
  7. 2015 Boston Light Swim 2015 Aug 15, 7:00 am start

    8 mile swim No wet suit How cold will the water be?
  8. 18

  9. 19

  10. Workflow (3/3) Axiom Data Science – Runs a CSW search

    (in a cron job) on the modeling groups pycsw services, filtering on datasets that contain a project called “CMG_Portal” – Datasets that have valid WMS services are added to the portal See <https://github.com/USGS-CMG/usgs-cmg- portal/wiki> for details of the workflow 22
  11. 23

  12. 25

  13. 27

  14. 28

  15. 30

  16. Benefits of catalog-driven applications • Dynamically adapt to new or

    changing data • Find the machine-to-machine issues – Easy problems that can be fixed in minutes to day – Harder problems to guide future work • Fixes for your workflow benefit everyone • Build success stories • Create reproducible workflows that others can learn from, expand on, or transform • Standardized workflows help develop the 4th network layer for data