Slide 1

Slide 1 text

The eArchiving Initiative Ensuring interoperable, open, transparent, legally compliant and sustainable access to digital records ITPRESS Istanbul, 3.Sep 2024 Gregor Završnik, Geoarh

Slide 2

Slide 2 text

• Motivations for the eArchiving initiative • What is the eArchiving initiative • How do we preserve digital records? • eArchiving - Key activities • Reference implementations • What is next? Introduction

Slide 3

Slide 3 text

Motivations for the eArchiving initiative Why do we need to preserve digital records? And why backup is not archiving.

Slide 4

Slide 4 text

• Missing context, documentation • Long term sustainable access • Unreadable formats? • Reusing old software? Long term accessibility of digital records https://wiki.preterhuman.net/index.php?curid=1725 By Photograph: Robert Jacek Tomczak - Own work, CC BY- SA 3.0, https://commons.wikimedia.org/w/index.php?curid=94360 Parcels Feature class Related ownership table https://desktop.arcgis.com/en/arcmap/latest/manage- data/geodatabases/table-basics.htm

Slide 5

Slide 5 text

• Authenticity • How do we know data is authentic • How do we know we understand the data properly • Provenance • What changes were made to the data 5 Use case 1 - Legal safety (Trust and compliance)

Slide 6

Slide 6 text

Use case 2: Loss of Inter-agency references (Sustainability and business support) • Water permit was issued in 2005 • The permit is issued to a Parcel number (as its location) • Today this parcel doesn’t exist • How can we find it? 6

Slide 7

Slide 7 text

Use Case 3 – Information product recreation (Storing information not Technology) GIS Tools Organization, Queries, Cartographic projections, transformations… Symbology, geoprocessing, exports Vector data Raster data Database Lists, codepages Models

Slide 8

Slide 8 text

Role of the eArchiving Initiative in EU Policies How is the digital economy different?

Slide 9

Slide 9 text

• To foster the availability of data for use • Strengthen data-sharing mechanisms • Improve technical, semantic, organizational and legal interoperability • Simplify data exchange within private and public archives • And much more eArchiving Initiative role in EU Digital Decade

Slide 10

Slide 10 text

No content

Slide 11

Slide 11 text

If data is the new Oil, what drives the economy?

Slide 12

Slide 12 text

No content

Slide 13

Slide 13 text

No content

Slide 14

Slide 14 text

the value of a (tele) communications network is proportional to the square of the number of connected users of the system ( n2) Connectivity brings value to data in the digital economy Metcalfe’s law - The value of connected Archives

Slide 15

Slide 15 text

Long range - time perspective Using data from the past to support and predict the future

Slide 16

Slide 16 text

What is eArchiving Initiative?

Slide 17

Slide 17 text

eArchiving key information • Digital Europe Programme • DG CNECT and E-ARK Consortium • Start date 1st October 2022 • Two years + two possible annual extensions https://digital-strategy.ec.europa.eu/en/activities/earchiving

Slide 18

Slide 18 text

eArchiving Key information 5 Group Members: • Austrian Institute of Technology (AIT), LEAD • Highbury R&D, Ireland • Gabinete Umbus, Spain • KEEP Solutions, Portugal • DLM Forum, Estonia 6 subcontractors: • Association des Archivists Francophones de Belgique • EasyLean, Estonia • Ever Team Software, France • IdenTrust, Belgium • KMD, Denmark • Prisma Cultura, Italy 18 E-ARK Consortium

Slide 19

Slide 19 text

eArchiving Key information 15 organisations via DLM Forum: • Archives of the Republic of Slovenia, • Docbyte, Belgium • ES Solutions, Sweden • Geoarh, Slovenia • Instituto de Engenharia de Sistemas e Computadores – Investigação e Desenvolvimento em Lisboa, (INESC-ID), Portugal, • National Archives of Estonia, Finland and Norway • Stichting Open Preservation, Netherlands • Penwern, U.K. • Poliphon, Hungary • Serda, France • Swiss Federal Archives, • Kommunalförbundet Sydarkivera, Sweden • Kaakkois-suomen Ammattikorkeakoulu (Xamk), Finland 19 E-ARK Consortium

Slide 20

Slide 20 text

• Openness and transparency • Sustainability and legal compliance eArchiving helps people preserve and reuse information over the long-term • Interoperability by default

Slide 21

Slide 21 text

Origins of eArchiving E-ARK Research Project (2014-2017) eArchiving CEF Building block (2018-2019; 2019-2021) eArchiving Initiative (2022 >)

Slide 22

Slide 22 text

• Wide adoption of the E-ARK specifications across a broad range of sectors, domains and countries, evidenced by real impact and engagement, culminating in being the official de facto standard. • Sustainability in all its forms is a key part of our mission. Developing an eArchiving curriculum and Conformance Seal is part of this drive. • We are focussing on Capacity Building and Support to deepen our relationships with our user communities. eArchiving Vision

Slide 23

Slide 23 text

The eArchiving Initiative: the five Activities 23

Slide 24

Slide 24 text

How do we preserve digital records?

Slide 25

Slide 25 text

Open Archival Information System – reference model (ISO 14721) CONTEXT RENDERING INFORMATION BEHAVIOUR STRUCTURE

Slide 26

Slide 26 text

The OAIS content package management model Open Archival Information System (OAIS) P r o d u c e r C o n s u m e r Ingest Pre-Ingest Data Management Preservation Planning Administration Archival Storage SIP Access AIP DIP

Slide 27

Slide 27 text

• Availability • Usability • Trustorthyness / Authenticity • Completeness • Time proof Findable Accessible Interoperable Reusable Archival principles vs. F.A.I.R Data

Slide 28

Slide 28 text

What should we preserve? The Significant Properties Model • Content: conveys information, not necessarily human readable • Context: background information on technical and business environments to which the digital objects relate • Rendering: how the content of the object appears or is recreate • Structure: component parts of the object and how they relate to each other • Behaviour: functionality that is intrinsic to an object 28

Slide 29

Slide 29 text

Context example • Positional accuracy of cadaster parcels: 20m • Positional accuracy of orthophoto image (2006) in this area: 6m Knowing data limitations helps us interpret it 29

Slide 30

Slide 30 text

Example: Rendering – Coordinate systems 30 Finland Dataset Dataset: EPSG:3067 ETRS89 / TM35FIN(E,N) Finland Background map: EPSG:3857 WGS84 OpenStreetMap OSM Standard 30

Slide 31

Slide 31 text

Example: Structure data coming from a complex system Ministry of culture Cultural heritage Ministry of defense Environmental agency Environmental restrictions Ministry of Agriculture Land use Ministry of environment Urban plans Surveying & Mapping Authority Cadaster maps Building permit 31

Slide 32

Slide 32 text

Example: Behavior 32

Slide 33

Slide 33 text

• Self descriptive packages (OAIS Standard) • Based on archival standards and standardized metadata • Documented to preserve the knowledge base Solving the digital preservation issues

Slide 34

Slide 34 text

Specifications CSIP (Common Specification for Information Packages) METS E-ARK SIP (Submission Information Package) METS E-ARK AIP (Archival Information Package) METS E-ARK DIP (Dissemination Information Package) METS CS Archival (Common Specification For Archival Information) CS Preservation (Common Specification For Preservation Metadata) CITS GIS CITS SIARD CITS Geo CITS eHealth1 CITS eHealth2 CITS ERMS CITS … CITS (Content Information Type Specifications) Structural Metadata Preservation & Descriptive Metadata Content & Content Related Metadata

Slide 35

Slide 35 text

Archival package Example • Folder structure • Administrative metadata • Geospatial Data requirements • Documentation requirements • Descriptive metadata (ISO 19115, INSPIRE…)

Slide 36

Slide 36 text

How do we make data more machine understandable? Source: https://5stardata.info/ OL – Open License RE – Readable and structured OF – Open Format URI – Unique Resource ID LD – Linked Data

Slide 37

Slide 37 text

Content Information Type Specification Specifications for documenting archival packages

Slide 38

Slide 38 text

Metadata Interoperability Linked Data Descriptive Preservation Structural

Slide 39

Slide 39 text

• Original representation vs open format representation • Possible multiple types of metadata • Standardised (INSPIRE, ISO 19115...) • Storing data for AI, linked data…. Supporting some AI friendly formats Borders_GeoDCAT Borders_GeoSPARQL Borders_CSV (WKT)

Slide 40

Slide 40 text

Where you can find the Specifications? • www.dilcis.eu • https://github.com/DILCISBoard/

Slide 41

Slide 41 text

eArchiving key activities • CORE Activities • Capacity Building • Support Activities • Synergy Building

Slide 42

Slide 42 text

42 The eArchiving Initiative: the five Activities

Slide 43

Slide 43 text

Task CORE.1 Specifications Maintenance and Knowledge Base • Sub-Task CORE.1.1 Specifications • Sub-Task CORE.1.2 Knowledge Base (includes Reference Architecture and Maturity Model) • Sub Task CORE.1.3 Release Management CORE Key tasks

Slide 44

Slide 44 text

CORE Key tasks Task CORE.2 Creation of new eArchiving specifications: Engineering1 3D CITS (3D Content Information Type Specification (CITS), liaising with Digital Cultural Heritage Data Space, 4CH) 44 https://lotar- international.org/

Slide 45

Slide 45 text

Preserving 3D Objects. 3D model of the Villa of Oplontis Martin Blazeby, POCOS_Vol_1.PDF

Slide 46

Slide 46 text

eArchiving Reference Architecture

Slide 47

Slide 47 text

• Version 2.0 published in 2024 • Online version available at the E-ARK Knowledge Centre • ArchiMate model can be downloaded from the online version • https://kc.dlmforum.eu/earchiving-ra20/ eArchiving Reference Architecture v2.0

Slide 48

Slide 48 text

• EAG Archiving by Design working group • DUTO model by the Dutch National Archives Reference Architecture – Archiving by Design

Slide 49

Slide 49 text

Reference Architecture – Maturity Model

Slide 50

Slide 50 text

• Task CORE.3 Validation Testing Tools and Software libraries, related to the eArchiving specifications • Task CORE.4 Online publication. Covers all the relevant information about the specifications and the other services, on the dedicated section on the Digital strategy website via both Drupal and the CNECT-specific CMS, the “newsroom” • Task CORE.5 Standardisation and interoperability Engagement Other CORE Key tasks https://digital-strategy.ec.europa.eu/en/activities/earchiving

Slide 51

Slide 51 text

https://github.com/E-ARK-Software E-ARK Software (not a hosting Service)

Slide 52

Slide 52 text

• The eArchiving Conformance Seal is intended for digital or electronic archives, and associated solution and service providers, as a sign of quality of digital archiving; long term preservation; and data management; following standards and best practices. • https://seal.e-ark-foundation.eu/ eArchiving Conformance Seal

Slide 53

Slide 53 text

Access Pre- Ingest Ingest Storage View 53 E-ARK specification validation is now available in two flavours: try it on the web or download from GitHub. How do you know if an Information Package conforms to E-ARK specifications?

Slide 54

Slide 54 text

How do you know if a piece of software is eArchiving conformant Access Pre- Ingest Ingest Storage View 54

Slide 55

Slide 55 text

Training • Webinars • Stand-Alone training • eArchiving Curriculum User Capacity Building • Promotion of sustainable access to data • Use of eArchiving models Transnational Specialist Networks • Domain specific conferences • User groups Capacity Building

Slide 56

Slide 56 text

eSignature and eIDAS Data Spaces eHealth and eDelivery Developing synergies with Digital Europe Programme’s activities

Slide 57

Slide 57 text

• Support for existing users • Onboarding Support activity: support@e-ark-foundation.eu

Slide 58

Slide 58 text

Reference implementations

Slide 59

Slide 59 text

Who is using/interested in eArchiving so far?

Slide 60

Slide 60 text

USERS: • Denmark (DNA) • Sweden (Package structure) • Czechia • Slovenia (included in legislation) • Croatia (eKultura Project) • Switzerland (SIARD) • Nederland • … ADOPTING: • Hungary • Spain • Portugal • … Who is using eArchiving so far

Slide 61

Slide 61 text

TAXUD

Slide 62

Slide 62 text

No content

Slide 63

Slide 63 text

What is next?

Slide 64

Slide 64 text

• Core E-ARK Specifications >CEN Standard • Machine understandable packages • HBIM Content Type specification • Academic Curriculum • Engaging more communities Plans for the future

Slide 65

Slide 65 text

Core benefits of adopting eArchiving Extended possibilities for collaboration in tool development Shared development = less cost for each individual partner Digital preservation is affordable for everyone!

Slide 66

Slide 66 text

66 Go here first! support@e-ark-foundation.eu

Slide 67

Slide 67 text

Questions? © European Union 2020 Unless otherwise noted the reuse of this presentation is authorised under the CC BY 4.0 license. For any use or reproduction of elements that are not owned by the EU, permission may need to be sought directly from the respective right holders. Slide xx: element concerned, source: e.g. Fotolia.com; Slide xx: element concerned, source: e.g. iStock.com support@e-ark-foundation.eu gregor@geoarh.si