Upgrade to Pro — share decks privately, control downloads, hide ads and more …

Archive Forge - ORD Hackathon

loleg
January 20, 2021

Archive Forge - ORD Hackathon

Pitch of an ORD Hackathon 2021 project based on Challenge #5. This presentation was authored by Damien Jeannerat with contributions from myself and Giuseppe Peronato.

For details visit https://github.com/gperonato/archive-forge

loleg

January 20, 2021
Tweet

More Decks by loleg

Other Decks in Education

Transcript

  1. ORD Hackaton 2021 5. Web-based platform to build FAIR structured

    archive files e e e e e e e e e e e e ZIP FAIR Damien Jeannerat Oleg Lavrovsky Giuseppe Peronato
  2. ORD Hackaton 2021 Holiday souvenir Who was there + drop

    picture Name, Firstname Visits Place, museum, … Jeannerat, Damien localize on Map + Create zip file… drop document Accept, .png, .jpg create .png, in any case store as https://schema.org/person Sign in with ORCID Some input are optional others are mandatory If image, extract text (OCR) If .docx document, convert into .rtf When conditions are fulfilled… Store to Zenodo Allows to upload to … save as … Create Linked data
  3. ORD Hackaton 2021 Archive forge with linked data {spectrum {Instrument…}

    {Date} {Ref.. {…} } },{ } e (20) e {compound {Name…} {Formula} {INCHI… },{ } Tools (.js or .py ?) to extract metadata for each type of file
  4. ORD Hackaton 2021 Archive forge with linked data {spectrum {Instrument…}

    {Date} {Ref.. {…} } },{ } e (20) e {compound {Name…} {Formula} {INCHI… },{ } {Assignment {Spectrum} {compound} {confidence} {Validation} e Combination of objects
  5. ORD Hackaton 2021 Archive forge with linked data {spectrum {Instrument…}

    {Date} {Ref.. {…} } },{ } e (20) e {compound {Name…} {Formula} {INCHI… },{ } {Assignment {Spectrum} {compound} {confidence} {Validation} e {Author {ORCID} {Institution} {ORCID} } {Reference {DOI} }
  6. ORD Hackaton 2021 1) Front end (Oleg) 2) Back end

    (Guiseppe) -metadata -linked data -manifest file
  7. ORD Hackaton 2021 Conclusions Working (or about to work?) -Form,

    drop area, basic check -Ingestion (check file/folder content, format conversion) -Metadata data generation Future work -ZIP file generation -Linked data generation -Move from back to front end -Simplification for re-use in other fields -Upload (Zenodo, University repositories, Olos, …) -ORCHID login Requires more work -Rules for linked data, RO-create? (SWITCH?) Linked data inside