Upgrade to Pro — share decks privately, control downloads, hide ads and more …

Archive Forge - ORD Hackathon

loleg
January 20, 2021

Archive Forge - ORD Hackathon

Pitch of an ORD Hackathon 2021 project based on Challenge #5. This presentation was authored by Damien Jeannerat with contributions from myself and Giuseppe Peronato.

For details visit https://github.com/gperonato/archive-forge

loleg

January 20, 2021
Tweet

More Decks by loleg

Other Decks in Education

Transcript

  1. ORD Hackaton 2021
    5. Web-based platform to build
    FAIR structured archive files
    e
    e e e
    e
    e e e e
    e
    e
    e
    ZIP
    FAIR
    Damien Jeannerat
    Oleg Lavrovsky
    Giuseppe Peronato

    View Slide

  2. ORD Hackaton 2021
    Holiday souvenir
    Who was there
    +
    drop picture
    Name, Firstname
    Visits
    Place, museum, …
    Jeannerat, Damien
    localize on Map
    +
    Create zip file…
    drop document
    Accept, .png, .jpg
    create .png, in any case
    store as https://schema.org/person
    Sign in with ORCID
    Some input are optional others are mandatory
    If image, extract text (OCR)
    If .docx document, convert into .rtf
    When conditions are fulfilled…
    Store to Zenodo
    Allows to upload to …
    save as …
    Create Linked data

    View Slide

  3. ORD Hackaton 2021
    Structure determination by Nuclear Magnetic Resonnace
    Vanilin

    View Slide

  4. ORD Hackaton 2021
    Archive forge with linked data
    {spectrum
    {Instrument…}
    {Date}
    {Ref..
    {…}
    }
    },{
    }
    e
    (20)
    e
    {compound
    {Name…}
    {Formula}
    {INCHI…
    },{
    }
    Tools (.js or .py ?)
    to extract metadata for each type of file

    View Slide

  5. ORD Hackaton 2021
    Archive forge with linked data
    {spectrum
    {Instrument…}
    {Date}
    {Ref..
    {…}
    }
    },{
    }
    e
    (20)
    e
    {compound
    {Name…}
    {Formula}
    {INCHI…
    },{
    }
    {Assignment
    {Spectrum}
    {compound}
    {confidence}
    {Validation}
    e
    Combination of objects

    View Slide

  6. ORD Hackaton 2021
    Archive forge with linked data
    {spectrum
    {Instrument…}
    {Date}
    {Ref..
    {…}
    }
    },{
    }
    e
    (20)
    e
    {compound
    {Name…}
    {Formula}
    {INCHI…
    },{
    }
    {Assignment
    {Spectrum}
    {compound}
    {confidence}
    {Validation}
    e
    {Author
    {ORCID}
    {Institution}
    {ORCID}
    }
    {Reference
    {DOI}
    }

    View Slide

  7. ORD Hackaton 2021
    1) Front end (Oleg)
    2) Back end (Guiseppe)

    View Slide

  8. ORD Hackaton 2021
    1) Front end (Oleg)
    2) Back end (Guiseppe)
    -metadata
    -linked data
    -manifest file

    View Slide

  9. ORD Hackaton 2021
    https://github.com/gperonato/archive-forge

    View Slide

  10. ORD Hackaton 2021
    Conclusions
    Working (or about to work?)
    -Form, drop area, basic check
    -Ingestion (check file/folder content, format conversion)
    -Metadata data generation
    Future work
    -ZIP file generation
    -Linked data generation
    -Move from back to front end
    -Simplification for re-use in other fields
    -Upload (Zenodo, University repositories, Olos, …)
    -ORCHID login
    Requires more work
    -Rules for linked data, RO-create? (SWITCH?)
    Linked data
    inside

    View Slide