Slide 1

Slide 1 text

Occurrence  data  at  the  INBO   Opening  up  our  data  publica5on  workflow   Peter  Desmet  &  Dimitri  Brosens  

Slide 2

Slide 2 text

No content

Slide 3

Slide 3 text

Biodiversity  research   Suppor5ng  policy,  nature  management  &   interna5onal  repor5ng  

Slide 4

Slide 4 text

Monitoring   Resul5ng  in  lots  of  occurrence  data  

Slide 5

Slide 5 text

Human  observa5ons   Going  back  20  years  and  more  

Slide 6

Slide 6 text

Machine  observa5ons   Mainly  via  lifewatch.inbo.be  

Slide 7

Slide 7 text

Publishing  data   Since  2011,  through  data.inbo.be/ipt  

Slide 8

Slide 8 text

15  datasets   4.6  million  observa5ons  

Slide 9

Slide 9 text

No content

Slide 10

Slide 10 text

Open  data   CC0  +  Norms  for  data  use  

Slide 11

Slide 11 text

Our  publica5on  workflow   Is  not  rocket  science  

Slide 12

Slide 12 text

Create  GitHub  repository   github.com/LifeWatchINBO  

Slide 13

Slide 13 text

Create  test  resource   On  our  sandbox  IPT  

Slide 14

Slide 14 text

Create  Darwin  Core  view   All  our  data  are  (imported)  on  SQL  Server  

Slide 15

Slide 15 text

Write  metadata   As  a  data  paper,  in  Markdown  

Slide 16

Slide 16 text

Document  issues   For  data  and  metadata  

Slide 17

Slide 17 text

Resolve  data  issues   Google  Refine  >  Database  or  mapping  

Slide 18

Slide 18 text

Review  metadata   With  final  approval  by  the  researchers  

Slide 19

Slide 19 text

Migrate  to  produc5on   Copy/pas5ng  

Slide 20

Slide 20 text

Publish  &  register   And  relax…   Tripel  Karmeliet  Beer  ©  Johan  Martens,  licensed  under  CC  BY-­‐NC-­‐SA  2.0  

Slide 21

Slide 21 text

Issues  with  terms   Some  of  which  got  resolved  at  this  mee5ng  

Slide 22

Slide 22 text

How  to  populate  terms?   Defini5ons  ≠  guidelines  (by  design)  

Slide 23

Slide 23 text

taxonID    scien5ficNameID      acceptedNameUsageID        parentNameUsageID          originalNameUsageID            nameAccordingToID              namePublishedInID                taxonConceptID                acceptedNameUsage                  parentNameUsage                    originalNameUsage                      nameAccordingTo                        namePublishedIn                          namePublishedInYear   Too  many  op5ons   E.g.  taxon  terms,  ID  vs  code  

Slide 24

Slide 24 text

Specific  uses  cases   We  need  guidelines!  

Slide 25

Slide 25 text

Metadata  issues  

Slide 26

Slide 26 text

What  is  important?   IPT  vs  GBIF  page  vs  data  paper  

Slide 27

Slide 27 text

Abstract   Contact   Associa5on   Coverage   Methodology   References   Rights/Purpose/Add.   Project   Download   Links   Cita5on   Coverage   Keywords   Associa5on   Project   Methodology   Cita5on   References   Purpose/Rights/Add.   Links   Contact   Abstract   Keywords   IPT  metadata  editor   IPT  resource  page  

Slide 28

Slide 28 text

Coverage   Keywords   Associa5on   Project   Methodology   Cita5on   References   Purpose/Rights/Add.   Links   Contact   Abstract   Coverage   Abstract   Purpose   Addi5onal   Contact   Project   Methodology   Assoc.   References   Cita5on/Rights   ? Links   Download   IPT  metadata  editor   GBIF  resource  page  

Slide 29

Slide 29 text

Coverage   Keywords   Associa5on   Project   Methodology   Cita5on   References   Purpose/Rights/Add.   Links   Contact   Abstract   Abstract   Keywords   Download   Purpose   Coverage   Technical/Rights   Addi5onal   Methodology   Project   Associa5on   References   Contact/Cita5on   Authors   IPT  metadata  editor   Data  paper  

Slide 30

Slide 30 text

Too  many  op5ons   Creator  /  Maintainer  

Slide 31

Slide 31 text

People   Basic  metadata,  associated     par5es,  project  personnel  

Slide 32

Slide 32 text

Study  area  descrip5on,  design     descrip5on,  study  area  extent  

Slide 33

Slide 33 text

We’d  also  like  to  add   Technical  info,  publica5on  interval,  norms  

Slide 34

Slide 34 text

Are  we  hiing  the  limita5ons  of   EML?  

Slide 35

Slide 35 text

Rethink  and  document   We  need  guidelines!  

Slide 36

Slide 36 text

Ways  forward  

Slide 37

Slide 37 text

Collabora5ve  guidelines   Contribute,  fork  &  merge  them  on  GitHub  

Slide 38

Slide 38 text

github.com/LifeWatchINBO/data-­‐publica5on-­‐guidelines  

Slide 39

Slide 39 text

github.com/tdwg/dwc-­‐guidelines  

Slide 40

Slide 40 text

Bejer  integra5on   Between  GitHub  &  IPT  

Slide 41

Slide 41 text

Plug  &  Play  cleaning   APIs  via  Google  Refine  extensions  

Slide 42

Slide 42 text

Collabora5ve  tagging   This  dataset  is  useful  for  this  

Slide 43

Slide 43 text

ckan.org  

Slide 44

Slide 44 text

cer5ficates.theodi.org  

Slide 45

Slide 45 text

Feedback  from  users   Are  we  on  the  right  track?  

Slide 46

Slide 46 text

Thanks!   @peterdesmet   @dimibro   @LifeWatchINBO