Upgrade to Pro — share decks privately, control downloads, hide ads and more …

Entification: The Route to 'Useful' Library Data

De87c771ade1b486944471caee227739?s=47 SWIB14
December 03, 2014

Entification: The Route to 'Useful' Library Data

Presenter: Richard Wallis (OCLC)

Abstract:
Linked Data is all about identifying 'things' then describing them and their relationships in a web of other 'things', or entities. Many library linked data initiatives have focused on directly transforming records into RDF with little linking between the shared concepts captured within those records or to external authoritative representations of the same things. The British Library, with a linked data version of the British National Bibliography, was an early pioneer in attempting to model real world entities as a foundation for their data model. Similar research within OCLC, that led to the release of entities as open linked data from WorldCat.org, such as Works, has demonstrated the benefits of such an approach. It also demonstrates that there is much more than record-by-record format conversion required to successfully achieve a web of real world entities. Significant data mining processes, the open availability of authoritative data hubs (such as VIAF, FAST, Library of Congress), and the use of flexible and widely accepted vocabularies, all play a necessary part in this success. Richard will explore some of the issues and benefits of creating library data as descriptions of real world entities, and share some insights into the processes required and their results.

De87c771ade1b486944471caee227739?s=128

SWIB14

December 03, 2014
Tweet

Transcript

  1. Entification:  The  Route   to  Useful  Library  Data Richard  Wallis

      Technology  Evangelist   @rjw
  2. None
  3. None
  4. None
  5. None
  6. None
  7. None
  8. 6 excellent SWIB's

  9. 6 excellent SWIB's Many great LD Projects So today …..

    Where are we on the web?
  10. Where are we on the web?

  11. None
  12. None
  13. None
  14. None
  15. None
  16. None
  17. None
  18. None
  19. Irrelevant! Invisible on the web!

  20. Irrelevant! Invisible on the web!

  21. None
  22. A  global,  nonprofit  library  cooperative 16,737  members  in  109  countries

      17  offices   5  data  centers
  23. •322  million  cataloging  records   •2.1  billion  holdings   •17.3

     million  e-­‐resources  in  the  WorldCat  knowledge  base   •Nearly  2,000  e-­‐content  collections   •1.5  billion  items  in  WorldCat  Discovery,  including:   • 297  million  peer-­‐reviewed  articles   • 41  million  digital  items   • 33  million  pieces  of  evaluative  content   • 35  million  archival  materials   • 8  million  open-­‐access  items   • and  much  more… Comprehensive  and  global
  24. Comprehensive  and  global 253  million   books 16  million
 e-­‐books

    12  million
 serials 12  million
 visual  materials 7  million
 musical  scores 4  million
 maps English:   83  million German:   25  million French:   18  million Spanish:   8.2  million Chinese:   5.1  million Italian:   3.4  million Dutch:   3.3  million Japanese:   2.9  million Russian:   2.9  million Danish   2.1  million Swedish:   1.9  million Portuguese:   1.3  million Just  a  few  of  
 the  dozens  of
 types  of  content: Some  of  the  485  languages  represented…  
  25. •322  million  cataloging  records   •2.1  billion  holdings   •17.3

     million  e-­‐resources  in  the  WorldCat  knowledge  base   •Nearly  2,000  e-­‐content  collections   •1.5  billion  items  in  WorldCat  Discovery,  including:   • 297  million  peer-­‐reviewed  articles   • 41  million  digital  items   • 33  million  pieces  of  evaluative  content   • 35  million  archival  materials   • 8  million  open-­‐access  items   • and  much  more… Comprehensive  and  global
  26. ≈  2  Billion     Records

  27. ≈  2  Billion     Records 0   01261nam  a22002411

     4500   01   303   005   00000000000000.0   008   990716m19091912enkb  b  000  0  eng   035   __    |9  (DLC)  24002676   906   __    |a  0    |b  ibc    |c  orignew    |d  u    |e  ocip    |f  19    |g  y-­‐gencatlg   955   __    |a  CATALOGER:  This  record,  imported  under  99-­‐219818  duplicated  24-­‐2676   on  PREMARC;  I  have  changed  LCCN  to  that  LCCN,  removed  copy  cataloging  characteristics,  and   deleted  the  PREMARC  record;  please  do  as  NEW  INPUT  and  complete  this  record  based  on  the  item   in  hand;  submit  item  for  selection;  if  retained,  add  as  a  copy.  ta05  07-­‐16-­‐99   __    |a    24002676    |z    99219818   _    |a  DLC    |c  DLC    |a  DA760    |b  .B88  1909   a  Brown,  Peter  Hume,    |d  1849-­‐1918.   ory  of  Scotland,    |c  by  P.  Hume  Brown.   e,    |b  University  Press,    |c  1909-­‐12.   art  fold.)    |c  20  cm.   series   ,  p.  455-­‐464;  v.  3,  p.  [435]-­‐444.   -­‐v.  2.  From  the  accession  of  Mary   o  the  disruption,  1843.  
  28. None
  29. Structured  Data  Objectives

  30. Structured  Data  Objectives • Linking  with  hubs  of  authority  on

     the  web
  31. Structured  Data  Objectives • Linking  with  hubs  of  authority  on

     the  web • viaf.org  –  persons
  32. Structured  Data  Objectives • Linking  with  hubs  of  authority  on

     the  web • viaf.org  –  persons • Library  of  congress  –  subjects
  33. Structured  Data  Objectives • Linking  with  hubs  of  authority  on

     the  web • viaf.org  –  persons • Library  of  congress  –  subjects • Dewey.info  –  classifications
  34. Structured  Data  Objectives • Linking  with  hubs  of  authority  on

     the  web • viaf.org  –  persons • Library  of  congress  –  subjects • Dewey.info  –  classifications • Dbpedia  –  most  things
  35. Structured  Data  Objectives • Linking  with  hubs  of  authority  on

     the  web • viaf.org  –  persons • Library  of  congress  –  subjects • Dewey.info  –  classifications • Dbpedia  –  most  things • Widely  distributed  &  understood
  36. Structured  Data  Objectives • Linking  with  hubs  of  authority  on

     the  web • viaf.org  –  persons • Library  of  congress  –  subjects • Dewey.info  –  classifications • Dbpedia  –  most  things • Widely  distributed  &  understood • Standard  data  access  patterns
  37. Structured  Data  Objectives • Linking  with  hubs  of  authority  on

     the  web • viaf.org  –  persons • Library  of  congress  –  subjects • Dewey.info  –  classifications • Dbpedia  –  most  things • Widely  distributed  &  understood • Standard  data  access  patterns • Common  vocabularies
  38. Structured  Data  Objectives • Linking  with  hubs  of  authority  on

     the  web • viaf.org  –  persons • Library  of  congress  –  subjects • Dewey.info  –  classifications • Dbpedia  –  most  things • Widely  distributed  &  understood • Standard  data  access  patterns • Common  vocabularies • Visibility  in  search  engines
  39. Structured  Data  Objectives • Linking  with  hubs  of  authority  on

     the  web • viaf.org  –  persons • Library  of  congress  –  subjects • Dewey.info  –  classifications • Dbpedia  –  most  things • Widely  distributed  &  understood • Standard  data  access  patterns • Common  vocabularies • Visibility  in  search  engines Conclusions
  40. Structured  Data  Objectives • Linking  with  hubs  of  authority  on

     the  web • viaf.org  –  persons • Library  of  congress  –  subjects • Dewey.info  –  classifications • Dbpedia  –  most  things • Widely  distributed  &  understood • Standard  data  access  patterns • Common  vocabularies • Visibility  in  search  engines Conclusions • Linked  Data
  41. Structured  Data  Objectives • Linking  with  hubs  of  authority  on

     the  web • viaf.org  –  persons • Library  of  congress  –  subjects • Dewey.info  –  classifications • Dbpedia  –  most  things • Widely  distributed  &  understood • Standard  data  access  patterns • Common  vocabularies • Visibility  in  search  engines Conclusions • Linked  Data • RDF  –  RDFa,  RDF/XML,  JSON-­‐LD,  Turtle,  nTriples
  42. Structured  Data  Objectives • Linking  with  hubs  of  authority  on

     the  web • viaf.org  –  persons • Library  of  congress  –  subjects • Dewey.info  –  classifications • Dbpedia  –  most  things • Widely  distributed  &  understood • Standard  data  access  patterns • Common  vocabularies • Visibility  in  search  engines Conclusions • Linked  Data • RDF  –  RDFa,  RDF/XML,  JSON-­‐LD,  Turtle,  nTriples • Canonical  URIs
  43. Structured  Data  Objectives • Linking  with  hubs  of  authority  on

     the  web • viaf.org  –  persons • Library  of  congress  –  subjects • Dewey.info  –  classifications • Dbpedia  –  most  things • Widely  distributed  &  understood • Standard  data  access  patterns • Common  vocabularies • Visibility  in  search  engines Conclusions • Linked  Data • RDF  –  RDFa,  RDF/XML,  JSON-­‐LD,  Turtle,  nTriples • Canonical  URIs • Schema.org
  44. Structured  Data  Objectives • Linking  with  hubs  of  authority  on

     the  web • viaf.org  –  persons • Library  of  congress  –  subjects • Dewey.info  –  classifications • Dbpedia  –  most  things • Widely  distributed  &  understood • Standard  data  access  patterns • Common  vocabularies • Visibility  in  search  engines Conclusions • Linked  Data • RDF  –  RDFa,  RDF/XML,  JSON-­‐LD,  Turtle,  nTriples • Canonical  URIs • Schema.org • Backed  and  recognized  by  Google,  Bing,  Yahoo!,  Yandex
  45. Structured  Data  Objectives • Linking  with  hubs  of  authority  on

     the  web • viaf.org  –  persons • Library  of  congress  –  subjects • Dewey.info  –  classifications • Dbpedia  –  most  things • Widely  distributed  &  understood • Standard  data  access  patterns • Common  vocabularies • Visibility  in  search  engines Conclusions • Linked  Data • RDF  –  RDFa,  RDF/XML,  JSON-­‐LD,  Turtle,  nTriples • Canonical  URIs • Schema.org • Backed  and  recognized  by  Google,  Bing,  Yahoo!,  Yandex • Widely  adopted  &  understood  –  20%  of  web
  46. Structured  Data  Objectives • Linking  with  hubs  of  authority  on

     the  web • viaf.org  –  persons • Library  of  congress  –  subjects • Dewey.info  –  classifications • Dbpedia  –  most  things • Widely  distributed  &  understood • Standard  data  access  patterns • Common  vocabularies • Visibility  in  search  engines Conclusions • Linked  Data • RDF  –  RDFa,  RDF/XML,  JSON-­‐LD,  Turtle,  nTriples • Canonical  URIs • Schema.org • Backed  and  recognized  by  Google,  Bing,  Yahoo!,  Yandex • Widely  adopted  &  understood  –  20%  of  web fairly  obvious y
  47. Structured  Data  Objectives • Linking  with  hubs  of  authority  on

     the  web • viaf.org  –  persons • Library  of  congress  –  subjects • Dewey.info  –  classifications • Dbpedia  –  most  things • Widely  distributed  &  understood • Standard  data  access  patterns • Common  vocabularies • Visibility  in  search  engines Conclusions • Linked  Data • RDF  –  RDFa,  RDF/XML,  JSON-­‐LD,  Turtle,  nTriples • Canonical  URIs • Schema.org • Backed  and  recognized  by  Google,  Bing,  Yahoo!,  Yandex • Widely  adopted  &  understood  –  20%  of  web fairly  obvious y
  48. None
  49. Introducing   Linked  Data Phase  1

  50. Introducing   Linked  Data Phase  1 • First  mine  the

     data
  51. Introducing   Linked  Data Phase  1 • First  mine  the

     data • Records  held  in  Marc • Identify  the  entities
  52. Introducing   Linked  Data Phase  1 • First  mine  the

     data • Records  held  in  Marc • Identify  the  entities • Person,  Organization,  CreativeWork,  etc.
  53. Introducing   Linked  Data Phase  1 • First  mine  the

     data • Records  held  in  Marc • Identify  the  entities • Person,  Organization,  CreativeWork,  etc. • Match  strings  to  things
  54. Introducing   Linked  Data Phase  1 • First  mine  the

     data • Records  held  in  Marc • Identify  the  entities • Person,  Organization,  CreativeWork,  etc. • Match  strings  to  things • People/Organization  names  –  viaf.org,  etc
  55. Introducing   Linked  Data Phase  1 • First  mine  the

     data • Records  held  in  Marc • Identify  the  entities • Person,  Organization,  CreativeWork,  etc. • Match  strings  to  things • People/Organization  names  –  viaf.org,  etc • Subjects  –  Library  of  Congress
  56. Introducing   Linked  Data Phase  1 • First  mine  the

     data • Records  held  in  Marc • Identify  the  entities • Person,  Organization,  CreativeWork,  etc. • Match  strings  to  things • People/Organization  names  –  viaf.org,  etc • Subjects  –  Library  of  Congress
  57. Introducing   Linked  Data Phase  1 • First  mine  the

     data • Records  held  in  Marc • Identify  the  entities • Person,  Organization,  CreativeWork,  etc. • Match  strings  to  things • People/Organization  names  –  viaf.org,  etc • Subjects  –  Library  of  Congress Phase  2
  58. Introducing   Linked  Data Phase  1 • First  mine  the

     data • Records  held  in  Marc • Identify  the  entities • Person,  Organization,  CreativeWork,  etc. • Match  strings  to  things • People/Organization  names  –  viaf.org,  etc • Subjects  –  Library  of  Congress Phase  2 • Model  what  is  of  interest  to  the  Web
  59. Introducing   Linked  Data Phase  1 • First  mine  the

     data • Records  held  in  Marc • Identify  the  entities • Person,  Organization,  CreativeWork,  etc. • Match  strings  to  things • People/Organization  names  –  viaf.org,  etc • Subjects  –  Library  of  Congress Phase  2 • Model  what  is  of  interest  to  the  Web • All  our  data  is  important  to  us
  60. Introducing   Linked  Data Phase  1 • First  mine  the

     data • Records  held  in  Marc • Identify  the  entities • Person,  Organization,  CreativeWork,  etc. • Match  strings  to  things • People/Organization  names  –  viaf.org,  etc • Subjects  –  Library  of  Congress Phase  2 • Model  what  is  of  interest  to  the  Web • All  our  data  is  important  to  us • What  will  draw  people  to  our  resources?
  61. Introducing   Linked  Data Phase  1 • First  mine  the

     data • Records  held  in  Marc • Identify  the  entities • Person,  Organization,  CreativeWork,  etc. • Match  strings  to  things • People/Organization  names  –  viaf.org,  etc • Subjects  –  Library  of  Congress Phase  2 • Model  what  is  of  interest  to  the  Web • All  our  data  is  important  to  us • What  will  draw  people  to  our  resources? • Share  the  way  the  web  does
  62. Introducing   Linked  Data Phase  1 • First  mine  the

     data • Records  held  in  Marc • Identify  the  entities • Person,  Organization,  CreativeWork,  etc. • Match  strings  to  things • People/Organization  names  –  viaf.org,  etc • Subjects  –  Library  of  Congress Phase  2 • Model  what  is  of  interest  to  the  Web • All  our  data  is  important  to  us • What  will  draw  people  to  our  resources? • Share  the  way  the  web  does • Linked  Data
  63. Introducing   Linked  Data Phase  1 • First  mine  the

     data • Records  held  in  Marc • Identify  the  entities • Person,  Organization,  CreativeWork,  etc. • Match  strings  to  things • People/Organization  names  –  viaf.org,  etc • Subjects  –  Library  of  Congress Phase  2 • Model  what  is  of  interest  to  the  Web • All  our  data  is  important  to  us • What  will  draw  people  to  our  resources? • Share  the  way  the  web  does • Linked  Data • Schema.org
  64. Introducing   Linked  Data Phase  1 • First  mine  the

     data • Records  held  in  Marc • Identify  the  entities • Person,  Organization,  CreativeWork,  etc. • Match  strings  to  things • People/Organization  names  –  viaf.org,  etc • Subjects  –  Library  of  Congress Phase  2 • Model  what  is  of  interest  to  the  Web • All  our  data  is  important  to  us • What  will  draw  people  to  our  resources? • Share  the  way  the  web  does • Linked  Data • Schema.org Phase  3
  65. Introducing   Linked  Data Phase  1 • First  mine  the

     data • Records  held  in  Marc • Identify  the  entities • Person,  Organization,  CreativeWork,  etc. • Match  strings  to  things • People/Organization  names  –  viaf.org,  etc • Subjects  –  Library  of  Congress Phase  2 • Model  what  is  of  interest  to  the  Web • All  our  data  is  important  to  us • What  will  draw  people  to  our  resources? • Share  the  way  the  web  does • Linked  Data • Schema.org Phase  3 -­‐      Try  it  out!
  66. None
  67. None
  68. edition author location holding date  of  publication classification publisher title

    source ISBN library  data:
 stored  as  records
  69. edition author location holding date  of  publication classification publisher title

    source ISBN author location holding classification publisher person place object concept organization work library  data:
 stored  as  records title
  70. person place object concept organization work

  71. person place object concept organization work

  72. None
  73. Google  Knowledge  Graph

  74. Knowledge  cards  for  libraries? Google  Knowledge  Graph

  75. person place object concept organization work Commercial  data  stored  as

     entities
  76. person place object concept organization work Commercial  data  stored  as

     entities
  77. person place object concept organization work Commercial  data  stored  as

     entities
  78. person place object concept organization work Commercial  data  stored  as

     entities FRBR: Work/Expression FRBR: Manifestation
  79. • Knowledge  cards   • Fixes  problem  of  “representative  record”

      • It’s  what  users  expect  in  discovery Entities  and  library  workflows:
 Discovery
  80. • Improve  data  quality   – Cascading  updates   •

    A  new  approach  to  cataloging   – Point  and  click  cataloging   – Managing  entities  instead  of  managing  records   • Consistent  with  RDA Entities  and  library  workflows:
 Cataloging
  81. Günter  Grass Born:  16  October  1927
 Gdańsk,  Poland   German

     novelist,  poet,   playwright,  illustrator,   graphic  artist,  sculptor  and   recipient  of  the  1999  Nobel   Prize  in  Literature.   Works Subjects Quotes Find  Günter  Grass  works  at:
 Libraries  near  me  |  Online  Retailers Germany  |  German  literature  |  Historical  fiction
 War  stories  |  Black  humor  |  Fantasy “Even  bad  books  are  books  and  therefore  sacred.”—The  Tin   Drum
  82. Günter  Grass Born:  16  October  1927
 Gdańsk,  Poland   German

     novelist,  poet,   playwright,  illustrator,   graphic  artist,  sculptor  and   recipient  of  the  1999  Nobel   Prize  in  Literature.   Works Subjects Quotes Find  Günter  Grass  works  at:
 Libraries  near  me  |  Online  Retailers Germany  |  German  literature  |  Historical  fiction
 War  stories  |  Black  humor  |  Fantasy “Even  bad  books  are  books  and  therefore  sacred.”—The  Tin   Drum MARC  RECORD
  83. Entities  and  library  workflows:
 Cataloging The  Tin  Drum Summary:  Acclaimed

     as  the  greatest   German  novel  since  the  end  of  World  War  II.     The  Tin  Drum  is  the  story  of  thirty  year  old   Oskar  Matzerath  who  has  lived  through  the   long  Nazi  nightmare  and  who  is  being  held  in   a  mental  institution. Subjects Borrowing  Options   Ebooks  |  Printed  Books  |  Audio  Books   Other  Languages   ! Germany  -­‐  History  |  German  literature  |  Political  fiction
  84. Entities  and  library  workflows:
 Cataloging The  Tin  Drum Summary:  Acclaimed

     as  the  greatest   German  novel  since  the  end  of  World  War  II.     The  Tin  Drum  is  the  story  of  thirty  year  old   Oskar  Matzerath  who  has  lived  through  the   long  Nazi  nightmare  and  who  is  being  held  in   a  mental  institution. Subjects Borrowing  Options   Ebooks  |  Printed  Books  |  Audio  Books   Other  Languages   ! Germany  -­‐  History  |  German  literature  |  Political  fiction Person  Editor  
  85. Entities  and  library  workflows:
 Cataloging The  Tin  Drum Summary:  Acclaimed

     as  the  greatest   German  novel  since  the  end  of  World  War  II.     The  Tin  Drum  is  the  story  of  thirty  year  old   Oskar  Matzerath  who  has  lived  through  the   long  Nazi  nightmare  and  who  is  being  held  in   a  mental  institution. Subjects Borrowing  Options   Ebooks  |  Printed  Books  |  Audio  Books   Other  Languages   ! Germany  -­‐  History  |  German  literature  |  Political  fiction Person  Editor   Person  Authority   • Günter  Grass   • SameAs  ➾   <dbpedia.org>
  86. Entities  and  library  workflows:
 Cataloging The  Tin  Drum Summary:  Acclaimed

     as  the  greatest   German  novel  since  the  end  of  World  War  II.     The  Tin  Drum  is  the  story  of  thirty  year  old   Oskar  Matzerath  who  has  lived  through  the   long  Nazi  nightmare  and  who  is  being  held  in   a  mental  institution. Subjects Borrowing  Options   Ebooks  |  Printed  Books  |  Audio  Books   Other  Languages   ! Germany  -­‐  History  |  German  literature  |  Political  fiction Person  Editor   Person  Authority   • Günter  Grass   • SameAs  ➾   <dbpedia.org> LC  NAF
  87. Entities  and  library  workflows:
 Cataloging The  Tin  Drum Summary:  Acclaimed

     as  the  greatest   German  novel  since  the  end  of  World  War  II.     The  Tin  Drum  is  the  story  of  thirty  year  old   Oskar  Matzerath  who  has  lived  through  the   long  Nazi  nightmare  and  who  is  being  held  in   a  mental  institution. Subjects Borrowing  Options   Ebooks  |  Printed  Books  |  Audio  Books   Other  Languages   ! Germany  -­‐  History  |  German  literature  |  Political  fiction
  88. Entities  and  library  workflows:
 Cataloging The  Tin  Drum Summary:  Acclaimed

     as  the  greatest   German  novel  since  the  end  of  World  War  II.     The  Tin  Drum  is  the  story  of  thirty  year  old   Oskar  Matzerath  who  has  lived  through  the   long  Nazi  nightmare  and  who  is  being  held  in   a  mental  institution. Subjects Borrowing  Options   Ebooks  |  Printed  Books  |  Audio  Books   Other  Languages   ! Germany  -­‐  History  |  German  literature  |  Political  fiction Work  Editor  
  89. Entities  and  library  workflows:
 Cataloging The  Tin  Drum Summary:  Acclaimed

     as  the  greatest   German  novel  since  the  end  of  World  War  II.     The  Tin  Drum  is  the  story  of  thirty  year  old   Oskar  Matzerath  who  has  lived  through  the   long  Nazi  nightmare  and  who  is  being  held  in   a  mental  institution. Subjects Borrowing  Options   Ebooks  |  Printed  Books  |  Audio  Books   Other  Languages   ! Germany  -­‐  History  |  German  literature  |  Political  fiction Work  Editor   Work  Authority   • Title   • Creator  ➾  <Person>
  90. Expression Manifestation  1   Manifestation  2 Manifestation  3 Entities  and

     library  workflows:
 Cataloging The  Tin  Drum Summary:  Acclaimed  as  the  greatest   German  novel  since  the  end  of  World  War  II.     The  Tin  Drum  is  the  story  of  thirty  year  old   Oskar  Matzerath  who  has  lived  through  the   long  Nazi  nightmare  and  who  is  being  held  in   a  mental  institution. Subjects Borrowing  Options   Ebooks  |  Printed  Books  |  Audio  Books   Other  Languages   ! Germany  -­‐  History  |  German  literature  |  Political  fiction Work  Editor   Cascading   Updates Work  Authority   • Title   • Creator  ➾  <Person>
  91. Expression Manifestation  1   Manifestation  2 Manifestation  3 Entities  and

     library  workflows:
 Cataloging The  Tin  Drum Summary:  Acclaimed  as  the  greatest   German  novel  since  the  end  of  World  War  II.     The  Tin  Drum  is  the  story  of  thirty  year  old   Oskar  Matzerath  who  has  lived  through  the   long  Nazi  nightmare  and  who  is  being  held  in   a  mental  institution. Subjects Borrowing  Options   Ebooks  |  Printed  Books  |  Audio  Books   Other  Languages   ! Germany  -­‐  History  |  German  literature  |  Political  fiction Work  Editor   MARC21   Output Cascading   Updates Work  Authority   • Title   • Creator  ➾  <Person>
  92. Entities  and  library  workflows:
 Other  applications

  93. • Interlibrary  Loan   – Borrow  at  the  Work  level

      – Manifestations/Items  are  detail Entities  and  library  workflows:
 Other  applications
  94. • Interlibrary  Loan   – Borrow  at  the  Work  level

      – Manifestations/Items  are  detail • Analytics   – Fixes  “holdings  scatter”  across  manifestations Entities  and  library  workflows:
 Other  applications
  95. • Interlibrary  Loan   – Borrow  at  the  Work  level

      – Manifestations/Items  are  detail • Analytics   – Fixes  “holdings  scatter”  across  manifestations • Other  third  party  applications     – Discovery  API  exposes  library  entities Entities  and  library  workflows:
 Other  applications
  96. Entities  and  library  workflows:
 Web  exposure

  97. • Be  found  on  the  web Entities  and  library  workflows:


    Web  exposure
  98. • Be  found  on  the  web • Connect  your  users

     to  unique  content Entities  and  library  workflows:
 Web  exposure
  99. • Be  found  on  the  web • Connect  your  users

     to  unique  content • What  the  web  requires  for  web  exposure – Aggregation – Familiar  structures – A  Network  of  Links – Entity  Identifiers Entities  and  library  workflows:
 Web  exposure
  100. WorldCat  Entities

  101. WorldCat  Entities Works

  102. WorldCat  Entities Works • 197+  million  Work  descriptions  and  URIs

      • Schema.org   • RDF  Data  formats  –  RDF/XML,  Turtle,  Triples,  JSON-­‐LD   • Links  to  WorldCat  manifestations   • Links  to  Dewey,  LCSH,  LCNAF,  VIAF,  FAST   • Open  Data  license   • Released  April  2014
  103. None
  104. None
  105. None
  106. None
  107. None
  108. Work Place Concept Event Organization Person Cataloging Integration  with  the

     web Cascading  updates More  options Intuitive  searching Bibliographic  Entities
  109. Bibliographic  Entities -­‐  In  the  Web  of  Data

  110. person place object concept organization work Bibliographic  Entities -­‐  In

     the  Web  of  Data
  111. person place object concept organization work Entity  Based  Data  Architecture…

    Bibliographic  Entities -­‐  In  the  Web  of  Data
  112. What  About  Linked  Data? https://www.flickr.com/photos/rileyroxx/169900848/

  113. What  About  Linked  Data? What  about  Linked   Data? Yeah!

     –  what  about   Linked  Data? I  thought  Linked  Data   was  going  to  solve  all   our  problems! https://www.flickr.com/photos/rileyroxx/169900848/
  114. Linked  Data

  115. • A  Technology • Standard  on  the  Web  –  RDF,

     URIs,  Vocabularies • Identifying  and  Linking  resources  on  the  Web • Important  powerful  enabling  technology Linked  Data
  116. • A  Technology • Standard  on  the  Web  –  RDF,

     URIs,  Vocabularies • Identifying  and  Linking  resources  on  the  Web • Important  powerful  enabling  technology • But  only  a  technology… Linked  Data
  117. • A  Technology • Standard  on  the  Web  –  RDF,

     URIs,  Vocabularies • Identifying  and  Linking  resources  on  the  Web • Important  powerful  enabling  technology • But  only  a  technology…      for  the  systems  folks  to  worry  about Linked  Data
  118. • A  Technology • Standard  on  the  Web  –  RDF,

     URIs,  Vocabularies • Identifying  and  Linking  resources  on  the  Web • Important  powerful  enabling  technology • But  only  a  technology…      for  the  systems  folks  to  worry  about • Real  benefits  flow  from: Entity  Based  Data  Architecture   Linked  Data
  119. • A  Technology • Standard  on  the  Web  –  RDF,

     URIs,  Vocabularies • Identifying  and  Linking  resources  on  the  Web • Important  powerful  enabling  technology • But  only  a  technology…      for  the  systems  folks  to  worry  about • Real  benefits  flow  from: Entity  Based  Data  Architecture   Powered  by  Linked  Data Linked  Data
  120. Entity  Based  Data  

  121. Entity  Based  Data  on  the  Web  

  122. Entity  Based  Data  on  the  Web  

  123. Entity  Based  Data  on  the  Web  

  124. Entity  Based  Data  on  the  Web  

  125. Entity  Based  Data  on  the  Web   Knowledge  Graphs

  126. Entity  Based  Data  on  the  Web   Knowledge  Graphs

  127. Entity  Based  Data  on  the  Web   Knowledge  Graphs

  128. Entity  Based  Data  on  the  Web   Knowledge  Graphs

  129. Entity  Based  Data  on  the  Web   Knowledge  Graphs Semantic

     Search
  130. Why  is  the  Web  Adopting  This?

  131. Why  is  the  Web  Adopting  This? (Entities,  Semantic  Search,  Linked

     Data)    
  132. • To  get  their  products/resources  in  front  of  users -

    Next  Generation  SEO Why  is  the  Web  Adopting  This? (Entities,  Semantic  Search,  Linked  Data)    
  133. • To  get  their  products/resources  in  front  of  users -

    Next  Generation  SEO • It  is  a  shared  approach  from  the  Search  Engines - But  not  exclusive  to  them Why  is  the  Web  Adopting  This? (Entities,  Semantic  Search,  Linked  Data)    
  134. None
  135. Syndication  For  Libraries

  136. Syndication  For  Libraries • Aggregate  to  a  central  site  

    - National  Library,  Consortia,  WorldCat.org
  137. Syndication  For  Libraries • Aggregate  to  a  central  site  

    - National  Library,  Consortia,  WorldCat.org • Publish  details  to  syndication  partners - WorldCat:  Amazon,  Google  Scholar,  EasyBib,   EBSCO,  OpenLibrary,  FindMyLibrary,  RedLaser,   Yelp,  …
  138. Syndication  For  Libraries • Aggregate  to  a  central  site  

    - National  Library,  Consortia,  WorldCat.org • Publish  details  to  syndication  partners - WorldCat:  Amazon,  Google  Scholar,  EasyBib,   EBSCO,  OpenLibrary,  FindMyLibrary,  RedLaser,   Yelp,  … • Links  from  aggregators  to  individual  libraries - Find  in  a  library  
  139. Syndication  For  Libraries • Aggregate  to  a  central  site  

    - National  Library,  Consortia,  WorldCat.org • Publish  details  to  syndication  partners - WorldCat:  Amazon,  Google  Scholar,  EasyBib,   EBSCO,  OpenLibrary,  FindMyLibrary,  RedLaser,   Yelp,  … • Links  from  aggregators  to  individual  libraries - Find  in  a  library                                                                          WorldCat   Syndication   Year  to  June  2013     ! • 77  Million  referrals  from  partners   • 8.7  Million  click-­‐through  to  libraries  
  140. Syndication  For  Libraries • Aggregate  to  a  central  site  

    - National  Library,  Consortia,  WorldCat.org • Publish  details  to  syndication  partners - WorldCat:  Amazon,  Google  Scholar,  EasyBib,   EBSCO,  OpenLibrary,  FindMyLibrary,  RedLaser,   Yelp,  … • Links  from  aggregators  to  individual  libraries - Find  in  a  library   Today
  141. Syndication  For  Libraries • Aggregate  to  a  central  site  

    - National  Library,  Consortia,  WorldCat.org • Publish  details  to  syndication  partners - WorldCat:  Amazon,  Google  Scholar,  EasyBib,   EBSCO,  OpenLibrary,  FindMyLibrary,  RedLaser,   Yelp,  … • Links  from  aggregators  to  individual  libraries - Find  in  a  library   Today Efficient  but  indirect
  142. Syndication  For  Libraries

  143. Syndication  For  Libraries On  the  Web  of  Data

  144. Syndication  For  Libraries • Individual  libraries  publish  resource  data On

     the  Web  of  Data
  145. Syndication  For  Libraries • Individual  libraries  publish  resource  data -

    Linked  Data  in  local  discovery  interfaces On  the  Web  of  Data
  146. Syndication  For  Libraries • Individual  libraries  publish  resource  data -

    Linked  Data  in  local  discovery  interfaces - Links  to  authoritative  hubs  –  set  global  context • VIAF,  LoC,  WorldCat  Works,  … On  the  Web  of  Data
  147. Syndication  For  Libraries • Individual  libraries  publish  resource  data -

    Linked  Data  in  local  discovery  interfaces - Links  to  authoritative  hubs  –  set  global  context • VIAF,  LoC,  WorldCat  Works,  … • Recognized  and  identified  on  the  Web On  the  Web  of  Data
  148. Syndication  For  Libraries • Individual  libraries  publish  resource  data -

    Linked  Data  in  local  discovery  interfaces - Links  to  authoritative  hubs  –  set  global  context • VIAF,  LoC,  WorldCat  Works,  … • Recognized  and  identified  on  the  Web - Google,  Bing,  Yahoo!,  Yandex,  etc. - Where  our  users  are! On  the  Web  of  Data
  149. Syndication  For  Libraries • Individual  libraries  publish  resource  data -

    Linked  Data  in  local  discovery  interfaces - Links  to  authoritative  hubs  –  set  global  context • VIAF,  LoC,  WorldCat  Works,  … • Recognized  and  identified  on  the  Web - Google,  Bing,  Yahoo!,  Yandex,  etc. - Where  our  users  are! • Users  referred  directly  to  resources  in  the  library   On  the  Web  of  Data
  150. Syndication  For  Libraries • Individual  libraries  publish  resource  data -

    Linked  Data  in  local  discovery  interfaces - Links  to  authoritative  hubs  –  set  global  context • VIAF,  LoC,  WorldCat  Works,  … • Recognized  and  identified  on  the  Web - Google,  Bing,  Yahoo!,  Yandex,  etc. - Where  our  users  are! • Users  referred  directly  to  resources  in  the  library   On  the  Web  of  Data Direct  and  Effective
  151. Syndication  For  Libraries • Individual  libraries  publish  resource  data -

    Linked  Data  in  local  discovery  interfaces - Links  to  authoritative  hubs  –  set  global  context • VIAF,  LoC,  WorldCat  Works,  … • Recognized  and  identified  on  the  Web - Google,  Bing,  Yahoo!,  Yandex,  etc. - Where  our  users  are! • Users  referred  directly  to  resources  in  the  library   On  the  Web  of  Data Direct  and  Effective
  152. Syndication  For  Libraries • Individual  libraries  publish  resource  data -

    Linked  Data  in  local  discovery  interfaces - Links  to  authoritative  hubs  –  set  global  context • VIAF,  LoC,  WorldCat  Works,  … • Recognized  and  identified  on  the  Web - Google,  Bing,  Yahoo!,  Yandex,  etc. - Where  our  users  are! • Users  referred  directly  to  resources  in  the  library   On  the  Web  of  Data Direct  and  Effective
  153. Tell them about our resources… http://www.flickr.com/photos/boston_public_library/6220572487

  154. Tell them about our resources… …using their language and methods

    http://www.flickr.com/photos/boston_public_library/6220572487
  155. None
  156. Has it been a waste of time?

  157. Has it been a waste of time? Am I saying….

  158. Has it been a waste of time? Throw it all

    away and use… Am I saying….
  159. Has it been a waste of time? Throw it all

    away and use… No!…. Am I saying….
  160. Don't throw the baby out with the bath water

  161. Don't throw the baby out with the bath water Sharing

     for  discovery  on  the  web   As  part  of  a  Global  Knowledge  Graph
  162. Don't throw the baby out with the bath water Sharing

     for  discovery  on  the  web   As  part  of  a  Global  Knowledge  Graph Entification
  163. Don't throw the baby out with the bath water Sharing

     for  discovery  on  the  web   As  part  of  a  Global  Knowledge  Graph Entification Identify  the  entities  in  your  data:
  164. Don't throw the baby out with the bath water Sharing

     for  discovery  on  the  web   As  part  of  a  Global  Knowledge  Graph Entification Identify  the  entities  in  your  data: • Describe  them  using  appropriate  vocabularies
  165. Don't throw the baby out with the bath water Sharing

     for  discovery  on  the  web   As  part  of  a  Global  Knowledge  Graph Entification Identify  the  entities  in  your  data: • Describe  them  using  appropriate  vocabularies • Describe  the  relationships  between  them
  166. Don't throw the baby out with the bath water Sharing

     for  discovery  on  the  web   As  part  of  a  Global  Knowledge  Graph Entification Identify  the  entities  in  your  data: • Describe  them  using  appropriate  vocabularies • Describe  the  relationships  between  them • Place  them  in  a  global  context  –  link  to  authoritative  hubs
  167. Don't throw the baby out with the bath water Sharing

     for  discovery  on  the  web   As  part  of  a  Global  Knowledge  Graph Entification Identify  the  entities  in  your  data: • Describe  them  using  appropriate  vocabularies • Describe  the  relationships  between  them • Place  them  in  a  global  context  –  link  to  authoritative  hubs • Liberate  the  value  in  your  data!
  168. Don't throw the baby out with the bath water Sharing

     for  discovery  on  the  web   As  part  of  a  Global  Knowledge  Graph Entification Identify  the  entities  in  your  data: • Describe  them  using  appropriate  vocabularies • Describe  the  relationships  between  them • Place  them  in  a  global  context  –  link  to  authoritative  hubs • Liberate  the  value  in  your  data! ENTITIES
  169. Don't throw the baby out with the bath water Sharing

     for  discovery  on  the  web   As  part  of  a  Global  Knowledge  Graph Entification Identify  the  entities  in  your  data: • Describe  them  using  appropriate  vocabularies • Describe  the  relationships  between  them • Place  them  in  a  global  context  –  link  to  authoritative  hubs • Liberate  the  value  in  your  data! And  also  share  them  on  the  web  –  a  job  for  Schema.org ENTITIES
  170. None
  171. Why  Catalog?

  172. Why  Catalog? So  we  can  find  things

  173. Why  Catalog? So  we  can  find  things Why  Share  on

     the  Web?
  174. Why  Catalog? So  we  can  find  things Why  Share  on

     the  Web? So  today’s  users   can  find  our  things
  175. OCLC  Entity  Based  Data  Strategy

  176. OCLC  Entity  Based  Data  Strategy ✓VIAF,  ISNI,  FAST  Publish  Linked

     Data 2010
  177. OCLC  Entity  Based  Data  Strategy ✓VIAF,  ISNI,  FAST  Publish  Linked

     Data ✓WorldCat.org  Linked  Data  Release  –  using  Schema.org 2012   2010
  178. OCLC  Entity  Based  Data  Strategy ✓VIAF,  ISNI,  FAST  Publish  Linked

     Data ✓WorldCat.org  Linked  Data  Release  –  using  Schema.org ✓Data  mining  of  WorldCat  resources 2012   2013 2010
  179. OCLC  Entity  Based  Data  Strategy ✓VIAF,  ISNI,  FAST  Publish  Linked

     Data ✓WorldCat.org  Linked  Data  Release  –  using  Schema.org ✓Data  mining  of  WorldCat  resources ✓WorldCat  Works  Released  –  using  Schema.org 2012   2014 2013 2010
  180. OCLC  Entity  Based  Data  Strategy ✓VIAF,  ISNI,  FAST  Publish  Linked

     Data ✓WorldCat.org  Linked  Data  Release  –  using  Schema.org ✓Data  mining  of  WorldCat  resources ✓WorldCat  Works  Released  –  using  Schema.org ✓Schema.org  added  to  VIAF  RDF 2012   2014 2013 2010
  181. OCLC  Entity  Based  Data  Strategy ✓VIAF,  ISNI,  FAST  Publish  Linked

     Data ✓WorldCat.org  Linked  Data  Release  –  using  Schema.org ✓Data  mining  of  WorldCat  resources ✓WorldCat  Works  Released  –  using  Schema.org ✓Schema.org  added  to  VIAF  RDF 2012   2014 2013 2010
  182. OCLC  Entity  Based  Data  Strategy ✓VIAF,  ISNI,  FAST  Publish  Linked

     Data ✓WorldCat.org  Linked  Data  Release  –  using  Schema.org ✓Data  mining  of  WorldCat  resources ✓WorldCat  Works  Released  –  using  Schema.org ✓Schema.org  added  to  VIAF  RDF 2012   2014 ➢Application  Integration ➢WorldCat  Discovery ➢Analytics ➢Discovery  API ➢Cataloging 2015 2013 2010
  183. OCLC  Entity  Based  Data  Strategy ✓VIAF,  ISNI,  FAST  Publish  Linked

     Data ✓WorldCat.org  Linked  Data  Release  –  using  Schema.org ✓Data  mining  of  WorldCat  resources ✓WorldCat  Works  Released  –  using  Schema.org ✓Schema.org  added  to  VIAF  RDF 2012   2014 ➢Application  Integration ➢WorldCat  Discovery ➢Analytics ➢Discovery  API ➢Cataloging 2015 ➢More  Entities  Released ➢Person ➢Organization ➢Event ➢Concept 2013 2010
  184. OCLC  Entity  Based  Data  Strategy ✓VIAF,  ISNI,  FAST  Publish  Linked

     Data ✓WorldCat.org  Linked  Data  Release  –  using  Schema.org ✓Data  mining  of  WorldCat  resources ✓WorldCat  Works  Released  –  using  Schema.org ✓Schema.org  added  to  VIAF  RDF 2012   2014 ➢Application  Integration ➢WorldCat  Discovery ➢Analytics ➢Discovery  API ➢Cataloging 2015 ➢More  Entities  Released ➢Person ➢Organization ➢Event ➢Concept ➢New  Products               ➢New  Services 2013 2016 2010
  185. OCLC  Entity  Based  Data  Strategy ✓VIAF,  ISNI,  FAST  Publish  Linked

     Data ✓WorldCat.org  Linked  Data  Release  –  using  Schema.org ✓Data  mining  of  WorldCat  resources ✓WorldCat  Works  Released  –  using  Schema.org ✓Schema.org  added  to  VIAF  RDF 2012   2014 ➢Application  Integration ➢WorldCat  Discovery ➢Analytics ➢Discovery  API ➢Cataloging 2015 ➢More  Entities  Released ➢Person ➢Organization ➢Event ➢Concept ➢New  Products               ➢Continuing  Evangelism ➢New  Services ➢Continuing  Innovation 2013 2016 2010
  186. OCLC  Entity  Based  Data  Strategy ✓VIAF,  ISNI,  FAST  Publish  Linked

     Data ✓WorldCat.org  Linked  Data  Release  –  using  Schema.org ✓Data  mining  of  WorldCat  resources ✓WorldCat  Works  Released  –  using  Schema.org ✓Schema.org  added  to  VIAF  RDF 2012   2014 ➢Application  Integration ➢WorldCat  Discovery ➢Analytics ➢Discovery  API ➢Cataloging 2015 ➢More  Entities  Released ➢Person ➢Organization ➢Event ➢Concept ➢New  Products               ➢Continuing  Evangelism ➢New  Services ➢Continuing  Innovation 2013 2016 2010
  187. OCLC  Entity  Based  Data  Strategy ✓VIAF,  ISNI,  FAST  Publish  Linked

     Data ✓WorldCat.org  Linked  Data  Release  –  using  Schema.org ✓Data  mining  of  WorldCat  resources ✓WorldCat  Works  Released  –  using  Schema.org ✓Schema.org  added  to  VIAF  RDF 2012   2014 ➢Application  Integration ➢WorldCat  Discovery ➢Analytics ➢Discovery  API ➢Cataloging 2015 ➢More  Entities  Released ➢Person ➢Organization ➢Event ➢Concept ➢New  Products               ➢Continuing  Evangelism ➢New  Services ➢Continuing  Innovation 2013 2016 2010
  188. OCLC  Entity  Based  Data  Strategy ✓VIAF,  ISNI,  FAST  Publish  Linked

     Data ✓WorldCat.org  Linked  Data  Release  –  using  Schema.org ✓Data  mining  of  WorldCat  resources ✓WorldCat  Works  Released  –  using  Schema.org ✓Schema.org  added  to  VIAF  RDF 2012   2014 ➢Application  Integration ➢WorldCat  Discovery ➢Analytics ➢Discovery  API ➢Cataloging 2015 ➢More  Entities  Released ➢Person ➢Organization ➢Event ➢Concept ➢New  Products               ➢Continuing  Evangelism ➢New  Services ➢Continuing  Innovation 2013 2016 2010
  189. OCLC  Entity  Based  Data  Strategy ✓VIAF,  ISNI,  FAST  Publish  Linked

     Data ✓WorldCat.org  Linked  Data  Release  –  using  Schema.org ✓Data  mining  of  WorldCat  resources ✓WorldCat  Works  Released  –  using  Schema.org ✓Schema.org  added  to  VIAF  RDF 2012   2014 ➢Application  Integration ➢WorldCat  Discovery ➢Analytics ➢Discovery  API ➢Cataloging 2015 ➢More  Entities  Released ➢Person ➢Organization ➢Event ➢Concept ➢New  Products               ➢Continuing  Evangelism ➢New  Services ➢Continuing  Innovation 2013 2016 2010
  190. OCLC  Entity  Based  Data  Strategy ✓VIAF,  ISNI,  FAST  Publish  Linked

     Data ✓WorldCat.org  Linked  Data  Release  –  using  Schema.org ✓Data  mining  of  WorldCat  resources ✓WorldCat  Works  Released  –  using  Schema.org ✓Schema.org  added  to  VIAF  RDF 2012   2014 ➢Application  Integration ➢WorldCat  Discovery ➢Analytics ➢Discovery  API ➢Cataloging 2015 ➢More  Entities  Released ➢Person ➢Organization ➢Event ➢Concept ➢New  Products               ➢Continuing  Evangelism ➢New  Services ➢Continuing  Innovation 2013 2016 2010
  191. OCLC  Entity  Based  Data  Strategy ✓VIAF,  ISNI,  FAST  Publish  Linked

     Data ✓WorldCat.org  Linked  Data  Release  –  using  Schema.org ✓Data  mining  of  WorldCat  resources ✓WorldCat  Works  Released  –  using  Schema.org ✓Schema.org  added  to  VIAF  RDF 2012   2014 ➢Application  Integration ➢WorldCat  Discovery ➢Analytics ➢Discovery  API ➢Cataloging 2015 ➢More  Entities  Released ➢Person ➢Organization ➢Event ➢Concept ➢New  Products               ➢Continuing  Evangelism ➢New  Services ➢Continuing  Innovation 2013 2016 2010
  192. OCLC  Entity  Based  Data  Strategy ✓VIAF,  ISNI,  FAST  Publish  Linked

     Data ✓WorldCat.org  Linked  Data  Release  –  using  Schema.org ✓Data  mining  of  WorldCat  resources ✓WorldCat  Works  Released  –  using  Schema.org ✓Schema.org  added  to  VIAF  RDF 2012   2014 ➢Application  Integration ➢WorldCat  Discovery ➢Analytics ➢Discovery  API ➢Cataloging 2015 ➢More  Entities  Released ➢Person ➢Organization ➢Event ➢Concept ➢New  Products               ➢Continuing  Evangelism ➢New  Services ➢Continuing  Innovation 2013 2016 2010
  193. Where  are  our  users?

  194. Where  are  our  users?

  195. 6 excellent SWIB's ! Many great LD Projects

  196. but 6 excellent SWIB's ! Many great LD Projects

  197. but 6 excellent SWIB's ! Many great LD Projects If

     users  can't  discover  our  resources
  198. but 6 excellent SWIB's ! Many great LD Projects If

     users  can't  discover  our  resources What  is  the  point?
  199. 6 excellent SWIB's ! Many great LD Projects

  200. 6 excellent SWIB's ! Many great LD Projects 5

  201. 6 excellent SWIB's ! Many great LD Projects 7 5

    more y
  202. 6 excellent SWIB's ! Many great LD Projects 7 5

    more y
  203. Entification:  The  Route   to  Useful  Library  Data Richard  Wallis

      Technology  Evangelist   @rjw
  204. Entification:  The  Route   to  Useful  Library  Data Richard  Wallis

      Technology  Evangelist   @rjw http://slideshare.net/rjw