Upgrade to Pro — share decks privately, control downloads, hide ads and more …

Entification: The Route to 'Useful' Library Data

SWIB14
December 03, 2014

Entification: The Route to 'Useful' Library Data

Presenter: Richard Wallis (OCLC)

Abstract:
Linked Data is all about identifying 'things' then describing them and their relationships in a web of other 'things', or entities. Many library linked data initiatives have focused on directly transforming records into RDF with little linking between the shared concepts captured within those records or to external authoritative representations of the same things. The British Library, with a linked data version of the British National Bibliography, was an early pioneer in attempting to model real world entities as a foundation for their data model. Similar research within OCLC, that led to the release of entities as open linked data from WorldCat.org, such as Works, has demonstrated the benefits of such an approach. It also demonstrates that there is much more than record-by-record format conversion required to successfully achieve a web of real world entities. Significant data mining processes, the open availability of authoritative data hubs (such as VIAF, FAST, Library of Congress), and the use of flexible and widely accepted vocabularies, all play a necessary part in this success. Richard will explore some of the issues and benefits of creating library data as descriptions of real world entities, and share some insights into the processes required and their results.

SWIB14

December 03, 2014
Tweet

More Decks by SWIB14

Other Decks in Technology

Transcript

  1. •322  million  cataloging  records   •2.1  billion  holdings   •17.3

     million  e-­‐resources  in  the  WorldCat  knowledge  base   •Nearly  2,000  e-­‐content  collections   •1.5  billion  items  in  WorldCat  Discovery,  including:   • 297  million  peer-­‐reviewed  articles   • 41  million  digital  items   • 33  million  pieces  of  evaluative  content   • 35  million  archival  materials   • 8  million  open-­‐access  items   • and  much  more… Comprehensive  and  global
  2. Comprehensive  and  global 253  million   books 16  million
 e-­‐books

    12  million
 serials 12  million
 visual  materials 7  million
 musical  scores 4  million
 maps English:   83  million German:   25  million French:   18  million Spanish:   8.2  million Chinese:   5.1  million Italian:   3.4  million Dutch:   3.3  million Japanese:   2.9  million Russian:   2.9  million Danish   2.1  million Swedish:   1.9  million Portuguese:   1.3  million Just  a  few  of  
 the  dozens  of
 types  of  content: Some  of  the  485  languages  represented…  
  3. •322  million  cataloging  records   •2.1  billion  holdings   •17.3

     million  e-­‐resources  in  the  WorldCat  knowledge  base   •Nearly  2,000  e-­‐content  collections   •1.5  billion  items  in  WorldCat  Discovery,  including:   • 297  million  peer-­‐reviewed  articles   • 41  million  digital  items   • 33  million  pieces  of  evaluative  content   • 35  million  archival  materials   • 8  million  open-­‐access  items   • and  much  more… Comprehensive  and  global
  4. ≈  2  Billion     Records 0   01261nam  a22002411

     4500   01   303   005   00000000000000.0   008   990716m19091912enkb  b  000  0  eng   035   __    |9  (DLC)  24002676   906   __    |a  0    |b  ibc    |c  orignew    |d  u    |e  ocip    |f  19    |g  y-­‐gencatlg   955   __    |a  CATALOGER:  This  record,  imported  under  99-­‐219818  duplicated  24-­‐2676   on  PREMARC;  I  have  changed  LCCN  to  that  LCCN,  removed  copy  cataloging  characteristics,  and   deleted  the  PREMARC  record;  please  do  as  NEW  INPUT  and  complete  this  record  based  on  the  item   in  hand;  submit  item  for  selection;  if  retained,  add  as  a  copy.  ta05  07-­‐16-­‐99   __    |a    24002676    |z    99219818   _    |a  DLC    |c  DLC    |a  DA760    |b  .B88  1909   a  Brown,  Peter  Hume,    |d  1849-­‐1918.   ory  of  Scotland,    |c  by  P.  Hume  Brown.   e,    |b  University  Press,    |c  1909-­‐12.   art  fold.)    |c  20  cm.   series   ,  p.  455-­‐464;  v.  3,  p.  [435]-­‐444.   -­‐v.  2.  From  the  accession  of  Mary   o  the  disruption,  1843.  
  5. Structured  Data  Objectives • Linking  with  hubs  of  authority  on

     the  web • viaf.org  –  persons • Library  of  congress  –  subjects
  6. Structured  Data  Objectives • Linking  with  hubs  of  authority  on

     the  web • viaf.org  –  persons • Library  of  congress  –  subjects • Dewey.info  –  classifications
  7. Structured  Data  Objectives • Linking  with  hubs  of  authority  on

     the  web • viaf.org  –  persons • Library  of  congress  –  subjects • Dewey.info  –  classifications • Dbpedia  –  most  things
  8. Structured  Data  Objectives • Linking  with  hubs  of  authority  on

     the  web • viaf.org  –  persons • Library  of  congress  –  subjects • Dewey.info  –  classifications • Dbpedia  –  most  things • Widely  distributed  &  understood
  9. Structured  Data  Objectives • Linking  with  hubs  of  authority  on

     the  web • viaf.org  –  persons • Library  of  congress  –  subjects • Dewey.info  –  classifications • Dbpedia  –  most  things • Widely  distributed  &  understood • Standard  data  access  patterns
  10. Structured  Data  Objectives • Linking  with  hubs  of  authority  on

     the  web • viaf.org  –  persons • Library  of  congress  –  subjects • Dewey.info  –  classifications • Dbpedia  –  most  things • Widely  distributed  &  understood • Standard  data  access  patterns • Common  vocabularies
  11. Structured  Data  Objectives • Linking  with  hubs  of  authority  on

     the  web • viaf.org  –  persons • Library  of  congress  –  subjects • Dewey.info  –  classifications • Dbpedia  –  most  things • Widely  distributed  &  understood • Standard  data  access  patterns • Common  vocabularies • Visibility  in  search  engines
  12. Structured  Data  Objectives • Linking  with  hubs  of  authority  on

     the  web • viaf.org  –  persons • Library  of  congress  –  subjects • Dewey.info  –  classifications • Dbpedia  –  most  things • Widely  distributed  &  understood • Standard  data  access  patterns • Common  vocabularies • Visibility  in  search  engines Conclusions
  13. Structured  Data  Objectives • Linking  with  hubs  of  authority  on

     the  web • viaf.org  –  persons • Library  of  congress  –  subjects • Dewey.info  –  classifications • Dbpedia  –  most  things • Widely  distributed  &  understood • Standard  data  access  patterns • Common  vocabularies • Visibility  in  search  engines Conclusions • Linked  Data
  14. Structured  Data  Objectives • Linking  with  hubs  of  authority  on

     the  web • viaf.org  –  persons • Library  of  congress  –  subjects • Dewey.info  –  classifications • Dbpedia  –  most  things • Widely  distributed  &  understood • Standard  data  access  patterns • Common  vocabularies • Visibility  in  search  engines Conclusions • Linked  Data • RDF  –  RDFa,  RDF/XML,  JSON-­‐LD,  Turtle,  nTriples
  15. Structured  Data  Objectives • Linking  with  hubs  of  authority  on

     the  web • viaf.org  –  persons • Library  of  congress  –  subjects • Dewey.info  –  classifications • Dbpedia  –  most  things • Widely  distributed  &  understood • Standard  data  access  patterns • Common  vocabularies • Visibility  in  search  engines Conclusions • Linked  Data • RDF  –  RDFa,  RDF/XML,  JSON-­‐LD,  Turtle,  nTriples • Canonical  URIs
  16. Structured  Data  Objectives • Linking  with  hubs  of  authority  on

     the  web • viaf.org  –  persons • Library  of  congress  –  subjects • Dewey.info  –  classifications • Dbpedia  –  most  things • Widely  distributed  &  understood • Standard  data  access  patterns • Common  vocabularies • Visibility  in  search  engines Conclusions • Linked  Data • RDF  –  RDFa,  RDF/XML,  JSON-­‐LD,  Turtle,  nTriples • Canonical  URIs • Schema.org
  17. Structured  Data  Objectives • Linking  with  hubs  of  authority  on

     the  web • viaf.org  –  persons • Library  of  congress  –  subjects • Dewey.info  –  classifications • Dbpedia  –  most  things • Widely  distributed  &  understood • Standard  data  access  patterns • Common  vocabularies • Visibility  in  search  engines Conclusions • Linked  Data • RDF  –  RDFa,  RDF/XML,  JSON-­‐LD,  Turtle,  nTriples • Canonical  URIs • Schema.org • Backed  and  recognized  by  Google,  Bing,  Yahoo!,  Yandex
  18. Structured  Data  Objectives • Linking  with  hubs  of  authority  on

     the  web • viaf.org  –  persons • Library  of  congress  –  subjects • Dewey.info  –  classifications • Dbpedia  –  most  things • Widely  distributed  &  understood • Standard  data  access  patterns • Common  vocabularies • Visibility  in  search  engines Conclusions • Linked  Data • RDF  –  RDFa,  RDF/XML,  JSON-­‐LD,  Turtle,  nTriples • Canonical  URIs • Schema.org • Backed  and  recognized  by  Google,  Bing,  Yahoo!,  Yandex • Widely  adopted  &  understood  –  20%  of  web
  19. Structured  Data  Objectives • Linking  with  hubs  of  authority  on

     the  web • viaf.org  –  persons • Library  of  congress  –  subjects • Dewey.info  –  classifications • Dbpedia  –  most  things • Widely  distributed  &  understood • Standard  data  access  patterns • Common  vocabularies • Visibility  in  search  engines Conclusions • Linked  Data • RDF  –  RDFa,  RDF/XML,  JSON-­‐LD,  Turtle,  nTriples • Canonical  URIs • Schema.org • Backed  and  recognized  by  Google,  Bing,  Yahoo!,  Yandex • Widely  adopted  &  understood  –  20%  of  web fairly  obvious y
  20. Structured  Data  Objectives • Linking  with  hubs  of  authority  on

     the  web • viaf.org  –  persons • Library  of  congress  –  subjects • Dewey.info  –  classifications • Dbpedia  –  most  things • Widely  distributed  &  understood • Standard  data  access  patterns • Common  vocabularies • Visibility  in  search  engines Conclusions • Linked  Data • RDF  –  RDFa,  RDF/XML,  JSON-­‐LD,  Turtle,  nTriples • Canonical  URIs • Schema.org • Backed  and  recognized  by  Google,  Bing,  Yahoo!,  Yandex • Widely  adopted  &  understood  –  20%  of  web fairly  obvious y
  21. Introducing   Linked  Data Phase  1 • First  mine  the

     data • Records  held  in  Marc • Identify  the  entities
  22. Introducing   Linked  Data Phase  1 • First  mine  the

     data • Records  held  in  Marc • Identify  the  entities • Person,  Organization,  CreativeWork,  etc.
  23. Introducing   Linked  Data Phase  1 • First  mine  the

     data • Records  held  in  Marc • Identify  the  entities • Person,  Organization,  CreativeWork,  etc. • Match  strings  to  things
  24. Introducing   Linked  Data Phase  1 • First  mine  the

     data • Records  held  in  Marc • Identify  the  entities • Person,  Organization,  CreativeWork,  etc. • Match  strings  to  things • People/Organization  names  –  viaf.org,  etc
  25. Introducing   Linked  Data Phase  1 • First  mine  the

     data • Records  held  in  Marc • Identify  the  entities • Person,  Organization,  CreativeWork,  etc. • Match  strings  to  things • People/Organization  names  –  viaf.org,  etc • Subjects  –  Library  of  Congress
  26. Introducing   Linked  Data Phase  1 • First  mine  the

     data • Records  held  in  Marc • Identify  the  entities • Person,  Organization,  CreativeWork,  etc. • Match  strings  to  things • People/Organization  names  –  viaf.org,  etc • Subjects  –  Library  of  Congress
  27. Introducing   Linked  Data Phase  1 • First  mine  the

     data • Records  held  in  Marc • Identify  the  entities • Person,  Organization,  CreativeWork,  etc. • Match  strings  to  things • People/Organization  names  –  viaf.org,  etc • Subjects  –  Library  of  Congress Phase  2
  28. Introducing   Linked  Data Phase  1 • First  mine  the

     data • Records  held  in  Marc • Identify  the  entities • Person,  Organization,  CreativeWork,  etc. • Match  strings  to  things • People/Organization  names  –  viaf.org,  etc • Subjects  –  Library  of  Congress Phase  2 • Model  what  is  of  interest  to  the  Web
  29. Introducing   Linked  Data Phase  1 • First  mine  the

     data • Records  held  in  Marc • Identify  the  entities • Person,  Organization,  CreativeWork,  etc. • Match  strings  to  things • People/Organization  names  –  viaf.org,  etc • Subjects  –  Library  of  Congress Phase  2 • Model  what  is  of  interest  to  the  Web • All  our  data  is  important  to  us
  30. Introducing   Linked  Data Phase  1 • First  mine  the

     data • Records  held  in  Marc • Identify  the  entities • Person,  Organization,  CreativeWork,  etc. • Match  strings  to  things • People/Organization  names  –  viaf.org,  etc • Subjects  –  Library  of  Congress Phase  2 • Model  what  is  of  interest  to  the  Web • All  our  data  is  important  to  us • What  will  draw  people  to  our  resources?
  31. Introducing   Linked  Data Phase  1 • First  mine  the

     data • Records  held  in  Marc • Identify  the  entities • Person,  Organization,  CreativeWork,  etc. • Match  strings  to  things • People/Organization  names  –  viaf.org,  etc • Subjects  –  Library  of  Congress Phase  2 • Model  what  is  of  interest  to  the  Web • All  our  data  is  important  to  us • What  will  draw  people  to  our  resources? • Share  the  way  the  web  does
  32. Introducing   Linked  Data Phase  1 • First  mine  the

     data • Records  held  in  Marc • Identify  the  entities • Person,  Organization,  CreativeWork,  etc. • Match  strings  to  things • People/Organization  names  –  viaf.org,  etc • Subjects  –  Library  of  Congress Phase  2 • Model  what  is  of  interest  to  the  Web • All  our  data  is  important  to  us • What  will  draw  people  to  our  resources? • Share  the  way  the  web  does • Linked  Data
  33. Introducing   Linked  Data Phase  1 • First  mine  the

     data • Records  held  in  Marc • Identify  the  entities • Person,  Organization,  CreativeWork,  etc. • Match  strings  to  things • People/Organization  names  –  viaf.org,  etc • Subjects  –  Library  of  Congress Phase  2 • Model  what  is  of  interest  to  the  Web • All  our  data  is  important  to  us • What  will  draw  people  to  our  resources? • Share  the  way  the  web  does • Linked  Data • Schema.org
  34. Introducing   Linked  Data Phase  1 • First  mine  the

     data • Records  held  in  Marc • Identify  the  entities • Person,  Organization,  CreativeWork,  etc. • Match  strings  to  things • People/Organization  names  –  viaf.org,  etc • Subjects  –  Library  of  Congress Phase  2 • Model  what  is  of  interest  to  the  Web • All  our  data  is  important  to  us • What  will  draw  people  to  our  resources? • Share  the  way  the  web  does • Linked  Data • Schema.org Phase  3
  35. Introducing   Linked  Data Phase  1 • First  mine  the

     data • Records  held  in  Marc • Identify  the  entities • Person,  Organization,  CreativeWork,  etc. • Match  strings  to  things • People/Organization  names  –  viaf.org,  etc • Subjects  –  Library  of  Congress Phase  2 • Model  what  is  of  interest  to  the  Web • All  our  data  is  important  to  us • What  will  draw  people  to  our  resources? • Share  the  way  the  web  does • Linked  Data • Schema.org Phase  3 -­‐      Try  it  out!
  36. edition author location holding date  of  publication classification publisher title

    source ISBN author location holding classification publisher person place object concept organization work library  data:
 stored  as  records title
  37. person place object concept organization work Commercial  data  stored  as

     entities FRBR: Work/Expression FRBR: Manifestation
  38. • Knowledge  cards   • Fixes  problem  of  “representative  record”

      • It’s  what  users  expect  in  discovery Entities  and  library  workflows:
 Discovery
  39. • Improve  data  quality   – Cascading  updates   •

    A  new  approach  to  cataloging   – Point  and  click  cataloging   – Managing  entities  instead  of  managing  records   • Consistent  with  RDA Entities  and  library  workflows:
 Cataloging
  40. Günter  Grass Born:  16  October  1927
 Gdańsk,  Poland   German

     novelist,  poet,   playwright,  illustrator,   graphic  artist,  sculptor  and   recipient  of  the  1999  Nobel   Prize  in  Literature.   Works Subjects Quotes Find  Günter  Grass  works  at:
 Libraries  near  me  |  Online  Retailers Germany  |  German  literature  |  Historical  fiction
 War  stories  |  Black  humor  |  Fantasy “Even  bad  books  are  books  and  therefore  sacred.”—The  Tin   Drum
  41. Günter  Grass Born:  16  October  1927
 Gdańsk,  Poland   German

     novelist,  poet,   playwright,  illustrator,   graphic  artist,  sculptor  and   recipient  of  the  1999  Nobel   Prize  in  Literature.   Works Subjects Quotes Find  Günter  Grass  works  at:
 Libraries  near  me  |  Online  Retailers Germany  |  German  literature  |  Historical  fiction
 War  stories  |  Black  humor  |  Fantasy “Even  bad  books  are  books  and  therefore  sacred.”—The  Tin   Drum MARC  RECORD
  42. Entities  and  library  workflows:
 Cataloging The  Tin  Drum Summary:  Acclaimed

     as  the  greatest   German  novel  since  the  end  of  World  War  II.     The  Tin  Drum  is  the  story  of  thirty  year  old   Oskar  Matzerath  who  has  lived  through  the   long  Nazi  nightmare  and  who  is  being  held  in   a  mental  institution. Subjects Borrowing  Options   Ebooks  |  Printed  Books  |  Audio  Books   Other  Languages   ! Germany  -­‐  History  |  German  literature  |  Political  fiction
  43. Entities  and  library  workflows:
 Cataloging The  Tin  Drum Summary:  Acclaimed

     as  the  greatest   German  novel  since  the  end  of  World  War  II.     The  Tin  Drum  is  the  story  of  thirty  year  old   Oskar  Matzerath  who  has  lived  through  the   long  Nazi  nightmare  and  who  is  being  held  in   a  mental  institution. Subjects Borrowing  Options   Ebooks  |  Printed  Books  |  Audio  Books   Other  Languages   ! Germany  -­‐  History  |  German  literature  |  Political  fiction Person  Editor  
  44. Entities  and  library  workflows:
 Cataloging The  Tin  Drum Summary:  Acclaimed

     as  the  greatest   German  novel  since  the  end  of  World  War  II.     The  Tin  Drum  is  the  story  of  thirty  year  old   Oskar  Matzerath  who  has  lived  through  the   long  Nazi  nightmare  and  who  is  being  held  in   a  mental  institution. Subjects Borrowing  Options   Ebooks  |  Printed  Books  |  Audio  Books   Other  Languages   ! Germany  -­‐  History  |  German  literature  |  Political  fiction Person  Editor   Person  Authority   • Günter  Grass   • SameAs  ➾   <dbpedia.org>
  45. Entities  and  library  workflows:
 Cataloging The  Tin  Drum Summary:  Acclaimed

     as  the  greatest   German  novel  since  the  end  of  World  War  II.     The  Tin  Drum  is  the  story  of  thirty  year  old   Oskar  Matzerath  who  has  lived  through  the   long  Nazi  nightmare  and  who  is  being  held  in   a  mental  institution. Subjects Borrowing  Options   Ebooks  |  Printed  Books  |  Audio  Books   Other  Languages   ! Germany  -­‐  History  |  German  literature  |  Political  fiction Person  Editor   Person  Authority   • Günter  Grass   • SameAs  ➾   <dbpedia.org> LC  NAF
  46. Entities  and  library  workflows:
 Cataloging The  Tin  Drum Summary:  Acclaimed

     as  the  greatest   German  novel  since  the  end  of  World  War  II.     The  Tin  Drum  is  the  story  of  thirty  year  old   Oskar  Matzerath  who  has  lived  through  the   long  Nazi  nightmare  and  who  is  being  held  in   a  mental  institution. Subjects Borrowing  Options   Ebooks  |  Printed  Books  |  Audio  Books   Other  Languages   ! Germany  -­‐  History  |  German  literature  |  Political  fiction
  47. Entities  and  library  workflows:
 Cataloging The  Tin  Drum Summary:  Acclaimed

     as  the  greatest   German  novel  since  the  end  of  World  War  II.     The  Tin  Drum  is  the  story  of  thirty  year  old   Oskar  Matzerath  who  has  lived  through  the   long  Nazi  nightmare  and  who  is  being  held  in   a  mental  institution. Subjects Borrowing  Options   Ebooks  |  Printed  Books  |  Audio  Books   Other  Languages   ! Germany  -­‐  History  |  German  literature  |  Political  fiction Work  Editor  
  48. Entities  and  library  workflows:
 Cataloging The  Tin  Drum Summary:  Acclaimed

     as  the  greatest   German  novel  since  the  end  of  World  War  II.     The  Tin  Drum  is  the  story  of  thirty  year  old   Oskar  Matzerath  who  has  lived  through  the   long  Nazi  nightmare  and  who  is  being  held  in   a  mental  institution. Subjects Borrowing  Options   Ebooks  |  Printed  Books  |  Audio  Books   Other  Languages   ! Germany  -­‐  History  |  German  literature  |  Political  fiction Work  Editor   Work  Authority   • Title   • Creator  ➾  <Person>
  49. Expression Manifestation  1   Manifestation  2 Manifestation  3 Entities  and

     library  workflows:
 Cataloging The  Tin  Drum Summary:  Acclaimed  as  the  greatest   German  novel  since  the  end  of  World  War  II.     The  Tin  Drum  is  the  story  of  thirty  year  old   Oskar  Matzerath  who  has  lived  through  the   long  Nazi  nightmare  and  who  is  being  held  in   a  mental  institution. Subjects Borrowing  Options   Ebooks  |  Printed  Books  |  Audio  Books   Other  Languages   ! Germany  -­‐  History  |  German  literature  |  Political  fiction Work  Editor   Cascading   Updates Work  Authority   • Title   • Creator  ➾  <Person>
  50. Expression Manifestation  1   Manifestation  2 Manifestation  3 Entities  and

     library  workflows:
 Cataloging The  Tin  Drum Summary:  Acclaimed  as  the  greatest   German  novel  since  the  end  of  World  War  II.     The  Tin  Drum  is  the  story  of  thirty  year  old   Oskar  Matzerath  who  has  lived  through  the   long  Nazi  nightmare  and  who  is  being  held  in   a  mental  institution. Subjects Borrowing  Options   Ebooks  |  Printed  Books  |  Audio  Books   Other  Languages   ! Germany  -­‐  History  |  German  literature  |  Political  fiction Work  Editor   MARC21   Output Cascading   Updates Work  Authority   • Title   • Creator  ➾  <Person>
  51. • Interlibrary  Loan   – Borrow  at  the  Work  level

      – Manifestations/Items  are  detail Entities  and  library  workflows:
 Other  applications
  52. • Interlibrary  Loan   – Borrow  at  the  Work  level

      – Manifestations/Items  are  detail • Analytics   – Fixes  “holdings  scatter”  across  manifestations Entities  and  library  workflows:
 Other  applications
  53. • Interlibrary  Loan   – Borrow  at  the  Work  level

      – Manifestations/Items  are  detail • Analytics   – Fixes  “holdings  scatter”  across  manifestations • Other  third  party  applications     – Discovery  API  exposes  library  entities Entities  and  library  workflows:
 Other  applications
  54. • Be  found  on  the  web • Connect  your  users

     to  unique  content Entities  and  library  workflows:
 Web  exposure
  55. • Be  found  on  the  web • Connect  your  users

     to  unique  content • What  the  web  requires  for  web  exposure – Aggregation – Familiar  structures – A  Network  of  Links – Entity  Identifiers Entities  and  library  workflows:
 Web  exposure
  56. WorldCat  Entities Works • 197+  million  Work  descriptions  and  URIs

      • Schema.org   • RDF  Data  formats  –  RDF/XML,  Turtle,  Triples,  JSON-­‐LD   • Links  to  WorldCat  manifestations   • Links  to  Dewey,  LCSH,  LCNAF,  VIAF,  FAST   • Open  Data  license   • Released  April  2014
  57. Work Place Concept Event Organization Person Cataloging Integration  with  the

     web Cascading  updates More  options Intuitive  searching Bibliographic  Entities
  58. person place object concept organization work Entity  Based  Data  Architecture…

    Bibliographic  Entities -­‐  In  the  Web  of  Data
  59. What  About  Linked  Data? What  about  Linked   Data? Yeah!

     –  what  about   Linked  Data? I  thought  Linked  Data   was  going  to  solve  all   our  problems! https://www.flickr.com/photos/rileyroxx/169900848/
  60. • A  Technology • Standard  on  the  Web  –  RDF,

     URIs,  Vocabularies • Identifying  and  Linking  resources  on  the  Web • Important  powerful  enabling  technology Linked  Data
  61. • A  Technology • Standard  on  the  Web  –  RDF,

     URIs,  Vocabularies • Identifying  and  Linking  resources  on  the  Web • Important  powerful  enabling  technology • But  only  a  technology… Linked  Data
  62. • A  Technology • Standard  on  the  Web  –  RDF,

     URIs,  Vocabularies • Identifying  and  Linking  resources  on  the  Web • Important  powerful  enabling  technology • But  only  a  technology…      for  the  systems  folks  to  worry  about Linked  Data
  63. • A  Technology • Standard  on  the  Web  –  RDF,

     URIs,  Vocabularies • Identifying  and  Linking  resources  on  the  Web • Important  powerful  enabling  technology • But  only  a  technology…      for  the  systems  folks  to  worry  about • Real  benefits  flow  from: Entity  Based  Data  Architecture   Linked  Data
  64. • A  Technology • Standard  on  the  Web  –  RDF,

     URIs,  Vocabularies • Identifying  and  Linking  resources  on  the  Web • Important  powerful  enabling  technology • But  only  a  technology…      for  the  systems  folks  to  worry  about • Real  benefits  flow  from: Entity  Based  Data  Architecture   Powered  by  Linked  Data Linked  Data
  65. • To  get  their  products/resources  in  front  of  users -

    Next  Generation  SEO Why  is  the  Web  Adopting  This? (Entities,  Semantic  Search,  Linked  Data)    
  66. • To  get  their  products/resources  in  front  of  users -

    Next  Generation  SEO • It  is  a  shared  approach  from  the  Search  Engines - But  not  exclusive  to  them Why  is  the  Web  Adopting  This? (Entities,  Semantic  Search,  Linked  Data)    
  67. Syndication  For  Libraries • Aggregate  to  a  central  site  

    - National  Library,  Consortia,  WorldCat.org
  68. Syndication  For  Libraries • Aggregate  to  a  central  site  

    - National  Library,  Consortia,  WorldCat.org • Publish  details  to  syndication  partners - WorldCat:  Amazon,  Google  Scholar,  EasyBib,   EBSCO,  OpenLibrary,  FindMyLibrary,  RedLaser,   Yelp,  …
  69. Syndication  For  Libraries • Aggregate  to  a  central  site  

    - National  Library,  Consortia,  WorldCat.org • Publish  details  to  syndication  partners - WorldCat:  Amazon,  Google  Scholar,  EasyBib,   EBSCO,  OpenLibrary,  FindMyLibrary,  RedLaser,   Yelp,  … • Links  from  aggregators  to  individual  libraries - Find  in  a  library  
  70. Syndication  For  Libraries • Aggregate  to  a  central  site  

    - National  Library,  Consortia,  WorldCat.org • Publish  details  to  syndication  partners - WorldCat:  Amazon,  Google  Scholar,  EasyBib,   EBSCO,  OpenLibrary,  FindMyLibrary,  RedLaser,   Yelp,  … • Links  from  aggregators  to  individual  libraries - Find  in  a  library                                                                          WorldCat   Syndication   Year  to  June  2013     ! • 77  Million  referrals  from  partners   • 8.7  Million  click-­‐through  to  libraries  
  71. Syndication  For  Libraries • Aggregate  to  a  central  site  

    - National  Library,  Consortia,  WorldCat.org • Publish  details  to  syndication  partners - WorldCat:  Amazon,  Google  Scholar,  EasyBib,   EBSCO,  OpenLibrary,  FindMyLibrary,  RedLaser,   Yelp,  … • Links  from  aggregators  to  individual  libraries - Find  in  a  library   Today
  72. Syndication  For  Libraries • Aggregate  to  a  central  site  

    - National  Library,  Consortia,  WorldCat.org • Publish  details  to  syndication  partners - WorldCat:  Amazon,  Google  Scholar,  EasyBib,   EBSCO,  OpenLibrary,  FindMyLibrary,  RedLaser,   Yelp,  … • Links  from  aggregators  to  individual  libraries - Find  in  a  library   Today Efficient  but  indirect
  73. Syndication  For  Libraries • Individual  libraries  publish  resource  data -

    Linked  Data  in  local  discovery  interfaces On  the  Web  of  Data
  74. Syndication  For  Libraries • Individual  libraries  publish  resource  data -

    Linked  Data  in  local  discovery  interfaces - Links  to  authoritative  hubs  –  set  global  context • VIAF,  LoC,  WorldCat  Works,  … On  the  Web  of  Data
  75. Syndication  For  Libraries • Individual  libraries  publish  resource  data -

    Linked  Data  in  local  discovery  interfaces - Links  to  authoritative  hubs  –  set  global  context • VIAF,  LoC,  WorldCat  Works,  … • Recognized  and  identified  on  the  Web On  the  Web  of  Data
  76. Syndication  For  Libraries • Individual  libraries  publish  resource  data -

    Linked  Data  in  local  discovery  interfaces - Links  to  authoritative  hubs  –  set  global  context • VIAF,  LoC,  WorldCat  Works,  … • Recognized  and  identified  on  the  Web - Google,  Bing,  Yahoo!,  Yandex,  etc. - Where  our  users  are! On  the  Web  of  Data
  77. Syndication  For  Libraries • Individual  libraries  publish  resource  data -

    Linked  Data  in  local  discovery  interfaces - Links  to  authoritative  hubs  –  set  global  context • VIAF,  LoC,  WorldCat  Works,  … • Recognized  and  identified  on  the  Web - Google,  Bing,  Yahoo!,  Yandex,  etc. - Where  our  users  are! • Users  referred  directly  to  resources  in  the  library   On  the  Web  of  Data
  78. Syndication  For  Libraries • Individual  libraries  publish  resource  data -

    Linked  Data  in  local  discovery  interfaces - Links  to  authoritative  hubs  –  set  global  context • VIAF,  LoC,  WorldCat  Works,  … • Recognized  and  identified  on  the  Web - Google,  Bing,  Yahoo!,  Yandex,  etc. - Where  our  users  are! • Users  referred  directly  to  resources  in  the  library   On  the  Web  of  Data Direct  and  Effective
  79. Syndication  For  Libraries • Individual  libraries  publish  resource  data -

    Linked  Data  in  local  discovery  interfaces - Links  to  authoritative  hubs  –  set  global  context • VIAF,  LoC,  WorldCat  Works,  … • Recognized  and  identified  on  the  Web - Google,  Bing,  Yahoo!,  Yandex,  etc. - Where  our  users  are! • Users  referred  directly  to  resources  in  the  library   On  the  Web  of  Data Direct  and  Effective
  80. Syndication  For  Libraries • Individual  libraries  publish  resource  data -

    Linked  Data  in  local  discovery  interfaces - Links  to  authoritative  hubs  –  set  global  context • VIAF,  LoC,  WorldCat  Works,  … • Recognized  and  identified  on  the  Web - Google,  Bing,  Yahoo!,  Yandex,  etc. - Where  our  users  are! • Users  referred  directly  to  resources  in  the  library   On  the  Web  of  Data Direct  and  Effective
  81. Tell them about our resources… …using their language and methods

    http://www.flickr.com/photos/boston_public_library/6220572487
  82. Has it been a waste of time? Throw it all

    away and use… Am I saying….
  83. Has it been a waste of time? Throw it all

    away and use… No!…. Am I saying….
  84. Don't throw the baby out with the bath water Sharing

     for  discovery  on  the  web   As  part  of  a  Global  Knowledge  Graph
  85. Don't throw the baby out with the bath water Sharing

     for  discovery  on  the  web   As  part  of  a  Global  Knowledge  Graph Entification
  86. Don't throw the baby out with the bath water Sharing

     for  discovery  on  the  web   As  part  of  a  Global  Knowledge  Graph Entification Identify  the  entities  in  your  data:
  87. Don't throw the baby out with the bath water Sharing

     for  discovery  on  the  web   As  part  of  a  Global  Knowledge  Graph Entification Identify  the  entities  in  your  data: • Describe  them  using  appropriate  vocabularies
  88. Don't throw the baby out with the bath water Sharing

     for  discovery  on  the  web   As  part  of  a  Global  Knowledge  Graph Entification Identify  the  entities  in  your  data: • Describe  them  using  appropriate  vocabularies • Describe  the  relationships  between  them
  89. Don't throw the baby out with the bath water Sharing

     for  discovery  on  the  web   As  part  of  a  Global  Knowledge  Graph Entification Identify  the  entities  in  your  data: • Describe  them  using  appropriate  vocabularies • Describe  the  relationships  between  them • Place  them  in  a  global  context  –  link  to  authoritative  hubs
  90. Don't throw the baby out with the bath water Sharing

     for  discovery  on  the  web   As  part  of  a  Global  Knowledge  Graph Entification Identify  the  entities  in  your  data: • Describe  them  using  appropriate  vocabularies • Describe  the  relationships  between  them • Place  them  in  a  global  context  –  link  to  authoritative  hubs • Liberate  the  value  in  your  data!
  91. Don't throw the baby out with the bath water Sharing

     for  discovery  on  the  web   As  part  of  a  Global  Knowledge  Graph Entification Identify  the  entities  in  your  data: • Describe  them  using  appropriate  vocabularies • Describe  the  relationships  between  them • Place  them  in  a  global  context  –  link  to  authoritative  hubs • Liberate  the  value  in  your  data! ENTITIES
  92. Don't throw the baby out with the bath water Sharing

     for  discovery  on  the  web   As  part  of  a  Global  Knowledge  Graph Entification Identify  the  entities  in  your  data: • Describe  them  using  appropriate  vocabularies • Describe  the  relationships  between  them • Place  them  in  a  global  context  –  link  to  authoritative  hubs • Liberate  the  value  in  your  data! And  also  share  them  on  the  web  –  a  job  for  Schema.org ENTITIES
  93. Why  Catalog? So  we  can  find  things Why  Share  on

     the  Web? So  today’s  users   can  find  our  things
  94. OCLC  Entity  Based  Data  Strategy ✓VIAF,  ISNI,  FAST  Publish  Linked

     Data ✓WorldCat.org  Linked  Data  Release  –  using  Schema.org 2012   2010
  95. OCLC  Entity  Based  Data  Strategy ✓VIAF,  ISNI,  FAST  Publish  Linked

     Data ✓WorldCat.org  Linked  Data  Release  –  using  Schema.org ✓Data  mining  of  WorldCat  resources 2012   2013 2010
  96. OCLC  Entity  Based  Data  Strategy ✓VIAF,  ISNI,  FAST  Publish  Linked

     Data ✓WorldCat.org  Linked  Data  Release  –  using  Schema.org ✓Data  mining  of  WorldCat  resources ✓WorldCat  Works  Released  –  using  Schema.org 2012   2014 2013 2010
  97. OCLC  Entity  Based  Data  Strategy ✓VIAF,  ISNI,  FAST  Publish  Linked

     Data ✓WorldCat.org  Linked  Data  Release  –  using  Schema.org ✓Data  mining  of  WorldCat  resources ✓WorldCat  Works  Released  –  using  Schema.org ✓Schema.org  added  to  VIAF  RDF 2012   2014 2013 2010
  98. OCLC  Entity  Based  Data  Strategy ✓VIAF,  ISNI,  FAST  Publish  Linked

     Data ✓WorldCat.org  Linked  Data  Release  –  using  Schema.org ✓Data  mining  of  WorldCat  resources ✓WorldCat  Works  Released  –  using  Schema.org ✓Schema.org  added  to  VIAF  RDF 2012   2014 2013 2010
  99. OCLC  Entity  Based  Data  Strategy ✓VIAF,  ISNI,  FAST  Publish  Linked

     Data ✓WorldCat.org  Linked  Data  Release  –  using  Schema.org ✓Data  mining  of  WorldCat  resources ✓WorldCat  Works  Released  –  using  Schema.org ✓Schema.org  added  to  VIAF  RDF 2012   2014 ➢Application  Integration ➢WorldCat  Discovery ➢Analytics ➢Discovery  API ➢Cataloging 2015 2013 2010
  100. OCLC  Entity  Based  Data  Strategy ✓VIAF,  ISNI,  FAST  Publish  Linked

     Data ✓WorldCat.org  Linked  Data  Release  –  using  Schema.org ✓Data  mining  of  WorldCat  resources ✓WorldCat  Works  Released  –  using  Schema.org ✓Schema.org  added  to  VIAF  RDF 2012   2014 ➢Application  Integration ➢WorldCat  Discovery ➢Analytics ➢Discovery  API ➢Cataloging 2015 ➢More  Entities  Released ➢Person ➢Organization ➢Event ➢Concept 2013 2010
  101. OCLC  Entity  Based  Data  Strategy ✓VIAF,  ISNI,  FAST  Publish  Linked

     Data ✓WorldCat.org  Linked  Data  Release  –  using  Schema.org ✓Data  mining  of  WorldCat  resources ✓WorldCat  Works  Released  –  using  Schema.org ✓Schema.org  added  to  VIAF  RDF 2012   2014 ➢Application  Integration ➢WorldCat  Discovery ➢Analytics ➢Discovery  API ➢Cataloging 2015 ➢More  Entities  Released ➢Person ➢Organization ➢Event ➢Concept ➢New  Products               ➢New  Services 2013 2016 2010
  102. OCLC  Entity  Based  Data  Strategy ✓VIAF,  ISNI,  FAST  Publish  Linked

     Data ✓WorldCat.org  Linked  Data  Release  –  using  Schema.org ✓Data  mining  of  WorldCat  resources ✓WorldCat  Works  Released  –  using  Schema.org ✓Schema.org  added  to  VIAF  RDF 2012   2014 ➢Application  Integration ➢WorldCat  Discovery ➢Analytics ➢Discovery  API ➢Cataloging 2015 ➢More  Entities  Released ➢Person ➢Organization ➢Event ➢Concept ➢New  Products               ➢Continuing  Evangelism ➢New  Services ➢Continuing  Innovation 2013 2016 2010
  103. OCLC  Entity  Based  Data  Strategy ✓VIAF,  ISNI,  FAST  Publish  Linked

     Data ✓WorldCat.org  Linked  Data  Release  –  using  Schema.org ✓Data  mining  of  WorldCat  resources ✓WorldCat  Works  Released  –  using  Schema.org ✓Schema.org  added  to  VIAF  RDF 2012   2014 ➢Application  Integration ➢WorldCat  Discovery ➢Analytics ➢Discovery  API ➢Cataloging 2015 ➢More  Entities  Released ➢Person ➢Organization ➢Event ➢Concept ➢New  Products               ➢Continuing  Evangelism ➢New  Services ➢Continuing  Innovation 2013 2016 2010
  104. OCLC  Entity  Based  Data  Strategy ✓VIAF,  ISNI,  FAST  Publish  Linked

     Data ✓WorldCat.org  Linked  Data  Release  –  using  Schema.org ✓Data  mining  of  WorldCat  resources ✓WorldCat  Works  Released  –  using  Schema.org ✓Schema.org  added  to  VIAF  RDF 2012   2014 ➢Application  Integration ➢WorldCat  Discovery ➢Analytics ➢Discovery  API ➢Cataloging 2015 ➢More  Entities  Released ➢Person ➢Organization ➢Event ➢Concept ➢New  Products               ➢Continuing  Evangelism ➢New  Services ➢Continuing  Innovation 2013 2016 2010
  105. OCLC  Entity  Based  Data  Strategy ✓VIAF,  ISNI,  FAST  Publish  Linked

     Data ✓WorldCat.org  Linked  Data  Release  –  using  Schema.org ✓Data  mining  of  WorldCat  resources ✓WorldCat  Works  Released  –  using  Schema.org ✓Schema.org  added  to  VIAF  RDF 2012   2014 ➢Application  Integration ➢WorldCat  Discovery ➢Analytics ➢Discovery  API ➢Cataloging 2015 ➢More  Entities  Released ➢Person ➢Organization ➢Event ➢Concept ➢New  Products               ➢Continuing  Evangelism ➢New  Services ➢Continuing  Innovation 2013 2016 2010
  106. OCLC  Entity  Based  Data  Strategy ✓VIAF,  ISNI,  FAST  Publish  Linked

     Data ✓WorldCat.org  Linked  Data  Release  –  using  Schema.org ✓Data  mining  of  WorldCat  resources ✓WorldCat  Works  Released  –  using  Schema.org ✓Schema.org  added  to  VIAF  RDF 2012   2014 ➢Application  Integration ➢WorldCat  Discovery ➢Analytics ➢Discovery  API ➢Cataloging 2015 ➢More  Entities  Released ➢Person ➢Organization ➢Event ➢Concept ➢New  Products               ➢Continuing  Evangelism ➢New  Services ➢Continuing  Innovation 2013 2016 2010
  107. OCLC  Entity  Based  Data  Strategy ✓VIAF,  ISNI,  FAST  Publish  Linked

     Data ✓WorldCat.org  Linked  Data  Release  –  using  Schema.org ✓Data  mining  of  WorldCat  resources ✓WorldCat  Works  Released  –  using  Schema.org ✓Schema.org  added  to  VIAF  RDF 2012   2014 ➢Application  Integration ➢WorldCat  Discovery ➢Analytics ➢Discovery  API ➢Cataloging 2015 ➢More  Entities  Released ➢Person ➢Organization ➢Event ➢Concept ➢New  Products               ➢Continuing  Evangelism ➢New  Services ➢Continuing  Innovation 2013 2016 2010
  108. OCLC  Entity  Based  Data  Strategy ✓VIAF,  ISNI,  FAST  Publish  Linked

     Data ✓WorldCat.org  Linked  Data  Release  –  using  Schema.org ✓Data  mining  of  WorldCat  resources ✓WorldCat  Works  Released  –  using  Schema.org ✓Schema.org  added  to  VIAF  RDF 2012   2014 ➢Application  Integration ➢WorldCat  Discovery ➢Analytics ➢Discovery  API ➢Cataloging 2015 ➢More  Entities  Released ➢Person ➢Organization ➢Event ➢Concept ➢New  Products               ➢Continuing  Evangelism ➢New  Services ➢Continuing  Innovation 2013 2016 2010
  109. OCLC  Entity  Based  Data  Strategy ✓VIAF,  ISNI,  FAST  Publish  Linked

     Data ✓WorldCat.org  Linked  Data  Release  –  using  Schema.org ✓Data  mining  of  WorldCat  resources ✓WorldCat  Works  Released  –  using  Schema.org ✓Schema.org  added  to  VIAF  RDF 2012   2014 ➢Application  Integration ➢WorldCat  Discovery ➢Analytics ➢Discovery  API ➢Cataloging 2015 ➢More  Entities  Released ➢Person ➢Organization ➢Event ➢Concept ➢New  Products               ➢Continuing  Evangelism ➢New  Services ➢Continuing  Innovation 2013 2016 2010
  110. but 6 excellent SWIB's ! Many great LD Projects If

     users  can't  discover  our  resources
  111. but 6 excellent SWIB's ! Many great LD Projects If

     users  can't  discover  our  resources What  is  the  point?
  112. Entification:  The  Route   to  Useful  Library  Data Richard  Wallis

      Technology  Evangelist   @rjw http://slideshare.net/rjw