
DLF 2012 SNAC/NAAC

tingletech
November 04, 2012

Denver, Colorado
Sunday, November 4, 2012
Adrian Turner, California Digital Library
Ray R. Larson, School of Information, UC Berkeley
Brian Tingle, California Digital Library

http://www.diglib.org/forums/2012forum/social-networks-and-archival-context-project/

Transcript

  1. Ray R. Larson, School of Information, UC Berkeley
    Brian Tingle, California Digital Library
    Adrian Turner, California Digital Library
    2012 DLF Forum | Denver, CO

  2. http://socialarchive.iath.virginia.edu

  3. Archival Name Authority System

  4. Archival Name Authority System
    Hamilton, Alexander, 1757-1804
    Luce, Clare Boothe, 1903-1987
    Oppenheimer, J. Robert, 1904-1967
    Patton, George S. (George Smith), 1885-1945
    Patton family
    Sontag, Susan, 1933-2004
    Washington, George, 1732-1799
    Whitman, Walt, 1819-1892
    Wright, Lloyd, 1890-1978

  5. Archival Name Authority System
    Anthony, Susan B
    Berkeley Free Church
    Bernstein, Leonard, 1918-
    Block, Herbert, 1909-2001
    Bush, Vannevar, 1890-1974
    Frankfurter, Felix, 1882-1965
    Franklin, Benjamin, 1706-1790
    Fuller, R. Buckminster (Richard Buckminster), 1895-1983
    Hamilton, Alexander, 1757-1804
    Luce, Clare Boothe, 1903-1987
    Oppenheimer, J. Robert, 1904-1967
    Patton, George S. (George Smith), 1885-1945
    Patton family
    Sontag, Susan, 1933-2004
    Washington, George, 1732-1799
    Whitman, Walt, 1819-1892
    Wright, Lloyd, 1890-1978

  6. Archival Name Authority System
    Name headings shown around the label:
    Hamilton, Alexander, 1757-1804; Luce, Clare Boothe, 1903-1987; Oppenheimer, J. Robert, 1904-1967;
    Patton, George S. (George Smith), 1885-1945; Patton family; Sontag, Susan, 1933-2004;
    Washington, George, 1732-1799; Whitman, Walt, 1819-1892; Wright, Lloyd, 1890-1978;
    Engelland, Jurgen (George); Enwall, Ogie (Aage); Erickson, Selma Inez; Fahl, Hans Johan Fredrik;
    Fet, Peter Laurits; Flones, Edward; Fredrickson, Hans; Fredrickson, Sven Fredrick; Garberg, Peder;
    Gillam, Chandler B., 1833-1899; Halseth, Otto Hjalmer; Handeland, Martha Tweiten;
    Hansen, Anne Schmidt; Hansen, Sylvia (Solveig); Haug, Olga Karoline Nilsen;
    Hemmestad, Olga Kristine Brodahl; Henry, Oscar M., 1851-1916; Holmes, Anna Gudrun Hauge;
    Holmes, Elias Kristofferson Velholmen; Hoset, Ole; Howard, Barnett Allen, b. 1827;
    Hytmo, Guri Olsdatter; Johnson, Andrew (Anders Johansson); Johnson, Phiea Petersen Stahl;
    Johnson, Thelma Irene Underdal; Jorgenson, Jorgen Aadneram; Kjersem, Ole Johnson;
    Knudsen, Johanne; Kofoed, Thorvald Andreas; Larsen, Elias; Lillelien, Thor; Loe, Otto Calvin;
    Molund, Erik Wilhelm; Nakkerud, Inga Amanda Treland; Nakkerud, Trygve Bloch; Nelson, Amanda;
    Nerland, Einar Magnus; Nielsen, Einer; Nilsen, Martha Dagsvik; Nissen, Ole Andreas Nissen;
    Norberg, Jonas Walfred; Norwick, Goodman; Nygaard, Lars Thomas; Odmark, Elsie Karlson;
    Ohrt, Sigfrid Eidsness; Oliver, Kole Skaflestad; Olson, Alvin E.; Opsal, Cato Torvald;
    Petersen, Greta Jensen; Rasmussen, Martin; Rinne, Esther Wiirre; Rodney family;
    Sandback, George Brun; Saure, Sivert Andreas

  7. Archival Name Authority System
    [Slide graphic: the name headings from slides 5 and 6, tiled repeatedly around the label; many are cropped at the slide edge.]

  8. Archival Name Authority System

  9. Archival Name Authority System

  10. Archival Name Authority System

  11. Background
    • Research and demonstration project
    • Multi-year funding
    • National Endowment for the Humanities (2010-2012)
    • Andrew W. Mellon Foundation (2012-2014)

  12. Objectives
    1. Develop tools for extracting EAC-CPF records, drawing on existing data (EAD finding aids, MARC records)
    2. Match, merge, and enhance; build a large test corpus of EAC-CPF records
    3. Create a prototype biographical resource and access system, using those records

  13. Objectives
    1. Develop tools for extracting EAC-CPF records, drawing on existing data (EAD finding aids, MARC records)
    2. Match, merge, and enhance; build a large test corpus of EAC-CPF records
    3. Create a prototype biographical resource and access system, using those records

  14. Objectives
    1. Develop tools for extracting EAC-CPF records, drawing on existing data (EAD finding aids, MARC records)
    2. Match, merge, and enhance; build a large test corpus of EAC-CPF records
    3. Create a prototype biographical resource and access system, using those records

  15. Project Team
    • University of Virginia, Institute for Advanced Technology in the Humanities
    – Daniel Pitti (PI) and Worthy Martin
    • UC Berkeley School of Information
    – Ray Larson and Yiming Liu
    • California Digital Library
    – Rachael Hu, Brian Tingle, and Adrian Turner

  16. Project Team
    • Terry Catapano (Columbia University)
    • Sara Sprenkle (Washington and Lee University)
    • Sarah Wells (University of Virginia)
    • Kathy Wisser (Simmons Graduate School of Library and Information Science)
    • Tom Lynch (University of Illinois School of Library and Information Science)

  17. View Slide

  18. EAC-CPF
    • XML-based data structure standard for encoding archival authority records
    • Authorized name headings for the entity
    • Biographical/historical context for the entity
    • Links to resources created by the entity
    • Links to resources about the entity
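
To make the four components above concrete, here is a minimal Python sketch that assembles a simplified EAC-CPF-style record. It is an illustration, not the project's tooling: the function name `build_eac_cpf`, its parameters, and the "about" collection title are hypothetical, and the record omits the `<control>` section a schema-valid EAC-CPF document needs. The element names and namespace follow the EAC-CPF 2010 schema as I understand it.

```python
import xml.etree.ElementTree as ET

EAC_NS = "urn:isbn:1-931666-33-4"  # namespace of the EAC-CPF 2010 schema


def _q(tag):
    """Qualify a tag name with the EAC-CPF namespace."""
    return f"{{{EAC_NS}}}{tag}"


def build_eac_cpf(name, entity_type, biog_text, created, about):
    """Build a simplified record (hypothetical helper; no <control> section)."""
    ET.register_namespace("", EAC_NS)
    root = ET.Element(_q("eac-cpf"))
    desc = ET.SubElement(root, _q("cpfDescription"))

    # Authorized name heading for the entity
    identity = ET.SubElement(desc, _q("identity"))
    ET.SubElement(identity, _q("entityType")).text = entity_type
    entry = ET.SubElement(identity, _q("nameEntry"))
    ET.SubElement(entry, _q("part")).text = name

    # Biographical/historical context for the entity
    description = ET.SubElement(desc, _q("description"))
    biog = ET.SubElement(description, _q("biogHist"))
    ET.SubElement(biog, _q("p")).text = biog_text

    # Links to resources created by the entity and about the entity
    relations = ET.SubElement(desc, _q("relations"))
    for rel_type, titles in (("creatorOf", created), ("subjectOf", about)):
        for title in titles:
            rel = ET.SubElement(
                relations, _q("resourceRelation"), {"resourceRelationType": rel_type}
            )
            ET.SubElement(rel, _q("relationEntry")).text = title
    return root


record = build_eac_cpf(
    name="Warren, John Byrne Leicester, 1835-1895, 3rd Baron de Tabley, poet",
    entity_type="person",
    biog_text="Poet, born in 1835; educated at Eton and Christ Church, Oxford.",
    created=["Tabley Muniments"],
    about=["Hypothetical collection describing the entity"],  # illustrative only
)
xml_text = ET.tostring(record, encoding="unicode")
print(xml_text)
```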

  19. View Slide

  20. Title

  21. Title

  22. Title

  23. Data Sources
    • EAD finding aids [~150,000]
    – 13 regional and statewide consortia
    – 35 repositories in US, UK, and France; multiple US federal agencies
    • MARC21 records [~1.5 million]
    – OCLC WorldCat
    • Authority records
    – OCLC Research: Virtual International Authority File (VIAF) [~16 million]
    – Getty Vocabulary Program: Union List of Artist Names (ULAN) [~120,000]
    – Additional name records from Archives nationales, British Library, NARA, New York State Archives, and Smithsonian Institution Archives

  24. Consortia
    • Archives Florida
    • Archives Hub (UK)
    • Arizona Archives Online
    • EAD FACTORY (OhioLink)
    • Five Colleges
    • Maine Archival Collections Online (MACON)
    • Northwest Digital Archives (NWDA)
    • Online Archive of California
    • Philadelphia Area Consortium of Special Collections Libraries (PACSCL)
    • Rhode Island Archival & Manuscript Collections Online (RIAMCO)
    • Rocky Mountain Online Archive (RMOA)
    • Texas Archival Resources Online (TARO)
    • Virginia Heritage
    Individual institutions
    • American Philosophical Society
    • Archives nationales (France)
    • Archives of American Art
    • Bibliothèque nationale de France
    – BnF Archives et manuscripts
    – French Union Catalog
    • Brigham Young University
    • Church of Latter Day Saints Archives
    • Columbia University
    • Cornell University
    • Duke University
    • Harvard University
    • Indiana University
    • Library of Congress (publicly available without restriction)
    • Minnesota Historical Society
    • Massachusetts Institute of Technology
    • National Library of Medicine
    • New York Public Library
    • New York University
    • North Carolina State
    • Northwestern University
    • Princeton University
    • Rutgers University
    • Smithsonian Institution Archives
    • Syracuse University
    • University of Alabama
    • University of Chicago
    • University of Connecticut
    • University of Delaware
    • University of Florida
    • University of Illinois
    • University of Kansas
    • University of Maryland
    • University of Michigan Bentley & Special Collections
    • University of Minnesota
    • University of Nebraska
    • University of North Carolina, Chapel Hill
    • University of Utah
    • Utah State Archives
    • Utah State University
    • Yale University

  25. Data Sources
    • EAD finding aids [~150,000]
    – 13 regional and statewide consortia
    – 35 repositories in US, UK, and France; multiple US federal agencies
    • MARC21 records [~1.5 million]
    – OCLC WorldCat
    • Authority records
    – OCLC Research: Virtual International Authority File (VIAF) [~16 million]
    – Getty Vocabulary Program: Union List of Artist Names (ULAN) [~120,000]
    – Additional name records from Archives nationales, British Library, NARA, New York State Archives, and Smithsonian Institution Archives

  26. Data Sources
    • EAD finding aids [~150,000]
    – 13 regional and statewide consortia
    – 35 repositories in US, UK, and France; multiple US federal agencies
    • MARC21 records [~1.5 million]
    – OCLC WorldCat
    • Authority records
    – OCLC Research: Virtual International Authority File (VIAF) [~16 million]
    – Getty Vocabulary Program: Union List of Artist Names (ULAN) [~120,000]
    – Additional name records from Archives nationales, British Library, NARA, New York State Archives, and Smithsonian Institution Archives

  27. Prototype Access System
    http://socialarchive.iath.virginia.edu

  28. SNAC
    Social Networks and Archival Context

  29. SNAC
    Social Networks and Archival Context

  30. NAAC
    National Archival Authorities Cooperative

  31. NAAC
    National Archival Authorities Cooperative
    http://socialarchive.iath.virginia.edu/NAAC_index.html

  32. Activities
    1. Cultivate EAC-CPF expertise across the archival community, through 140 SAA-hosted workshops
    2. Develop a blueprint for a sustainable, national archival authority cooperative

  33. Activities
    1. Cultivate EAC-CPF expertise across the archival community, through 140 SAA-hosted workshops
    2. Develop a blueprint for a sustainable, national archival authority cooperative

  34. Activities
    1. Cultivate EAC-CPF expertise across the archival community, through 140 SAA-hosted workshops
    2. Develop a blueprint for a sustainable, national archival authority cooperative
    Stay tuned for fall 2013!

  35. Ray R. Larson, School of Information, UC Berkeley
    Brian Tingle, California Digital Library
    Adrian Turner, California Digital Library
    2012 DLF Forum | Denver, CO

  36. Brian Tingle and Adrian Turner
    RBMS Pre-Conference 2012
    San Diego, CA

  37. The Social Networks and Archival Context Project: Status Report
    DLF 2012 - Denver, 2012-11-04
    Adrian Turner*, Ray R. Larson**, Brian Tingle*
    *California Digital Library
    **University of California, Berkeley - School of Information
    Thanks to Daniel V. Pitti of the Institute for Advanced Technology in the Humanities, University of Virginia, and Brian Tingle of the California Digital Library for many of the slides here

  38. Funding and People
    • Funding and Timeline
    – National Endowment for the Humanities, May 2010-April 2012
    – Andrew W. Mellon Foundation, May 2012-April 2014
    • People
    – Daniel Pitti (PI) and Worthy Martin (Institute for Advanced Technology in the Humanities, University of Virginia)
    – Adrian Turner and Brian Tingle (California Digital Library, University of California)
    – Ray Larson (School of Information, University of California, Berkeley)

  39. Two Interrelated Projects
    • Further the transformation of archival description (separate description of records from description of people documented in them) in order to …
    • Enhance access to archival resources, though in fact all cultural heritage resources
    • Enhance understanding of resources by providing the social-professional context within which people lived and worked

  40. The Source Data
    • EAD-encoded finding aids (guides to archival records)
    – 150K
    – Primarily from U.S. sources, but also U.K. and France
    • Archival authority records (360K)
    – National Archives and Records Administration
    – State Archive of New York
    – Smithsonian Institution
    – British Library
    – National Archives (France) & BnF
    • WorldCat Archival Descriptions: 2M

  41. Library and Museum Authority Records
    • Getty Vocabulary Program: Union List of Artist Names (293K personal and corporate names)
    • Virtual International Authority File (16M+ cluster records)
    – Contributed from around the world by national libraries and others

  42. Methods and Processing
    • Extract EAC-CPF records from existing EAD-encoded archival descriptions
    – Extracting both creators and referenced CPF names
    • Match EAC-CPF records against one another and against existing authority records (ULAN, VIAF, LCNAF)
    – Enhance EAC-CPF by normalizing entries, adding alternative entries, titles (VIAF), and historical data (ULAN)
    • Create a prototype historical resource and access system
    – Historical data and social-professional networks
    – Links to archive, library, and museum resources (by and about)

  43. Example EAD Record (Hub)
    GB 0133 TAB
    Tabley Muniments
    John Rylands University Library of Manchester
    150 Deansgate
    Manchester
    ... (Parts removed) …
    University of Manchester, John Rylands University Library of Manchester
    GB 0133 TAB
    Tabley Muniments
    19th century
    1.24 cu.m
    Warren, family, of Tabley, Cheshire
    Warren, John Byrne Leicester, 1835-1895, 3rd Baron de Tabley, poet

  44. Example EAD Record (Hub)
    Administrative/Biographical History
    The poet John Byrne Leicester Warren, later 3rd and last Baron de Tabley, of Tabley near Knutsford, Cheshire, was born in 1835, the son of the 2nd Baron de Tabley (1811-1887), and his wife, Catherina. His mother was Italian, the daughter of the count de Soglio, and Warren spent much of his early childhood with her in Italy and Greece. He was educated at Eton and Christ Church, Oxford. At Oxford he published a volume of poetry. Originally he published under the pseudonyms George F. Preston (1859-1862) and William Lancaster (1863-1868), but latterly under his own name.
    His early verse included Praeterita (1863), Eclogues and Monodramas (1864), Studies in Verse (1865), Philoctetes (1866), and Orestes (1868). His early work was Tennysonian in style, but he was later to be influenced by both Browning and Swinburne. In 1873 he produced …. (some data removed)…

  45. Example EAD Record (Hub)
    Scope and Content
    The collection consists mainly of the personal papers of the 3rd Baron de Tabley. The papers reflect his interests in literature, politics, botany and numismatics and include correspondence with numerous prominent later Victorian figures. Attention should also be drawn to de Tabley’s extensive and important collection of armorial bookplates.
    Correspondents include Sir Mountstuart Grant Duff, Edmund Gosse, Lord Houghton, A.C.Benson, and Robert Bridges. There are volumes of Tabley's essays and verse, as well as a considerable number of notebooks and loose manuscripts of verse and other writings. There are various bundles and boxes relating to "Coins", "Botany", "Poetry", "Literary", "Financial" and bookplates.
    Preliminary survey list.
    There is correspondence with the 3rd Baron de Tabley among the Edward Freeman Papers, held at JRULM. The Library also has custody of the important Tabley Book Collection.
    The family and estate papers of the Leicester-Warren Family of Tabley are held by Cheshire Record Office. Some of these papers were originally in the custody of the John Rylands University Library of Manchester.

  46. Example EAD Record (Hub)
    Index terms
    Tabley Inferior, Cheshire SJ7378
    Benson, Arthur Christopher, 1862-1923
    Bridges, Robert Seymour, 1844-1930
    Duff, Sir Mountstuart Elphinstone Grant, 1829-1906, Knight
    Gosse, Sir Edmund William, 1849-1928, Knight
    Milnes, Richard Monckton, 1809-1885, 1st Baron Houghton
    Bookplates
    Botany
    Numismatics
    Poetry, Modern, 19th century
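
The extraction step ("extracting both creators and referenced CPF names") can be sketched in Python against a finding aid like the one above. This is an assumption-laden illustration rather than the project's extraction code: the slides do not reproduce the record's markup, so `SAMPLE_EAD` is a hypothetical, heavily simplified EAD 2002 fragment, and `extract_cpf_names` is an invented helper. It treats names under `<origination>` as creators and names under `<controlaccess>` as referenced entities.

```python
import xml.etree.ElementTree as ET

# Hypothetical, simplified EAD 2002 fragment modeled on the Tabley example;
# the real record's markup is not shown in the slides.
SAMPLE_EAD = """
<ead>
  <archdesc level="fonds">
    <did>
      <unittitle>Tabley Muniments</unittitle>
      <origination>
        <famname>Warren, family, of Tabley, Cheshire</famname>
        <persname>Warren, John Byrne Leicester, 1835-1895, 3rd Baron de Tabley, poet</persname>
      </origination>
    </did>
    <controlaccess>
      <persname>Benson, Arthur Christopher, 1862-1923</persname>
      <persname>Bridges, Robert Seymour, 1844-1930</persname>
      <subject>Botany</subject>
    </controlaccess>
  </archdesc>
</ead>
"""

# EAD name elements mapped to EAC-CPF entity types
ENTITY_TYPES = {"persname": "person", "corpname": "corporateBody", "famname": "family"}


def extract_cpf_names(ead_xml):
    """Return creator and referenced CPF names found in an EAD document."""
    root = ET.fromstring(ead_xml)
    found, seen = [], set()
    for container, role in (("origination", "creator"), ("controlaccess", "referenced")):
        for parent in root.iter(container):
            for child in parent.iter():
                if child.tag not in ENTITY_TYPES:
                    continue  # skip subjects, places, and other index terms
                # Collapse whitespace across any nested inline markup
                name = " ".join("".join(child.itertext()).split())
                if (name, role) in seen:
                    continue  # nested containers would otherwise yield repeats
                seen.add((name, role))
                found.append(
                    {"name": name, "entityType": ENTITY_TYPES[child.tag], "role": role}
                )
    return found


names = extract_cpf_names(SAMPLE_EAD)
for entry in names:
    print(entry["role"], entry["entityType"], entry["name"], sep=" | ")
```

Each extracted name would then seed one EAC-CPF record, with the finding aid itself recorded as a related resource.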

  47. 2010-2012 Extraction Results
    • Source data: 30,000 finding aids
    • EAC-CPF records extracted
    – LoC: 43,702 from 1,159 finding aids
    – OAC: 91,811 from ~15,400
    – NWDA: 22,609 from 5,160
    – VH: 15,175 from 8,390
    – Total 173,297
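
As a quick arithmetic check on the figures above, the following Python sketch sums the per-source counts and derives records per finding aid. The dictionary layout and variable names are my own; the OAC finding-aid count is the approximate value given on the slide.

```python
# (EAC-CPF records extracted, finding aids processed) per source, from the slide
extracted = {
    "LoC": (43_702, 1_159),
    "OAC": (91_811, 15_400),  # the slide gives ~15,400 finding aids
    "NWDA": (22_609, 5_160),
    "VH": (15_175, 8_390),
}

total_records = sum(records for records, _ in extracted.values())
total_aids = sum(aids for _, aids in extracted.values())
per_aid = {src: round(records / aids, 1) for src, (records, aids) in extracted.items()}

print(total_records)  # matches the slide's total of 173,297
print(total_aids)     # close to the rounded 30,000 on the slide
print(per_aid)
```

The per-source ratios differ widely, from roughly 38 names per Library of Congress finding aid to about 2 for Virginia Heritage.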

  48. 2012-11-04 - SLIDE
    DLF 2012 - Denver
    Methods and Processing
    • Extract EAC-CPF records from existing EAD-encoded archival descriptions
    – Extracting both creators and referenced CPF names
    • Match EAC-CPF records against one another and against existing authority records (ULAN, VIAF, LCNAF)
    – Enhance EAC-CPF by normalizing entries, adding alternative entries, titles (VIAF), and historical data (ULAN)
    • Create a prototype historical resource and access system
    – Historical data and social-professional networks
    – Links to archive, library, and museum resources (by and about)

  49. The Problem
    • Proliferation of the forms of names
    – Different names for the same person
    – Different people with the same names
    • Examples
    – from Books in Print (semi-controlled but not
    consistent)
    – ERIC author index (not controlled)

  50. Goethe
    …etc…

  51. John Muir

  52. Library and Archive Authority
    • Library (or bibliographic) authority control is almost exclusively about the control of names
    • Archival authority control involves biographical-historical description of the CPF entity
    – Descriptions based on controlled vocabularies, for example, occupations, place of birth and death
    – But also biographical-historical description
      • Prose
      • Chronological list
    • Archival authority control provides context for understanding records, the context of their creation, the provenance

  53. Merging EAC-CPF Records
    [Diagram of the merging pipeline. Inputs: EAC Repository, VIAF Repository, LCNAF Repository, ULAN Repository. Steps: "Connect exactly matching records"; "Connect records using name authority information" (Cheshire Search); "Merge". Stores: "Repository of connected EAC Records (MongoDB)"; "Repository of merged EAC Records".]

  54. Merging EAC-CPF Records
    [Diagram of the merging pipeline. Inputs: EAC Repository, VIAF Repository. Steps: "Connect exactly matching records"; "Connect records using name authority information" (Cheshire Search); "Merge". Stores: "Repository of connected EAC Records (MongoDB)"; "Repository of merged EAC Records".]

  55. Connect Exact Matches
    • The EAC-CPF records provide the names
    without having to parse texts, etc.
    • Allows us to use some simple methods like
    exact matching
    – Assume identical name entries mean the
    same person/corporate body/family
    – Enter the full names and record IDs into a
    database and flag IDs with same names for
    merging
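
    The exact-match step can be sketched in a few lines. This is a minimal illustration, not the project's code: an in-memory dictionary stands in for the database, the record IDs (`loc-001` and so on) are invented, and `flag_exact_matches` is a hypothetical helper name. The name strings are taken from the failure examples later in the deck.

```python
from collections import defaultdict

# Hypothetical (record_id, name_entry) pairs; the IDs are invented for illustration.
records = [
    ("loc-001", "Tabb, John Banister, 1845-1909."),
    ("oac-417", "Tabb, John Banister, 1845-1909."),
    ("nwda-88", "Tabb, John B. (John Banister), 1845-1909."),
    ("vh-0042", "Taylor, Zachary, 1784-1850."),
]


def flag_exact_matches(pairs):
    """Group record IDs by identical name entry and keep groups of two or more."""
    ids_by_name = defaultdict(list)
    for record_id, name in pairs:
        ids_by_name[name].append(record_id)
    # Only names shared by several records are flagged for merging.
    return {name: ids for name, ids in ids_by_name.items() if len(ids) > 1}


flagged = flag_exact_matches(records)
print(flagged)
```

    Note that the third Tabb record is not flagged, because its name entry differs by a single abbreviation.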

  56. But…
    • Exact merging assumes that archives are
    following LC cataloging practice in their
    EAD records
    – There are some problems with this assumption

  57. Some failures for merging…
    • Different abbreviations:
    – A. & G. Carisch & C.
    – A. & G. Carisch & Co.
    • And spacing issues:
    – A. C. Peters & Bro.
    – A. C. Peters & Brother.
    – A. C. Peters. (??)
    – A. C.Peters & Bro.
    • Completeness and alternate rules
    – Tabb, John B. (John Banister), 1845-1909.
    – Tabb, John Banister, 1845-1909.
    • Also differing transliterations for non-Latin scripts

  58. More…
    • Variant romanizations (and spacing):
    – M. P. Belaieff.
    – M. P. Belaïeff.
    – M. P. Bieliaev.
    – M.P. Belaïeff.
    – M.P.Belaïeff.
    • Initials vs. names:
    – Zabolotskii, N.A.
    – Zabolotskii, Nikolai Alekseevich, 1903-1958.
    – Zabolotskii.

  59. More…
    • Inverted order vs. uninverted
    – Taylor, Zachary, 1784-1850.
    – Zachary Taylor.
    • Various combinations:
    – Tchaikovsky, Peter I.
    – Tchaikovsky, Pëtr Il.
    – Tchaikovsky, Piotr Ilyich.
    – Tchaikovsky, Pyotr Il.
    – Tchaikovsky, Pyotr Ilyich.
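
    Several of the variants on the last three slides differ only in spacing, punctuation, or diacritics. The sketch below shows one possible normalization key for such cases. It is an assumption made for illustration, not the method used by the project, and `normalize_name` is a hypothetical name. It deliberately does not handle abbreviations, inverted order, or variant romanizations.

```python
import re
import unicodedata


def normalize_name(name):
    """Build a comparison key that ignores case, spacing, punctuation, and diacritics."""
    # Decompose accented characters and drop the combining marks (ï -> i, ë -> e).
    decomposed = unicodedata.normalize("NFKD", name)
    stripped = "".join(ch for ch in decomposed if not unicodedata.combining(ch))
    # Lower-case, then remove everything except ASCII letters and digits.
    return re.sub(r"[^a-z0-9]", "", stripped.lower())


variants = ["M. P. Belaieff.", "M. P. Belaïeff.", "M.P. Belaïeff.", "M.P.Belaïeff."]
keys = {normalize_name(v) for v in variants}
print(keys)
```

    With this key the four Belaieff spellings collapse to one value, while "M. P. Bieliaev." and "A. C. Peters & Brother." still differ from their counterparts. A key this aggressive can also merge names that should stay separate, so it is better treated as a way to propose candidates than as a merge decision.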

  60. Merging EAC-CPF Records
    [Diagram of the merging pipeline. Inputs: EAC Repository, VIAF Repository. Steps: "Connect exactly matching records"; "Connect records using name authority information" (Cheshire Search); "Merge". Stores: "Repository of connected EAC Records (MongoDB)"; "Repository of merged EAC Records".]

  61. Search Authority Files
    • For each name, formulate a search of the VIAF database using the Cheshire system (SGML/XML retrieval system with probabilistic and Boolean matching)
    – Search both the “authoritative” and “non-authoritative” forms
    – Consider any name matching a non-authoritative form to be a candidate match for the authoritative form
    – Flag EAC records that match the same
    authority record as potential matches
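
    The flagging logic on this slide can be sketched as follows. This is only an illustration under stated assumptions: a small dictionary stands in for the VIAF database, an exact dictionary lookup replaces Cheshire's probabilistic and Boolean search, all IDs are invented, and the choice of which form is "authoritative" in the sample data is hypothetical.

```python
from collections import defaultdict

# Hypothetical miniature authority file. IDs and the authoritative/non-authoritative
# split are assumptions for illustration; the name strings come from earlier slides.
AUTHORITY_FILE = {
    "auth-0001": {
        "authoritative": "Taylor, Zachary, 1784-1850.",
        "non_authoritative": ["Zachary Taylor."],
    },
    "auth-0002": {
        "authoritative": "Tabb, John Banister, 1845-1909.",
        "non_authoritative": ["Tabb, John B. (John Banister), 1845-1909."],
    },
}


def build_lookup(authority_file):
    """Map every authoritative and non-authoritative form to its authority ID."""
    lookup = {}
    for auth_id, entry in authority_file.items():
        lookup[entry["authoritative"]] = auth_id
        for form in entry["non_authoritative"]:
            lookup[form] = auth_id
    return lookup


def flag_authority_matches(eac_names, authority_file):
    """Flag EAC record IDs that resolve to the same authority record."""
    lookup = build_lookup(authority_file)
    eac_ids_by_authority = defaultdict(list)
    for eac_id, name in eac_names:
        auth_id = lookup.get(name)
        if auth_id is not None:
            eac_ids_by_authority[auth_id].append(eac_id)
    return {a: ids for a, ids in eac_ids_by_authority.items() if len(ids) > 1}


eac_names = [
    ("eac-1", "Taylor, Zachary, 1784-1850."),
    ("eac-2", "Zachary Taylor."),
    ("eac-3", "Tabb, John Banister, 1845-1909."),
    ("eac-4", "Zabolotskii."),
]
potential = flag_authority_matches(eac_names, AUTHORITY_FILE)
print(potential)
```

    Here the inverted and uninverted Taylor entries are flagged together because both resolve to the same authority record, which exact matching alone would miss.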

  62. Shingle Language Model for names
    Name: Einstein Albert
    Shingle sequence: ein, ins, nst, ste, tei, ein … , ert
    Probability that the sequence (ins, nst, ste) follows ein is very high for the
    name einstein
    Krishna Janakiraman and Sean Marimpietri - Biograph
    NGRAM or Shingle Matching
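
    A short sketch of character shingling may help here. It splits each word of a name into overlapping three-character shingles, which reproduces the sequence shown for "Einstein". The comparison step is a swapped-in technique: it uses plain Jaccard overlap of shingle sets, not the probabilistic language model named on the slide. Shingling word by word, and the function names, are assumptions for illustration.

```python
def shingles(name, size=3):
    """Return the overlapping character n-grams of each word, in order."""
    result = []
    for word in name.lower().split():
        result.extend(word[i:i + size] for i in range(len(word) - size + 1))
    return result


def jaccard(name_a, name_b):
    """Share of distinct shingles that two names have in common (0.0 to 1.0)."""
    set_a, set_b = set(shingles(name_a)), set(shingles(name_b))
    union = set_a | set_b
    if not union:
        return 0.0
    return len(set_a & set_b) / len(union)


print(shingles("Einstein Albert"))
print(jaccard("Einstein Albert", "Albert Einstein"))
print(jaccard("Einstein Albert", "Ainshtain Albert"))
```

    Because the comparison works on sets, word order does not matter, so the inverted and uninverted forms score as identical, while the variant romanization still shares the shingles of "Albert" plus "ins".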

  63. Shingle Language Model for names
    Name 1: Einstein Albert
    Name 2: Ainshtain Albert
    Name 3: Albert Einstein
    [Diagram: the shingle sequence of each name (for example ein, ins, nst, ste, tei, ein for Names 1 and 3; ain, ins, nsh, sht, hta, tai, ain for Name 2; alb, lbe, ert shared by all three) laid out side by side.]
    Krishna Janakiraman and Sean Marimpietri - Biograph

  64. Merging EAC-CPF Records
    [Diagram of the merging pipeline. Inputs: EAC Repository, VIAF Repository. Steps: "Connect exactly matching records"; "Connect records using name authority information" (Cheshire Search); "Merge". Stores: "Repository of connected EAC Records (MongoDB)"; "Repository of merged EAC Records".]

  65. Merge Flagged Records
    • For all of the exact matches and authority
    matches
    – Use the Authoritative form of the name
    – Combine data from each match into a single
    EAC-CPF record
    – Retain all source record IDs and information
    • Finally, output the merged EAC-CPF
    records
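
    The merge rules above can be sketched with plain dictionaries standing in for EAC-CPF XML records. The field names (`id`, `name`, `resources`), the sample IDs, and the helper `merge_records` are all hypothetical; the real records carry far more structure.

```python
def merge_records(authoritative_name, records):
    """Combine flagged records into one, keeping every source ID."""
    merged = {
        "name": authoritative_name,   # use the authoritative form of the name
        "alternative_names": [],      # other forms seen in the source records
        "source_ids": [],             # retain all source record IDs
        "resources": [],              # combined data from each match
    }
    for record in records:
        name = record["name"]
        if name != authoritative_name and name not in merged["alternative_names"]:
            merged["alternative_names"].append(name)
        merged["source_ids"].append(record["id"])
        for resource in record["resources"]:
            if resource not in merged["resources"]:
                merged["resources"].append(resource)
    return merged


# Invented sample data for illustration only.
flagged = [
    {"id": "loc-001", "name": "Taylor, Zachary, 1784-1850.", "resources": ["finding-aid-A"]},
    {"id": "oac-417", "name": "Zachary Taylor.", "resources": ["finding-aid-B", "finding-aid-A"]},
]
merged = merge_records("Taylor, Zachary, 1784-1850.", flagged)
print(merged)
```

    Keeping the source IDs on the merged record is what makes it possible to trace a merge back to its inputs, or to undo it later.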

  66. Inputs to SNAC merging
    • LoC: 43,702 EAC-CPF records derived from 1,159 finding aids
    • OAC: 91,814 EAC-CPF records derived from ~15,400 finding aids
    • NWDA: 24,952 EAC-CPF records derived from 5,568 finding aids
    • VH: 15,175 EAC-CPF records
    • Total: 175,688 Input EAC records for merging
    • Result: 128,781 “unique” names

  67. Another view of the numbers…
    • 95,624 Person names merged from 125,555 Person records
    • 31,287 Institutions merged from 47,189 Institution records
    • 1,980 Families merged from 2,899 Family records

  68. Merging Conclusions
    • There will not be a single merging method, but a staged set of approaches that takes us from the simplest exact matches to (we hope) reliable identification of variant forms of a name when corroborated by contextual information (dates, etc.)

  69. Next
    • Developing an updateable database of merged EAC data (replacing MongoDB with PostgreSQL)
    – Will permit incremental addition of new data and support editing and “forced” merges
    • Process the 2M WorldCat archival descriptions
    • Process the 150,000 finding aids
    • Convert several hundred thousand archival authority records into EAC-CPF and run them through the match/merge process

  70. Methods and Processing
    • Extract EAC-CPF records from existing EAD-encoded archival descriptions
    – Extracting both creators and referenced CPF names
    • Match EAC-CPF records against one another and against existing authority records (ULAN, VIAF, LCNAF)
    – Enhance EAC-CPF by normalizing entries, adding alternative entries, titles (VIAF), and historical data (ULAN)
    • Create a prototype historical resource and access system
    – Historical data and social-professional networks
    – Links to archive, library, and museum resources (by and about)

  71. For More Information
    • http://socialarchive.iath.virginia.edu/ (project website)
    • http://socialarchive.iath.virginia.edu/xtf/search (public prototype)

  72. Historic Social Networks Prototype access system

  74. Outline
    • User Persona
    • Search and Display
    • Network graph visualization
    • Linked Data / RDF
    • Future Plans

  75. Meet the target users
    • Randy: Graduate student working on a PhD that involves biographies and the study of diplomatic families and networks. Sometimes he comes to the site looking for information on specific people; other times he is looking for information on a specific subject or event. He also TAs an undergraduate history class and sometimes has to help students find topics for papers.
    • Connie: Works at an institution that contributed records to the project. Will be asking how this site would be useful to their users. Wants to understand how their records were used and what the added value is.
    • Quincy: Library school student working to QA record matching.
    • Adele: Person doing authority work during collection processing.
    • Lenny: Likes linked data, and wants to be able to mine the links that have been established programmatically.
    Personas are fictional characters created to represent the different user types within a targeted demographic, attitude and/or behavior set that might use a site, brand or product in a similar way. http://en.wikipedia.org/wiki/Persona_(marketing)

  76. Outline
    • User Persona
    • Search and Display
    • Network graph visualization
    • Linked Data / RDF
    • Future Plans

  84. Advanced limits match EAC sections

  102. Outline
    • User Persona
    • Search and Display
    • Network graph visualization
    • Context widget (needs new name)
    • Linked Data / RDF
    • Future Plans

  103. TinkerPop graph database stack
    • Simple "property graph" model
    • "JDBC for graph databases" [SNAC is using Neo4j for the graphDB]
    • XPath-like "Gremlin" for graph query
    • REST interfaces with "Rexster"
    • For me, this was 10 to 100 times easier than using RDF
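
    The "property graph" model in the first bullet is simple enough to sketch in plain Python: vertices and edges both carry key/value properties, and edges have a label and a direction. The class below is a toy illustration only. It is not the TinkerPop, Gremlin, or Neo4j API, and the class name, method names, vertex IDs, and edge label are assumptions.

```python
class PropertyGraph:
    """Toy in-memory property graph: vertices and labeled edges with properties."""

    def __init__(self):
        self.vertices = {}  # vertex id -> property dict
        self.edges = []     # (out_id, label, in_id, property dict)

    def add_vertex(self, vertex_id, **properties):
        self.vertices[vertex_id] = properties

    def add_edge(self, out_id, label, in_id, **properties):
        self.edges.append((out_id, label, in_id, properties))

    def neighbors(self, vertex_id, label=None):
        """Follow edges in both directions, optionally filtered by label."""
        found = []
        for out_id, edge_label, in_id, _ in self.edges:
            if label is not None and edge_label != label:
                continue
            if out_id == vertex_id:
                found.append(in_id)
            elif in_id == vertex_id:
                found.append(out_id)
        return found


# Illustrative data; the vertex IDs and the edge label are made up for this sketch.
graph = PropertyGraph()
graph.add_vertex("hamilton", name="Hamilton, Alexander, 1757-1804", entity_type="person")
graph.add_vertex("washington", name="Washington, George, 1732-1799", entity_type="person")
graph.add_vertex("franklin", name="Franklin, Benjamin, 1706-1790", entity_type="person")
graph.add_edge("hamilton", "correspondedWith", "washington")
graph.add_edge("washington", "correspondedWith", "franklin")

print(graph.neighbors("washington", label="correspondedWith"))
```

    A social network of names maps onto this model directly, with one vertex per merged record and one edge per relationship.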

  113. Outline
    • User Persona
    • Search and Display
    • Network graph visualization
    • Linked Data / RDF
    • Future Plans

  114. What is Linked Open Data?
    • W3C Semantic Web technology stack
    • Web of atomized data, not a web of documents
    • RDF; OWL ontologies; SPARQL queries; triple/quad/quint stores
    • httpRange-14; content negotiation; CURIE
    • No restrictions on data use; free and easy license
    • Lenny wants it, but does Randy?

  115. What is Linked Open Data?
    • Getting to the good stuff
    • Blue underlined text
    • Pulling in data from multiple sources, in an intelligent way, into a "document"
    • Understand and discover relationships
    • Open access for research, education, private study and other fair use

  116. RDFa owl:sameAs
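
    An `owl:sameAs` statement asserts that two URIs identify the same entity. The slide shows it embedded as RDFa; the sketch below writes the equivalent statement as a single N-Triples line instead. Both URIs are placeholders on `example.org`, not real SNAC or authority-file identifiers, and `same_as_triple` is a hypothetical helper.

```python
# The owl:sameAs property URI from the W3C OWL vocabulary.
OWL_SAME_AS = "http://www.w3.org/2002/07/owl#sameAs"


def same_as_triple(subject_uri, object_uri):
    """Format one owl:sameAs statement as an N-Triples line."""
    return f"<{subject_uri}> <{OWL_SAME_AS}> <{object_uri}> ."


# Placeholder URIs for illustration only.
line = same_as_triple(
    "http://example.org/snac/hamilton-alexander-1757-1804",
    "http://example.org/authority/12345",
)
print(line)
```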

  117. HTML 5 microdata in chron list

  118. Thanks Ed Summers!
    RDF of the social graph

  122. http://templates.xdams.net/IBC/ontology/eac-cpf.rdf
    Silvia Mazzini
    regesta.exe srl

  124. &mode=xml2owl [experimental]

  125. My opinion on the use cases for W3C RDF tech
    • Good for publishing data
    • Good for controlled vocabularies
    • Data models?
    • Most people with open source RDF-store type systems do the real stuff with Solr
    • Consider a graph database

  127. Outline
    • User Persona
    • Search and Display
    • Linked Data / RDF
    • Network graph visualization
    • Future Plans

    View Slide

  128. Future Plans
    • Conduct assessment activities involving members of target audiences to establish mental model of users for design work
    • Scale interface to millions of names
    • Visualizations useful and integrated (network and geospatial)
    • Stable URLs between batches for linked data
    • Social and personalization features (gateway to crowdsourcing)
    • Integration with local systems (such as with the context widget)

  129. • Photo attribution http://www.flickr.com/photos/dsevilla/139656712/in/photostream/
    • http://xtf.cdlib.org/
    • http://code.google.com/p/eac-graph-load/source/browse/README.txt
    • http://tinkerpop.com/
    • http://thejit.org/
    • https://github.com/tingletech/snac-related-widget
