Upgrade to Pro — share decks privately, control downloads, hide ads and more …

VIVO Use-Cases for Collaborative Science: From ...

VIVO Use-Cases for Collaborative Science: From Researcher Networks to Semantic User Interfaces for Data

These are the slides from my panel talk at the #VIVO13 Conference entitled "VIVO Use-Cases for Collaborative Science: From Researcher Networks to Semantic User Interfaces for Data." This conference was held in St. Louis, MO on August 15-16, 2013.

Robert H. McDonald

August 15, 2013
Tweet

More Decks by Robert H. McDonald

Other Decks in Education

Transcript

  1. 1 Panel: VIVO Use-Cases for Collaborative Science: From Researcher Networks

    to Semantic User Interfaces for Data Robert H. McDonald – Indiana University Hao Xu – University of North Carolina Patrick West – Rensselaer Polytechnic Institute William K. Barnett – Indiana University
  2. Sustainable  Environment  –  Ac4onable  Data  |  @SEADdatanet   SEAD VIVO

     Conference  2013   August  15,  2013   Robert  H.  McDonald   Indiana  University  |  @mcdonald  
  3. SEAD  Partners   Margaret  Hedstrom,  PI   Ann  Zimmerman  

    James  Myers   Beth  Plale   Katy  Börner   Robert  H.  McDonald       Praveen  Kumar              George  Alter  &  Bryan  Beecher    
  4. Data  Challenges  in  Sustainability  Research   •  Many  dimensions,  many

     coordinate  systems,  many   scales,  many  formats,  many  providers  and  users,  …  
  5. SEAD’s  Goals   •  Provide  data  services  that  address  the

     pressing  needs   of  researchers  working  toward  sustainability   •  Integrate  these  services  into  an  generalizable  “Ac4ve   and  Social  Cura4on”  infrastructure  well-­‐suited  to  the   social  structure  and  economics  of  long-­‐tail  research   communi4es   •  Develop  capabili4es  to  package  and  migrate  datasets   to  a  federated  repository  infrastructure  for  long-­‐term   preserva4on   •  Educa4on,  outreach,  &  training,  to  maximize  value  and   disseminate  SEAD’s  contribu4ons  to  other  projects  and   communi4es    
  6. SEAD  Prototype  Community:  NCED   •  Na4onal  Center  for  Earth

      Surface  Dynamics  (NCED)   one  overarching  ques7on:   "How  will  the  coupled  system   of  physical,  biological,   geochemical,  and  human   processes  that  shape  the   surface  of  the  Earth  respond   to  changes  in  climate,  land   use,  environmental   management,  and  other   forcings?"    
  7. Linked  Data  for  Community  Explora4on   Data  Publica4on  and  Research

     Analy4cs   People   •   Affilia4on   •   Publica4ons   •   Research  ac4vi4es   •   Teaching   •   Service   Research   •   Publica4ons   •   Authors  +  collaborators   •   Cita4ons   •   Data  Publica4ons   •   Subject  areas   •   IDs   Organiza-­‐ 4ons   •   Funding   •   People   •   Publica4ons   •   Data   •   Loca4on   Data   • Research  Data   Management   •   Data  Publica4ons   •   Research  Rela4onship  to   Data   SEAD   Seman7c   Layer   •  Linked  things,  people  and   ac4vi4es   •  Rich  “bird’s  eye”  picture     •  Comparisons  over  4me   •  Dynamic  views  (mul4ple   diagrams  and  views)   •  Easier  to  create  highlights   and  track  cita4ons  and   progress  
  8.  Summary  of  SEAD’s  Contribu4ons     •  Provides  researchers  with

     access  to   heterogeneous  data  collec4ons  needed  for   sustainability  science   •  Supports  data  management  and  ac4ve  cura4on   that  improves  and  adds  value  to  data   •  Creates  a  rich  discovery  environment  of  data,   publica4ons,  and  exper4se   •  Ensures  long-­‐term  preserva4on  of  data  with   publica4ons  through  interoperability  with  trusted   repositories  
  9. More  SEAD  Informa4on   •  Follow  us  on  Twiier  @SEADdatanet

      •  See  all  of  our  demo  videos  (collec4on  within  ACR)   •  hip://bit.ly/1cHhkjw   •  Check  out  our  Web  site   •  hip://sead-­‐data.net  (ACR/Social  Network/VirtA)   •  SEAD  Demo  -­‐  hip://sead-­‐demo.ncsa.illinois.edu     •  Contact  Us:       Robert  H.  McDonald  |  [email protected]  
  10. 12 Questions for Discussion 1.  What are the current challenges

    in integrating the semantic frameworks for people (vivo) and resources (eagle-i, deep carbon observatory)? How can this integration be aligned with VIVO or other tools? 2.  What are the challenges in scalability for very large collections (Ex. in DFC they are expecting 100s of millions of files or iPlant has 10K researcher profiles) how can we plan and prepare for display of that many files or profiles and what implications does this have for the i/o load on Vivo for registering resources?
  11. 13 Questions for Discussion 3.  If there is a preferred

    triple store for use in scaling VIVO? What are the current best options? 4.  What have others done to fine tune VIVO for scalability - conversion of triples to RDBMS for transaction management - extension of VIVO to something like elastic search that has better capability for real-time indexing than solr? 5.  Are others using URIs such as DOIs with all Vivo resources that are registered? If so what are the challenges in that type of implementation?