Upgrade to Pro — share decks privately, control downloads, hide ads and more …

SATORI - A System for Ontology-Guided Visual Exploration of Biomedical Data Repositories

SATORI - A System for Ontology-Guided Visual Exploration of Biomedical Data Repositories

These are the slides from my master thesis presentation. SATORI is an ontology-guided visual exploration system for data repositories, which combines powerful metadata search with a treemap and a node-link diagram that visualize the repository structure, provide context to retrieved data sets, and serve as an interface to drive semantic querying and exploration, and thereby support the information foraging loop. SATORI is web-based, open-source, and integrated in the Refinery Platform – an application for biomedical data management, analysis, and visualization.

Fritz Lekschas

April 15, 2016
Tweet

More Decks by Fritz Lekschas

Other Decks in Science

Transcript

  1. Fritz Lekschas SATORI Apr 15, 2016 !1 A System for

    Ontology-Guided Visual Exploration of Biomedical Data Repositories
  2. Fritz Lekschas GOAL A System for Ontology-guided Visual Exploration of

    Biomedical Data Repositories » Refinery » Biological Context » Search & Visualization
 » RNA-Seq, ChIP-Seq, … » Stem Cell Commons, … Apr 15, 2016 !3
  3. Fritz Lekschas BIG DATA Opportunities • Test hypothesis without data

    generation • Enrich in-house generated data • Meta analysis • Large scale data mining Challenges ➢ Find relevant data Apr 15, 2016 !6
  4. Fritz Lekschas USER ROLES Data Analyst Analysing raw data to

    test a specific hypothesis Data Curator Develop and maintain annotation strategies Apr 15, 2016 !7 Project Leader Propose ideas to address challenges and foresee trends
  5. Fritz Lekschas LIVE DEMO *All bugs have been introduced for

    the sake of entertainment only. Please do not report them. Apr 15, 2016 !18
  6. Fritz Lekschas CONCLUSION • Ontology-guided visualizations enrich exploration: overview &

    higher-level terms • Users need initial training or guidance • Need unified query interface • Utility of ontologies crucially depend on the annotation quality Apr 15, 2016 !20
  7. Fritz Lekschas FUTURE WORK • Evaluate generality of SATORI •

    Unified query interface • Invert order nodes in the graph • Unify concept of the graph with faceted browsing Apr 15, 2016 !21
  8. Fritz Lekschas PRECISION & RECALL Data Set Organism Cell Type

    Disease Seq. Techno. 1 Human Hepatocyte Healthy RNA-Seq Mouse Hepatocyte Healthy RNA-Seq 2 Human Podocyte Healthy RNA-Seq Human Podocyte Congenital nephrotic syndrome RNA-Seq 3 Human Fibroblast Healthy Microarray C57BL/6 Fibroblast Healthy Microarray 4 Human Fibroblast Cardiac fibrosis Microarray 5 Human Fibroblast Healthy RNA-Seq Charact. Value Precision Recall Charact. Value Precision Recall Organism Human 2 / 2 2 / 5 Disease Healthy 2 / 2 2 / 5 Mouse 1 / 2 1 / 2 Cardiac Fibrosis 1 / 2 1 / 1 Cell Type Fibroblast 2 / 2 2 / 3 Seq. Techno. Microarray 2 / 2 2 / 2 Apr 15, 2016 !25 Search: “Human Fibroblast Microarray”