Upgrade to Pro — share decks privately, control downloads, hide ads and more …

SATORI - A System for Ontology-Guided Visual Exploration of Biomedical Data Repositories

SATORI - A System for Ontology-Guided Visual Exploration of Biomedical Data Repositories

These are the slides from my master thesis presentation. SATORI is an ontology-guided visual exploration system for data repositories, which combines powerful metadata search with a treemap and a node-link diagram that visualize the repository structure, provide context to retrieved data sets, and serve as an interface to drive semantic querying and exploration, and thereby support the information foraging loop. SATORI is web-based, open-source, and integrated in the Refinery Platform – an application for biomedical data management, analysis, and visualization.

Fritz Lekschas

April 15, 2016
Tweet

More Decks by Fritz Lekschas

Other Decks in Science

Transcript

  1. Fritz Lekschas
    SATORI
    Apr 15, 2016 !1
    A System for Ontology-Guided Visual
    Exploration of Biomedical Data Repositories

    View Slide

  2. Fritz Lekschas


    “Seeing into one's true nature”
    Apr 15, 2016 !2

    View Slide

  3. Fritz Lekschas
    GOAL
    A System for
    Ontology-guided
    Visual Exploration of
    Biomedical Data
    Repositories
    » Refinery
    » Biological Context
    » Search & Visualization

    » RNA-Seq, ChIP-Seq, …
    » Stem Cell Commons, …
    Apr 15, 2016 !3

    View Slide

  4. Fritz Lekschas
    WHY?
    Apr 15, 2016 !4

    View Slide

  5. Jules J. Berman, 2013
    “BIG DATA”
    Volume—Larger studies
    Variety—More studies
    Velocity—Constantly changing data
    Apr 15, 2016 !5

    View Slide

  6. Fritz Lekschas
    BIG DATA
    Opportunities
    • Test hypothesis without data
    generation
    • Enrich in-house generated
    data
    • Meta analysis
    • Large scale data mining
    Challenges
    ➢ Find relevant data
    Apr 15, 2016 !6

    View Slide

  7. Fritz Lekschas
    USER ROLES
    Data Analyst
    Analysing raw data
    to test a specific
    hypothesis
    Data Curator
    Develop and
    maintain annotation
    strategies
    Apr 15, 2016 !7
    Project Leader
    Propose ideas to
    address challenges
    and foresee trends

    View Slide

  8. Fritz Lekschas
    WHAT?
    Apr 15, 2016 !8

    View Slide

  9. Fritz Lekschas
    Apr 15, 2016 !9

    View Slide

  10. Fritz Lekschas
    Apr 15, 2016 !10

    View Slide

  11. Fritz Lekschas
    Apr 15, 2016 !11

    View Slide

  12. Fritz Lekschas
    HOW?
    Apr 15, 2016 !12

    View Slide

  13. Fritz Lekschas
    Apr 15, 2016 !13

    View Slide

  14. Fritz Lekschas
    Apr 15, 2016 !14

    View Slide

  15. Fritz Lekschas
    Apr 15, 2016 !15

    View Slide

  16. Fritz Lekschas
    Apr 15, 2016 !16

    View Slide

  17. Fritz Lekschas
    Apr 15, 2016 !17

    View Slide

  18. Fritz Lekschas
    LIVE DEMO
    *All bugs have been introduced for the sake of
    entertainment only. Please do not report them.
    Apr 15, 2016 !18

    View Slide

  19. Fritz Lekschas
    WRAP UP
    Apr 15, 2016 !19

    View Slide

  20. Fritz Lekschas
    CONCLUSION
    • Ontology-guided visualizations enrich
    exploration: overview & higher-level terms
    • Users need initial training or guidance
    • Need unified query interface
    • Utility of ontologies crucially depend on the
    annotation quality
    Apr 15, 2016 !20

    View Slide

  21. Fritz Lekschas
    FUTURE WORK
    • Evaluate generality of SATORI
    • Unified query interface
    • Invert order nodes in the graph
    • Unify concept of the graph with faceted
    browsing
    Apr 15, 2016 !21

    View Slide

  22. Fritz Lekschas
    THANK YOU!
    http://satori.refinery-platform.org
    Apr 15, 2016 !22

    View Slide

  23. Fritz Lekschas
    APPENDIX
    Apr 15, 2016 !23

    View Slide

  24. Fritz Lekschas
    PRECISION & RECALL
    Apr 15, 2016 !24

    View Slide

  25. Fritz Lekschas
    PRECISION & RECALL
    Data Set Organism Cell Type Disease Seq. Techno.
    1 Human Hepatocyte Healthy RNA-Seq
    Mouse Hepatocyte Healthy RNA-Seq
    2 Human Podocyte Healthy RNA-Seq
    Human Podocyte Congenital nephrotic syndrome RNA-Seq
    3 Human Fibroblast Healthy Microarray
    C57BL/6 Fibroblast Healthy Microarray
    4 Human Fibroblast Cardiac fibrosis Microarray
    5 Human Fibroblast Healthy RNA-Seq
    Charact. Value Precision Recall Charact. Value Precision Recall
    Organism Human 2 / 2 2 / 5 Disease Healthy 2 / 2 2 / 5
    Mouse 1 / 2 1 / 2 Cardiac
    Fibrosis
    1 / 2 1 / 1
    Cell Type Fibroblast 2 / 2 2 / 3 Seq. Techno. Microarray 2 / 2 2 / 2
    Apr 15, 2016 !25
    Search: “Human Fibroblast Microarray”

    View Slide

  26. Fritz Lekschas
    PRECISION & RECALL
    Apr 15, 2016 !26

    View Slide

  27. Fritz Lekschas
    SATORI
    Semantic Annotation Tool and
    Ontological Relations Interface
    Apr 15, 2016 !27

    View Slide

  28. Fritz Lekschas


    “Enlightenment, Awakening,
    Comprehension & Understanding”
    Apr 15, 2016 !28

    View Slide