Upgrade to Pro — share decks privately, control downloads, hide ads and more …

SemanTex

Emir Muñoz
October 21, 2014

 SemanTex

SemanTex: Semantic Text Exploration Using Document Links Implied by Conceptual Networks Extracted from the Texts, Poster presented in ISWC 2014

Emir Muñoz

October 21, 2014
Tweet

More Decks by Emir Muñoz

Other Decks in Research

Transcript

  1. Suad Aldarra(FI*1), Emir Muñoz(FI*1), Pierre-Yves Vandenbussche (FI*1), Vít Nováček (Insight*2)
    SemanTex: Semantic Text Exploration Using
    document Links Implied by Conceptual Networks
    Summary
    - Automatic method for computing semantic links between documents
    - Provide the conceptual paths (explanation) between two documents
    - Allow topical navigation within and through documents
    - Application to biomedical literature (PubMed) Parkinson Disease article
    (*1) Fujitsu Ireland Limited (*2) Insight @ NUI Galway
    Method
    1. Noun-Phrase extraction
    (Using Biomedical Named Entity recognition)
    2. Co-occurrence Relationship computation
    (Using Point wise Mutual Information)
    3. Cosine similarity computation
    Implementation and Evaluation
    - Experimentation on 4722 abstracts from PubMed Parkinson disease related articles
    that led to the extraction of 43,362 concepts and 38M paths possible before paths selection
    - Automated evaluation (TREC dataset: doc-doc relationship; MeSH: topical progression)
    - Expert evaluation (Quality of paths selection)
    A B C D
    x
    r
    u
    t
    v
    y
    x
    r
    u
    t
    v
    y
    (A,B)
    (B,C)
    (C,D) (C)
    (D)
    (D)
    A D
    x
    x
    u v y
    r t
    Conceptual Network
    Extraction
    4. Extraction of paths between all nodes
    and between all documents
    (Using path length and product
    of the edges weight threshold)
    5. Multi-objective optimization
    -Complexity
    -Coherence
    -Entropy
    Paths Extraction
    and Selection
    Conceptual Network
    Most relevant Paths from x
    Most relevant Paths between documents A and D
    Application to PubMed abstracts
    Most relevant paths from
    “maternal transmission” concept

    View Slide