Upgrade to Pro — share decks privately, control downloads, hide ads and more …

A quick intro to networks

A quick intro to networks

Lightning talk from the April 25th joint meetup of R Users ATX and R Ladies ATX

Links:

Introduction to ggraph: Layouts – Thomas Lin Pederson
https://www.data-imaginist.com/2017/ggraph-introduction-layouts/

https://github.com/TheOpteProject/LGL

http://clairemcwhite.github.io/lgl-guide/

"LGL: creating a map of protein function with an algorithm for visualizing very large biological networks.
" https://www.sciencedirect.com/science/article/pii/S0022283604004851?via%3Dihub

Claire D. McWhite

April 26, 2018
Tweet

More Decks by Claire D. McWhite

Other Decks in Programming

Transcript

  1. Edges can have attributes node1 node2 weight relationship direction A

    B 5 red E A C 2 red E A D 5 blue F B C 1 blue EF C D 5 blue EF B A C D
  2. Nodes can have attributes node size color A 1 red

    B 2 red C 3 blue D 4 blue B A C D
  3. D Network layout algorithms node1 node2 A B A C

    A D B C C D B A C Network data structure text file Visualized network Network layout algorithm Calculates node coordinates Not very interpretable by humans Ideally more interpretable by humans
  4. Hierarchical layouts Introduction to ggraph: Layouts – Thomas Lin Pederson

    https://www.data-imaginist.com/2017/ggraph-introduction-layouts/ Yfiles circular layouts
  5. node1 node2 weight A B 1 B C 10 A

    B C A B C A B C 1 10 Edges act as springs in a force-directed layout
  6. There is a sweet spot of network size for most

    network layout algorithms. Then you enter the hairball zone https://www.systemsbiology.org/news/2012/12/19/combing-the-hairball/
  7. Steps to break up hairball networks Apply a threshold for

    edge weights. ------------------------ Thresholding -------------------
  8. Steps to break up hairball networks Apply a clustering algorithm

    to network data ------------------------ Clustering -------------------
  9. My data: No clustering 10 million edge, 1 million node

    network, Large Graph Layout (LGL)
  10. Software  R – ggraph/tidygraph  Python – NetworkX 

    Gui -Cytoscape (lots of tools for biology)  Gui – Gephi  Gui - Graphviz
  11. Large graph layout - LGL Protein homology graph – Edward

    Marcotte and Alex Adai - MOMA • My fav • Minimizes hairballness of larger networks • Scalable (handles over 1 million node networks) Adai 2004, LGL: creating a map of protein function with an algorithm for visualizing very large biological networks.
  12. Post-talk addendum  It would be great if someone could

    make a LGL network layout algorithm plugin for cytoscape or any network visualization software.  The program currently only runs on Linux  https://github.com/TheOpteProject/LGL  My user guide -> http://clairemcwhite.github.io/lgl-guide/  Original paper describing algorithm: https://www.sciencedirect.com/science/article/pii /S0022283604004851?via%3Dihub
  13. If your data looks like this at all, throw it

    in a network layout. node1 node2 A B A C A D B C C D (but don’t overinterpret any one layout, and remember that network layouts are fickle and minor changes to thresholds and data input choices can change everything)