Upgrade to Pro — share decks privately, control downloads, hide ads and more …

Topological Data Analysis Visual presentation o...

Topological Data Analysis Visual presentation of multidimensional data sets

Talk by Edward Kibardin, Head of Data and Analytics @Base79 at Data Science London meetup

Data Science London

January 12, 2014
Tweet

More Decks by Data Science London

Other Decks in Technology

Transcript

  1. Topology The  Seven  Bridges  of  Königsberg,  a  problem  solved  by

     Leonard  Euler  (1736). The  study  of  qualitative  properties  of  certain   objects  (topological  spaces)  that  are  invariant   under  a  certain  kind  of  transformation   (continuous  map),  especially  those  properties   that  are  invariant  under  a  certain  kind  of   equivalence  (homeomorphism).
  2. Topology  Data  Analysis  Pipeline a b a.  First  approximate  the

     unknown  space  X  in   a  combinatorial  structure  K b.  Then  compute  topological  invariants  of  K
  3. Topological  Invariants A  topological  invariant  is  a  map  f  

     that  assigns  the  same  object  to homeomorphic  spaces,  that  is: Homology:  is  a  machine  that   converts  local  data  about  a  space   into  global  algebraic  structure Reference:  Wikipedia,  2010.
  4. Morse  Theory  and  Reeb  Graph Theorem:   Suppose  h  :

     X  g        is  a  discrete   Morse  function. Then  X  is  homotopy  equivalent  to  a   CW-­‐‑complex  with  exactly  one  cell  of   dimension  p  for  each  critical  simplex   of  dimension  p. Reference:  Teng  Ma  ;  Zhuangzhi  Wu  ;  Pei  Luo  ;  Lu  Feng.  Reeb  graph  computation  through  spectral  clustering,  2011.
  5. Case  study:  Netflix  dataset Music Indian Anime French Honk  

    Kong US   Cartoons Kids Movie Ger man US Retro Horror