Slide 1

Slide 1 text

O U S B SYNS : I’ ... Jean-Gabriel Young Center for the Study of Complex Systems, University of Michigan, Ann Arbor, MI, USA jgyoung.ca @_jgyou May th, Based on joint work with Guillaume St-Onge, Patrick Desrosiers, Louis J. Dubé

Slide 2

Slide 2 text

“I made this...” projects

Slide 3

Slide 3 text

“I made this...” projects Marie Kondo projects

Slide 4

Slide 4 text

Political Books co-purchase network V. Krebs, unpublished, http://www.orgnet.com/

Slide 5

Slide 5 text

Political Books co-purchase network V. Krebs, unpublished, http://www.orgnet.com/

Slide 6

Slide 6 text

Political Books co-purchase network V. Krebs, unpublished, http://www.orgnet.com/

Slide 7

Slide 7 text

Political Books co-purchase network V. Krebs, unpublished, http://www.orgnet.com/

Slide 8

Slide 8 text

Political Books co-purchase network V. Krebs, unpublished, http://www.orgnet.com/

Slide 9

Slide 9 text

Political Books co-purchase network V. Krebs, unpublished, http://www.orgnet.com/

Slide 10

Slide 10 text

Political Books co-purchase network V. Krebs, unpublished, http://www.orgnet.com/

Slide 11

Slide 11 text

Political Books co-purchase network V. Krebs, unpublished, http://www.orgnet.com/

Slide 12

Slide 12 text

Political Books co-purchase network V. Krebs, unpublished, http://www.orgnet.com/

Slide 13

Slide 13 text

Our goal : Organize MPE algorithms.

Slide 14

Slide 14 text

Mesoscopic pattern extraction (MPE) with objective functions MPE Algorithm

Slide 15

Slide 15 text

Mesoscopic pattern extraction (MPE) with objective functions What truly matters : the objective function. Explore Partition Space Objective Function E : Propose partitions σ1 , σ2 , ... O : Order partitions H(σo ) ≥ H(σi ) ∀i

Slide 16

Slide 16 text

A large class of MPE objectives G The class of objective functions that can be decomposed as H(σ, A; f ) (ij) f (aij , , σi , σj ), with f (aij , σi , σj ) a score function for node pair (i, j). f (1 , , ) f (0, , ) 2 1 3 W.W. Zachary, J. Anthropol. Res. ( ), http://www-persona .umich.edu/∼mejn/netdata/

Slide 17

Slide 17 text

Two-part algorithms in practice Core-periphery Greedy (Kernighan-Lin type)

Slide 18

Slide 18 text

Two-part algorithms in practice Core-periphery Greedy (Kernighan-Lin type) Modularity Greedy (Kernighan-Lin type)

Slide 19

Slide 19 text

Two-part algorithms in practice Core-periphery Greedy (Kernighan-Lin type) Modularity Greedy (Kernighan-Lin type) Spectral + Gaussian Mixture

Slide 20

Slide 20 text

Our goal : Organize MPE algorithms

Slide 21

Slide 21 text

Our goal : Organize MPE algorithms using their objective functions

Slide 22

Slide 22 text

Organization tools relationships to organize objective functions

Slide 23

Slide 23 text

Organization tools relationships to organize objective functions E H1 ∼ H2 Order partitions σA , σB the same way

Slide 24

Slide 24 text

Organization tools relationships to organize objective functions E H1 ∼ H2 Order partitions σA , σB the same way S H1 ⇓ H2 Restrict the “expressiveness” of H1 to get H2.

Slide 25

Slide 25 text

We’ll use equivalence and specialization to construct a hierarchy.

Slide 26

Slide 26 text

Stochastic Block Model Near the top of the hierarchy sits...

Slide 27

Slide 27 text

Stochastic Block Model Near the top of the hierarchy sits... T A Generative model that creates network based on a hidden partition σ, a density matrix ω P(A|σ, ω) ij P(Aij |σi , σj , ω) ij (ωσi σj )Aij Aij e−ωσi σj 2 1 3 ω

Slide 28

Slide 28 text

Stochastic Block Model To find communities we : maximize the likelihood w.r.t. to σ. L SBM P(A|σ, ω) ij (ωσi σj )Aij Aij e−ωσi σj

Slide 29

Slide 29 text

Stochastic Block Model To find communities we : maximize the likelihood w.r.t. to σ. L SBM P(A|σ, ω) ij (ωσi σj )Aij Aij e−ωσi σj Since argmax f (x) argmax log f (x), we can also use log P(σ|A, ω) ∼ ij [Aij log ωσi σj − ωσi σj ]

Slide 30

Slide 30 text

Stochastic Block Model To find communities we : maximize the likelihood w.r.t. to σ. L SBM P(A|σ, ω) ij (ωσi σj )Aij Aij e−ωσi σj Since argmax f (x) argmax log f (x), we can also use log P(σ|A, ω) ∼ ij [Aij log ωσi σj − ωσi σj ] But! That’s suspiciously close to : H(σ, A; f ) (ij) f (aij , σi , σj )

Slide 31

Slide 31 text

By restricting ω : we get a whole hierarchy!

Slide 32

Slide 32 text

By restricting ω : we get a whole hierarchy! Max. likelihood of the SISBM aij log ωσiσj − ωσiσj λij ωσiσj ≥ 0, λij ≥ 0 Max. likelihood of the SBM aij log ωσiσj − ωσiσj ωσiσj ≥ 0 Max. likelihood of the SIGMGM xσiσj aij + γλij λij ≥ 0, xσiσj ∈ {0, 1}, γ < 0 General modularities xσiσj aij + γλij λij ≥ 0, γ < 0 Modularities δσiσj aij + γλij λij ≥ 0, γ < 0 Max. likelihood of the GMGM xσiσj aij + γ xσiσj ∈ {0, 1}, γ < 0 Edge counts with quadratic size constraints xσiσj aij + γ xσiσj ∈ {0, 1}, γ < 0 Balanced cut δσiσj aij + γ γ < 0 Balanced coloring (1 − δσiσj ) aij + γ γ < 0 Core-periphery e.g.: δσi1 δσj 1 aij + γ γ < 0 *we actually need to use a more general SBM

Slide 33

Slide 33 text

The Stochastic Block Model is in that it is at the top of the hierarchy

Slide 34

Slide 34 text

Why does this matter? . S ! There is good code that will fit the SBM.

Slide 35

Slide 35 text

Why does this matter? . S ! There is good code that will fit the SBM. . S MPE ! You can see them as fitting a model -> statistically principled.

Slide 36

Slide 36 text

Why does this matter? . S ! There is good code that will fit the SBM. . S MPE ! You can see them as fitting a model -> statistically principled. . D MPE ? They are fitting ill-defined models.

Slide 37

Slide 37 text

Why does this matter? . S ! There is good code that will fit the SBM. . S MPE ! You can see them as fitting a model -> statistically principled. . D MPE ? They are fitting ill-defined models. . D ’ ! If algorithm A and B find the same partition, not necessarily a sign of consensus.

Slide 38

Slide 38 text

Why does this matter? . S ! There is good code that will fit the SBM. . S MPE ! You can see them as fitting a model -> statistically principled. . D MPE ? They are fitting ill-defined models. . D ’ ! If algorithm A and B find the same partition, not necessarily a sign of consensus. . T ! Trivial reduction for NP-hardness. Detectability is a general phenomenon. And more?

Slide 39

Slide 39 text

Further reading O ( ) J.-G. Young, . St-Onge and P. Desrosiers, Louis J. Dubé Phys. Rev. E, ( ) Inspiration / related work ( ) J. Reichardt and S. Bornholdt, Phys. Rev. E ( ) ( ) M.E.J Newman, Phys. Rev. E ( ) ( ) S. C. Olhede and P. J. Wolfe, PNAS , ( )

Slide 40

Slide 40 text

Take-home message Objective function = important part of MPE algorithms Organization : via equivalence and specialization Max. likelihood SBM is almost universal as MPE goes Reference : Phys. Rev. E, ( ) Software : github.com/jg-you/sbm_canonica _mcmc

Slide 41

Slide 41 text

Reference : Phys. Rev. E, ( ) arXiv : 1806.04214 jgyou@umich.edu jgyoung.ca @_jgyou