Upgrade to Pro — share decks privately, control downloads, hide ads and more …

On the Universality of the Stochastic Block Model

On the Universality of the Stochastic Block Model

Presented at the SYNS Warm-up event of NetSci 2019 (https://www.networkscienceinstitute.org/syns).

Paper: https://journals.aps.org/pre/abstract/10.1103/PhysRevE.98.032309
arXiv: https://arxiv.org/abs/1806.04214
Code: https://github.com/jg-you/sbm_canonical_mcmc

Abstract
========
Mesoscopic pattern extraction (MPE) is the problem of finding a partition of the nodes of a complex network that maximizes some objective function. Many well-known network inference problems fall in this category, including, for instance, community detection, core-periphery identification, and imperfect graph coloring. In this paper, we show that the most popular algorithms designed to solve MPE problems can in fact be understood as special cases of the maximum likelihood formulation of the stochastic block model (SBM) or one of its direct generalizations. These equivalence relations show that the SBM is nearly universal with respect to MPE problems.

Jean-Gabriel Young

May 26, 2019
Tweet

More Decks by Jean-Gabriel Young

Other Decks in Science

Transcript

  1. O U S B SYNS : I’ ... Jean-Gabriel Young

    Center for the Study of Complex Systems, University of Michigan, Ann Arbor, MI, USA jgyoung.ca @_jgyou May th, Based on joint work with Guillaume St-Onge, Patrick Desrosiers, Louis J. Dubé
  2. Mesoscopic pattern extraction (MPE) with objective functions What truly matters

    : the objective function. Explore Partition Space Objective Function E : Propose partitions σ1 , σ2 , ... O : Order partitions H(σo ) ≥ H(σi ) ∀i
  3. A large class of MPE objectives G The class of

    objective functions that can be decomposed as H(σ, A; f ) (ij) f (aij , , σi , σj ), with f (aij , σi , σj ) a score function for node pair (i, j). f (1 , , ) f (0, , ) 2 1 3 W.W. Zachary, J. Anthropol. Res. ( ), http://www-persona .umich.edu/∼mejn/netdata/
  4. Organization tools relationships to organize objective functions E H1 ∼

    H2 Order partitions σA , σB the same way S H1 ⇓ H2 Restrict the “expressiveness” of H1 to get H2.
  5. Stochastic Block Model Near the top of the hierarchy sits...

    T A Generative model that creates network based on a hidden partition σ, a density matrix ω P(A|σ, ω) ij P(Aij |σi , σj , ω) ij (ωσi σj )Aij Aij e−ωσi σj 2 1 3 ω
  6. Stochastic Block Model To find communities we : maximize the

    likelihood w.r.t. to σ. L SBM P(A|σ, ω) ij (ωσi σj )Aij Aij e−ωσi σj
  7. Stochastic Block Model To find communities we : maximize the

    likelihood w.r.t. to σ. L SBM P(A|σ, ω) ij (ωσi σj )Aij Aij e−ωσi σj Since argmax f (x) argmax log f (x), we can also use log P(σ|A, ω) ∼ ij [Aij log ωσi σj − ωσi σj ]
  8. Stochastic Block Model To find communities we : maximize the

    likelihood w.r.t. to σ. L SBM P(A|σ, ω) ij (ωσi σj )Aij Aij e−ωσi σj Since argmax f (x) argmax log f (x), we can also use log P(σ|A, ω) ∼ ij [Aij log ωσi σj − ωσi σj ] But! That’s suspiciously close to : H(σ, A; f ) (ij) f (aij , σi , σj )
  9. By restricting ω : we get a whole hierarchy! Max.

    likelihood of the SISBM aij log ωσiσj − ωσiσj λij ωσiσj ≥ 0, λij ≥ 0 Max. likelihood of the SBM aij log ωσiσj − ωσiσj ωσiσj ≥ 0 Max. likelihood of the SIGMGM xσiσj aij + γλij λij ≥ 0, xσiσj ∈ {0, 1}, γ < 0 General modularities xσiσj aij + γλij λij ≥ 0, γ < 0 Modularities δσiσj aij + γλij λij ≥ 0, γ < 0 Max. likelihood of the GMGM xσiσj aij + γ xσiσj ∈ {0, 1}, γ < 0 Edge counts with quadratic size constraints xσiσj aij + γ xσiσj ∈ {0, 1}, γ < 0 Balanced cut δσiσj aij + γ γ < 0 Balanced coloring (1 − δσiσj ) aij + γ γ < 0 Core-periphery e.g.: δσi1 δσj 1 aij + γ γ < 0 *we actually need to use a more general SBM
  10. Why does this matter? . S ! There is good

    code that will fit the SBM.
  11. Why does this matter? . S ! There is good

    code that will fit the SBM. . S MPE ! You can see them as fitting a model -> statistically principled.
  12. Why does this matter? . S ! There is good

    code that will fit the SBM. . S MPE ! You can see them as fitting a model -> statistically principled. . D MPE ? They are fitting ill-defined models.
  13. Why does this matter? . S ! There is good

    code that will fit the SBM. . S MPE ! You can see them as fitting a model -> statistically principled. . D MPE ? They are fitting ill-defined models. . D ’ ! If algorithm A and B find the same partition, not necessarily a sign of consensus.
  14. Why does this matter? . S ! There is good

    code that will fit the SBM. . S MPE ! You can see them as fitting a model -> statistically principled. . D MPE ? They are fitting ill-defined models. . D ’ ! If algorithm A and B find the same partition, not necessarily a sign of consensus. . T ! Trivial reduction for NP-hardness. Detectability is a general phenomenon. And more?
  15. Further reading O ( ) J.-G. Young, . St-Onge and

    P. Desrosiers, Louis J. Dubé Phys. Rev. E, ( ) Inspiration / related work ( ) J. Reichardt and S. Bornholdt, Phys. Rev. E ( ) ( ) M.E.J Newman, Phys. Rev. E ( ) ( ) S. C. Olhede and P. J. Wolfe, PNAS , ( )
  16. Take-home message Objective function = important part of MPE algorithms

    Organization : via equivalence and specialization Max. likelihood SBM is almost universal as MPE goes Reference : Phys. Rev. E, ( ) Software : github.com/jg-you/sbm_canonica _mcmc