Time-sensitive Network Inference in Diffusion Networks

Time-sensitive Network Inference in Continuous-Time Diffusion Networks Emaad Ahmed Manzoor
CS229: Final Presentation

Networks

Social Networks

Epidemic Networks Individual stick figures from xkcd.com

Information Networks

Diffusion

Diffusion Formalization

0 T Cascade

0 T Parents & Children

0 T Infection Times t 1 t 2 t 3
t 4

0 T Observation Limit t 1 t 2 t 3
t 4

0 T Underlying Network t 1 t 2 t 3
t 4

0 T Transmission Times 0.1 0.7 0.1 0.4 0.7 0.3
0.5 0.8 0.6 0.9

f(t i ) Probability that node i is infected at
time t i

time t i f(t i |t j ) Probability that node i is infected at time t i given that node j is infected at time t j

time t i f(t i |t j ) Probability that node i is infected at time t i given that node j is infected at time t j f(t i |t j ) = f ij (t i - t j )

Exponential Pairwise Transmission Function

Network Inference

Set C of cascades

Set C of cascades Each cascade is a set of
observations

observations Each cascade is observed until a horizon time

observations Each cascade is observed until a horizon time Nodes not infected before this horizon time

observations Each cascade is observed until a horizon time Nodes not infected before this horizon time Pairwise transmission functions

observations Each cascade is observed until a horizon time Nodes not infected before this horizon time Pairwise transmission functions Find transmission rates that maximise the likelihood of the observed cascades

Metrics Precision Recall

State of the Art

Uncovering the temporal dynamics of diffusion networks Influence maximisation in
continuous-time diffusion networks Scalable influence estimation in continuous time diffusion networks Rodriguez et al. ICML '11 Rodriguez et al. ICML '12 Du et al. NIPS '13

Rodriguez et al. ICML '11 Uncovering the temporal dynamics of
diffusion networks 1. Define cascade likelihood as the objective function 2. Since this function is convex, the problem is a constrained maximisation problem over transmission rates

Rodriguez et al. ICML '11 Uncovering the temporal dynamics of
diffusion networks "Our formulation thus does not depend on the absolute time of infection of the root node" "Transmission functions are shift invariant, and do not depend on the absolute times of infection of the pair of nodes"

Contributions

Independent transmission rates

Independent transmission rates Bayes network inference?

Independent transmission rates Bayes network inference? ?

States aSleep Awake

Contribution 1: Formulate a time-dependent transmission function as a discrete
mixture of distributions.

Contribution 2: Model the time-dependent priors with circular normal distributions

Per node Per edge Unknowns How do we fit these
from the data?

Contribution 3: EM algorithm to fit the unknown parameters from
the data 1. Initialize the state for each node in each cascade randomly; S ic = random(A, S) 2. Estimate and d for every pair of nodes using convex optimisation (Manuel et al., 2011). 3. Estimate and using closed- form maximum-likelihood estimates. 4. Reassign new states S ic to nodes in each cascade

Contribution 3: EM algorithm to fit the unknown parameters from
the data 4. Reassign new states S ic to nodes in each cascade

Results

Synthetic data: 1024 nodes Kronecker core-periphery (Leskovec, '08), transmission times
and root nodes chosen uniformly at random, 1000 cascades. Real data: Memetracker, 1M nodes, >100K cascades.

Future

Algorithm: Continuous states Remove stationarity assumption Implementation: Parallelism Speed Experiments:
Real data New synthetic data

Time-sensitive Network Inference in Diffusion N...

Time-sensitive Network Inference in Diffusion Networks

More Decks by Emaad Manzoor

Other Decks in Research

Featured

Transcript