Slide 1

Slide 1 text

Decentralised Periodic Encounter Community Detection Matthew Williams (M.J.Williams@cs.cf.ac.uk) VLunch 27th May 2010 Supervisors: Dr. Roger Whitaker, Dr. Stuart Allen

Slide 2

Slide 2 text

Overview • Background and motivation • Periodic encounter communities • Decentralised periodic encounter community detection • Analysis of decentralised detection algorithm • Conclusions and future work

Slide 3

Slide 3 text

Background & Motivation

Slide 4

Slide 4 text

Opportunistic Networks • Opportunistic networks (oppnets) are a broad class of networks where messages are spread by the mobility of individuals and their occasional physical encounters • Encounters are the fundamental unit of communication in these networks • In many real-world cases, the behaviour of these nodes results in temporary physical communities occurring • Example networks include: human (PSNs), vehicular (VANETs), and wildlife monitoring

Slide 5

Slide 5 text

Encounter-Aware Content Sharing in OppNets • Recently, oppnets content sharing protocols have tried to capture patterns in encounters • Attempt to assess likelihood of seeing node in the future • Some protocols model periodicity of encounters • All models only consider immediate neighbours -- broader community structure is ignored • All models require predefined domain-specific periods to be set

Slide 6

Slide 6 text

Static Community Detection • Identify components in large graphs • Global-knowledge, offline algorithms • Static: single, time-agnostic graph • Distributed algorithm used in oppnets content sharing Fortunato 2010

Slide 7

Slide 7 text

Periodic Communities • It is intuitive that the underlying behaviour of nodes results in communities of nodes re-appearing regularly in time • Also evidenced in empirical datasets by PSE-Miner and other analyses • We seek to join the concepts of node communities and periodicity • Decentralised approach necessary in oppnets • With automatic detection of periods Periodic Zebra Communities period = 7 days period = 2 months Lahiri et al. 2010

Slide 8

Slide 8 text

Periodic Encounter Communities

Slide 9

Slide 9 text

Dynamic Encounter Representation • A dynamic encounter network is a time series of graphs • Each graph is a snapshot of encounters occurring during a time interval

Slide 10

Slide 10 text

Periodic Encounter Community • We formalise a Periodic Encounter Community (PEC) as • where • C is a connected graph (the community) • S is the harmonic information hC, Si S = (tstart, tend, )

Slide 11

Slide 11 text

PEC Example Example Dynamic Network Periodic Encounter Communities

Slide 12

Slide 12 text

PEC Redundancy • Harmonic maximality: • Multiple ways to fit harmonic information to the same community, but only one is parsimonious • Some PECs capture more information than others • One PEC may subsume another’s information

Slide 13

Slide 13 text

Maximality and Parsimony • Harmonic maximality: • Community does not exist for factors of the period, nor can it be extended in time • Structural maximality: • Cannot add edges or nodes to the community and still maintain its existence in the dynamic network • Parsimony: • A PEC is parsimonious if it is both harmonically maximal and structurally maximal

Slide 14

Slide 14 text

Decentralised PEC-D Problem • Decentralised PEC Detection is the problem of having all nodes detect the parsimonious PECs they belong to, without global knowledge of the network

Slide 15

Slide 15 text

Decentralised PEC Detection Algorithm

Slide 16

Slide 16 text

Algorithm Overview • Local Mining: • Obtain PECs that are parsimonious in their local encounter histories • Local Sharing: • Nodes share and combine their intermediate parsimonious PECs when they meet • Over time, nodes build towards the PECs that are parsimonious in the global dynamic graph local mining local sharing & merging globally parsimonious PECs local encounter histories

Slide 17

Slide 17 text

Intrinsic Dynamic Networks • Global dynamic network can be decomposed into intrinsic dynamic networks • Intrinsic DN corresponds to the encounter information directly observable by a node local encounter histories

Slide 18

Slide 18 text

Local Miner Algorithm • Invertible map from graphs to sets of integers • Edges and nodes given unique integer identifiers • Becomes a problem of mining periodic subsets in a time series of integer sets • Periodic pattern mining in temporal data mining field • Polynomial time complexity • Local returns locally-parsimonious PECs local mining

Slide 19

Slide 19 text

Joining PECs • Two PECs are compatible if the following hold: • their communities intersect • the PECs are harmonically equal, or one harmonically subsumes the other • If compatible, there are three generation cases: case action action harmonic equality merge communities keep harmonic information P1 harmonically subsumes P2 merge communities harmonic information from P2 P2 harmonically subsumes P1 merge communities harmonic information from P1 local sharing & merging

Slide 20

Slide 20 text

Opportunistic Construction • Each node holds local Knowledge Base (KB) of its PECs so far • Node only holds non-subsumed PECs which node itself belongs to • On encounter, a pair of nodes: • Share KBs • Generate candidate PECs • Store any more-maximal candidates • Remove any redundancies PEC Generation Cases local sharing & merging

Slide 21

Slide 21 text

Analysis of Decentralised Detection

Slide 22

Slide 22 text

PEC Construction Lag • Parsimonious PEC construction relies on encounters between nodes • The underlying ordering of edges influences how PECs propagate to nodes • Given the encounters predicted by a PEC, how long does it take the nodes to reach parsimony?

Slide 23

Slide 23 text

Worst Case Construction Lag • Pessimistic constraints: • No encounters extrinsic to the PEC -- only use edges predicted by the PEC • No repeat encounters in same time step • Under these constraints, construction lag depends on underlying ordering of encounters • With a worst-case encounter ordering, the construction lag for a PEC is function of the period and diameter: (d 1) + 1

Slide 24

Slide 24 text

Real Data Experiments • MIT Reality Mining data • Bluetooth encounters between 100 subjects over 9 months • Construction lag experiments • For a given parsimonious PEC, using only the edges predicted by that PEC, how long would it the whole community to detect the PEC? • With reference to the worst-case?

Slide 25

Slide 25 text

PEC Periodicities (cumulative dist.) granularity = 24 hours granularity = 6 hours

Slide 26

Slide 26 text

granularity = 24 hours Normalised Construction Lags (cumulative dist.) granularity = 6 hours

Slide 27

Slide 27 text

Conclusions

Slide 28

Slide 28 text

Conclusions • Globally parsimonious PECs can be mined decentrally, and with automatic periodicity identification • Time for globally parsimonious PEC construction is bounded by PEC period and diameter • On real data, construction time is much better than the analytic worst-case

Slide 29

Slide 29 text

Future Work • Other datasets • Synthetic models for generating encounter orderings • Fuzzy PEC-D • PEC-D applied to oppnets content sharing

Slide 30

Slide 30 text

Thanks! Matthew Williams http://users.cs.cf.ac.uk/M.J.Williams M.J.Williams@cs.cf.ac.uk socialnets http://www.social-nets.eu Any questions?

Slide 31

Slide 31 text

References M. Lahiri and T.Y. Berger-Wolf. Periodic subgraph mining in dynamic networks. Knowledge and Information Systems, Volume 24, Issue 3 (2010), p. 467. S. Fortunato. Community detection in graphs. Physics Reports, 486 (3-5) (2010), pp. 75– 174 Attribution Library Courtyard. nevolution. http://www.flickr.com/photos/nevolution/2906377551/