Slide 13
Slide 13 text
Clustering (similarity)
We conduct clustering on the clicked URLs of each query and its expanded queries.
■Similarity Function
S1 is a similarity function based on the OSS phenomenon,
S2 is based on the SCAK phenomenon,
S3 is based on string similarities, with α, β, and γ as weights.
OSS term SCAK term string sim term
■S1(OSS) term
ui: http://www.a.com/
http://www.b.com/: 2
http://www.c.com/: 23
http://www.d.com/: 10
http://www.e.com/: 20
ユーザ検索にてある検索ワードで共起したURL集合(mui)