Slide 16
Slide 16 text
Scoring documents
• Objective: estimate the relevance of documents in the collection w.r.t. the input
query q (so that the highest-scoring ones can be returned as retrieval results)
• In principle, this would mean scoring all documents in the collection
• In practice, we’re only interested in the top-k results for each query
• Common form of a retrieval function
score(d, q) =
t∈q
wt,d × wt,q
◦ where wt,d
is the weight of term t in document d and wt,q
is the weight of that term
in the query q
16 / 24