(1) where f is the similarity function and U ∈ Zmxn S ∈ R+ mxm U is sparse, and S is massive S is symmetrical, sparsity ≤ 50% Nature of the data affects f Distributed execution: Workers need access to S Need to maintain designated copies of U Look for scipy.[sparse, spatial, dist], sklearn.metrics Jaidev Deshpande Introduction to Recommender Systems