U., 2009. Do not crawl in the DUST: Different URLs with similar text. ACM Transactions on the Web (TWEB), 3(1), pp.1-31.
'On the efficient determination of most near neighbors: horseshoes, hand grenades, web search and other situations when close is close enough' (Manasse, 2022)
'Web crawling' (Olston, C. and Najork, M., 2010)
'High performance web crawling' (Najork, 2002)
'Modern information retrieval' (Baeza-Yates, 1999)