Slide 4
Slide 4 text
Crawling
● Comes under Information Retrieval (IR) - a science that deals with
retrieving information from set of documents
● IR comes with huge cost - called “Cost of Retrieval”
● Google can easily crawl images, text & videos. However, struggles with
Javascript based sites
● Many types of crawlers based on content type, devices, location
○ Adbots, Imagebots, Page resource load bots, mobile bots, desktop bots