Slide 9
Analysis
• Breaks chunks of text into terms (tokens)
• Helps build the inverted index
• Elasticsearch ships with many built-in analyzers (you can also define
your own)
• Examples:
• Snowball analyzer: removes stop words and reduces terms to their stemmed
form
• “The world's tallest building” -> [“world”, “tallest”, “build”] (the English stemmer strips “-ing” but leaves the superlative “tallest” unchanged)
• N-gram analyzer: breaks terms into overlapping n-grams (good for autocomplete)
• “future” -> [“fut”, “utu”, “tur”, “ure”] (n = 3)
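The analysis steps above can be sketched in plain Python. This is a minimal illustration, not how Elasticsearch implements its analyzers: the `STOP_WORDS` set and the `analyze`, `ngrams`, and `build_inverted_index` helpers are hypothetical names made up for this example, tokenization is a naive whitespace split, and no stemming is performed.

```python
from collections import defaultdict

# Hypothetical, tiny stop-word list used only for this illustration.
STOP_WORDS = {"the", "a", "an", "of", "in", "is"}


def analyze(text):
    """Lowercase, split on whitespace, strip punctuation, drop stop words."""
    tokens = []
    for raw in text.lower().split():
        term = raw.strip(".,;:!?\"'")
        if term and term not in STOP_WORDS:
            tokens.append(term)
    return tokens


def ngrams(term, n):
    """Return every contiguous substring of length n (sliding window)."""
    return [term[i:i + n] for i in range(len(term) - n + 1)]


def build_inverted_index(docs):
    """Map each analyzed term to the sorted ids of documents containing it."""
    index = defaultdict(set)
    for doc_id, text in docs.items():
        for term in analyze(text):
            index[term].add(doc_id)
    return {term: sorted(ids) for term, ids in index.items()}


if __name__ == "__main__":
    docs = {
        1: "The tallest building in the world",
        2: "A building of the future",
    }
    index = build_inverted_index(docs)
    print(index["building"])    # [1, 2]
    print(ngrams("future", 3))  # ['fut', 'utu', 'tur', 'ure']
```

The inverted index maps each term to the documents that contain it, so a query only needs to be run through the same analysis and looked up term by term. Swapping `analyze` for a different analyzer changes which terms end up in the index and therefore which queries match.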
Lucas Saldanha® - http://www.lucassaldanha.com