Slide 30
Lucene vs. Database Search
• There are a number of query types that RDBMSs in general do not
support without vendor extensions:
• Fuzzy queries, in which "fuzzy" and "wuzzy" are considered
matches
• Word stemming queries, which consider "take," "took," and "taken"
to be identical
• Sound-like queries, which consider "cat" and "kat" to be identical
• Synonym queries, which consider "jump," "hop," and "leap" to be
identical
• Queries on binary BLOB data types, such as PDF documents,
Microsoft Word or Excel documents, or HTML and XML documents
• More significantly, SQL search results are not ranked by
match-relevance scores. The SQL standard is simply not intended for
full-text querying.
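
To make two of the points above concrete, here is a minimal Python sketch of a fuzzy match (by edit distance) and of relevance-ranked results (by a toy TF-IDF score). It is an illustration only, not Lucene's implementation: the function names `edit_distance`, `fuzzy_match`, and `rank`, the one-edit threshold, and the scoring formula are all assumptions made for this example.

```python
import math
from collections import Counter
from typing import List, Tuple


def edit_distance(a: str, b: str) -> int:
    """Levenshtein distance: minimum single-character edits turning a into b."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # delete a character from a
                            curr[j - 1] + 1,      # insert a character into a
                            prev[j - 1] + cost))  # substitute (or keep) a character
        prev = curr
    return prev[-1]


def fuzzy_match(term: str, word: str, max_edits: int = 1) -> bool:
    """Hypothetical helper: treat words within max_edits edits as a match."""
    return edit_distance(term.lower(), word.lower()) <= max_edits


def rank(query: str, docs: List[str]) -> List[Tuple[float, str]]:
    """Score each document with a toy TF-IDF sum over query terms, best first."""
    tokenized = [d.lower().split() for d in docs]
    n = len(docs)
    scored = []
    for text, tokens in zip(docs, tokenized):
        tf = Counter(tokens)
        score = 0.0
        for term in query.lower().split():
            # Document frequency: how many documents contain the term.
            df = sum(1 for t in tokenized if term in t)
            if df:
                score += tf[term] * math.log(1 + n / df)
        if score > 0:  # documents with no matching term are left out
            scored.append((score, text))
    return sorted(scored, key=lambda s: s[0], reverse=True)


if __name__ == "__main__":
    print(fuzzy_match("fuzzy", "wuzzy"))
    for score, text in rank("lucene", ["lucene search engine",
                                       "database search",
                                       "lucene lucene index"]):
        print(round(score, 3), text)
```

A plain SQL `LIKE '%lucene%'` filter would return the matching rows in no particular order. The sketch instead returns them sorted by score, which is the behavior the last bullet says standard SQL lacks.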