Beyond Text Similarity - Tune your search for your Business Domain
Good search is more than just Lucene scoring. This talk from the Search Meetup Munich covers how to get better search results and how to protect them against regressions.
Recap: Text similarity and why this stops working in the travel domain • Using recommendations and user interaction feedback • Performance! • Protect yourself against regressions
Instead of just relying on the practical scoring function, add other parameters • Use values from the real world that reflect the relevance of a given document in the whole document space
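One way to picture this is a text score multiplied by a damped real-world signal. The following is only a minimal sketch, not the method used in the talk: the function name blended_score, the bookings field, the weight parameter and the log damping are all assumptions made for illustration.

```python
import math


def blended_score(text_score, popularity, weight=1.0):
    """Scale a text-similarity score by a damped popularity boost.

    Hypothetical helper: log1p keeps very popular documents from
    completely drowning out the text score.
    """
    return text_score * (1.0 + weight * math.log1p(popularity))


# Made-up documents: Hotel A matches the text better, Hotel B is booked far more often.
docs = [
    {"name": "Hotel A", "text_score": 2.0, "bookings": 0},
    {"name": "Hotel B", "text_score": 1.5, "bookings": 500},
]

ranked = sorted(
    docs,
    key=lambda d: blended_score(d["text_score"], d["bookings"]),
    reverse=True,
)
```

With these made-up numbers the heavily booked hotel overtakes the slightly better text match. How strongly the signal should count (the weight) is a tuning decision for the domain at hand.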
enough to provide valuable feedback about our data • They rate and recommend things (hotels) • They click on things (everywhere*) *except ads • We also have a geospatial relation between hotels and destinations
rating! http://www.evanmiller.org/how-not-to-sort-by-average-rating.html Maybe use the lower bound of the Wilson score confidence interval for a Bernoulli parameter!
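The Wilson lower bound can be sketched in a few lines. This assumes each recommendation is a yes/no vote; the function name wilson_lower_bound and the default z = 1.96 (roughly a 95% interval) are choices made for this example, not something the slides prescribe.

```python
import math


def wilson_lower_bound(positive, total, z=1.96):
    """Lower bound of the Wilson score interval for a Bernoulli parameter.

    positive: number of positive votes (e.g. "would recommend")
    total:    number of all votes
    z:        normal quantile; 1.96 is an assumed default (~95% confidence)
    """
    if total == 0:
        return 0.0  # no votes yet, so no evidence of quality
    p_hat = positive / total
    z2 = z * z
    centre = p_hat + z2 / (2 * total)
    margin = z * math.sqrt((p_hat * (1 - p_hat) + z2 / (4 * total)) / total)
    return (centre - margin) / (1 + z2 / total)


# One perfect vote ranks below 90 positive votes out of 100.
few_votes = wilson_lower_bound(1, 1)
many_votes = wilson_lower_bound(90, 100)
```

A hotel with a single positive vote has an average of 100% but a low lower bound, so it no longer outranks a hotel with 90 positive votes out of 100.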
that brought in money • Generate tests that make sure that for each search term the relevant result is in the result set • Define a threshold for OK (qualitative tests) • Execute on CI!
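Such a test could look roughly like the sketch below. Everything here is hypothetical: fake_search stands in for the real search backend, the search terms and hotel ids are invented, and the 0.6 threshold is an arbitrary example value.

```python
def pass_rate(search, expectations, top_k=10):
    """Fraction of search terms whose expected document is in the top_k results."""
    hits = 0
    for term, expected_id in expectations.items():
        if expected_id in search(term)[:top_k]:
            hits += 1
    return hits / len(expectations)


def fake_search(term):
    """Hypothetical stand-in for the real search; returns ranked document ids."""
    index = {
        "mallorca": ["h1", "h2"],
        "berlin": ["h3"],
        "rome": ["h9"],
    }
    return index.get(term, [])


# Invented expectations: search term -> id of the result that should show up.
EXPECTATIONS = {"mallorca": "h2", "berlin": "h3", "rome": "h4"}
THRESHOLD = 0.6  # assumed "OK" level; not every single term has to pass

rate = pass_rate(fake_search, EXPECTATIONS)
assert rate >= THRESHOLD, f"search quality regressed: {rate:.2f} < {THRESHOLD}"
```

Run on CI, the final assertion fails the build once the share of search terms that return their expected result drops below the threshold, instead of failing on every individual miss.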