This 2008 Carnegie Mellon Night presentation at Fidelity's Center for Advanced Technology discusses Endeca's approach to faceted search and human-computer information retrieval (HCIR).
/ what is endeca? • Software to help people explore, analyze, and understand complex information, guiding them to unexpected insights and better decisions. • 500+ customers • $108M revenue in 2007.
do they? 78% wish search engines could read their minds. What frustrates users most? – 25%: deluge of results – 24%: too many paid listings – 19%: inability to understand their keywords – 19%: disorganized / random results (Source: The State of Search, Autobytel & Kelton Research, Oct ’07)
search vs. enterprise search “Search on the internet is solved. I always find what I need. But why not in the enterprise? Seems like a solution waiting to happen.” - a Fortune 500 CTO
get what you pay for • There are easy use cases… – 30% of queries are navigational. – 30% of queries lead to Wikipedia pages. – Users won’t pay, but advertisers will! • …and hard use cases. – Queries where recall matters. – Exploratory search. – Enterprises will pay for insight.
alone can’t provide insight • The system can’t read your mind. • Your spouse / best friend can’t read your mind. • Sometimes you can’t read your own mind.
information retrieval • Instead of guessing the user’s intent, optimize communication. • De-emphasize the top ten documents; response is a set of documents. • Think beyond single queries; support refinement and exploration.
approach: guided summarization • Set retrieval that responds to queries with – an overview of the user’s current context. – an organized set of options for incremental exploration. • Contextual summaries of document sets optimize system’s communication with user. • Query refinement options optimize user’s communication with system.
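
The guided-summarization idea above can be illustrated with a small faceted-search sketch. This is not Endeca's implementation: the toy collection, the field names, and the `guided_search` function are all assumptions made up for illustration. It only shows the shape of the response the slide describes: a result set, a summary of that set (facet value counts), and a list of refinement options that would narrow it further.

```python
from collections import Counter, defaultdict

# Hypothetical toy collection; ids, text, and facet names are invented.
DOCS = [
    {"id": 1, "text": "red wine from france",
     "facets": {"type": "red", "country": "France"}},
    {"id": 2, "text": "white wine from france",
     "facets": {"type": "white", "country": "France"}},
    {"id": 3, "text": "red wine from chile",
     "facets": {"type": "red", "country": "Chile"}},
    {"id": 4, "text": "sparkling wine from italy",
     "facets": {"type": "sparkling", "country": "Italy"}},
]


def matches(doc, keywords, selected):
    """True if the doc contains every keyword and every selected facet value."""
    words = set(doc["text"].split())
    if not all(k in words for k in keywords):
        return False
    return all(doc["facets"].get(f) == v for f, v in selected.items())


def guided_search(docs, query="", selected=None):
    """Return the matching set, a summary of it, and refinement options.

    The response is a set of documents, not a ranked top ten.
    """
    selected = selected or {}
    keywords = query.lower().split()
    result_set = [d for d in docs if matches(d, keywords, selected)]

    # Contextual summary: how the current result set breaks down per facet.
    summary = defaultdict(Counter)
    for d in result_set:
        for facet, value in d["facets"].items():
            summary[facet][value] += 1

    # Refinement options: facets not yet selected that still split the set.
    refinements = {
        facet: dict(counts)
        for facet, counts in summary.items()
        if facet not in selected and len(counts) > 1
    }
    return {
        "ids": [d["id"] for d in result_set],
        "summary": {f: dict(c) for f, c in summary.items()},
        "refinements": refinements,
    }
```

Calling `guided_search(DOCS, "wine")` returns all four documents together with counts by type and country. Calling it again with `selected={"country": "France"}` narrows the set to two documents and offers only `type` as a remaining refinement. Each response both summarizes where the user is and lists the next steps available.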