Upgrade to Pro — share decks privately, control downloads, hide ads and more …

Better Search Through Query Understanding

Better Search Through Query Understanding

This 2014 presentation discusses query understanding in search engines. It describes how query understanding involves identifying entities and tags in queries, predicting the user's intent or topic area, expanding queries using related terms, and incorporating spelling corrections. The key aspects of query understanding covered are tagging queries for entities like names, titles, companies; predicting the user's vertical intent like jobs, people or companies; and expanding queries using name synonyms, job title synonyms or signals from past user queries and clicks. The document also suggests giving users more transparency, guidance and control over the search process.

Avatar for Daniel Tunkelang

Daniel Tunkelang

May 20, 2026

More Decks by Daniel Tunkelang

Other Decks in Technology

Transcript

  1. Recruiting Solutions Recruiting Solutions Recruiting Solutions Daniel Tunkelang Head, Query

    Understanding better search through query understanding Daniel
  2. overview  query understanding: what is it?  how we

    do query understanding at LinkedIn  some other thoughts from search in the wild what I’m not going to cover: 2
  3. Information need query select from results rank using IR model

    user: system: tf-idf PageRank bird’s-eye view of how a search engine works 3
  4. Information need query select from results rank using IR model

    user: system: tf-idf PageRank query understanding 4
  5. 6 tag: skill OR title related skills: search, ranking, …

    tag: company id: 1337 industry: internet verticals: people, jobs intent: exploratory
  6. 7 query understanding pipeline spellcheck query tagging vertical intent prediction

    query expansion raw query structured query + annotations
  7. 8 query understanding pipeline spellcheck query tagging vertical intent prediction

    query expansion raw query structured query + annotations
  8. 10 spelling out the details PEOPLE NAMES COMPANIES TITLES PAST

    QUERIES n-grams marissa => ma ar ri is ss sa metaphone mark/marc => MRK co-occurrence counts marissa:mayer = 1000 marisa meyer yahoo marissa marisa meyer mayer yahoo
  9. 11 spelling out the details problem: corpus as well as

    query logs contain many spelling errors certain spelling errors are quite frequent while genuine words (especially names) might be infrequent
  10. 12 spelling out the details problem: corpus & query logs

    contain spelling errors solution: use query chains to infer correct spelling [product manger] [product manager] CLICK [marissa mayer] CLICK
  11. 13 query understanding pipeline spellcheck query tagging vertical intent prediction

    query expansion raw query structured query + annotations
  12. 14 query tagging: identifying entities in the query TITLE CO

    GEO TITLE-237 software engineer software developer programmer … CO-1441 Google Inc. Industry: Internet GEO-7583 Country: US Lat: 42.3482 N Long: 75.1890 W (RECOGNIZED TAGS: NAME, TITLE, COMPANY, SCHOOL, GEO, SKILL )
  13. 15 query tagging: identifying entities in the query TITLE CO

    GEO MORE PRECISE MATCHING WITH DOCUMENTS
  14. 22 query tagging: sequential model EMISSION PROBABILITIES (learned from user

    profiles) TRANSITION PROBABILITIES (learned from query logs) TRAINING
  15. 24 query understanding pipeline spellcheck query tagging vertical intent prediction

    query expansion raw query structured query + annotations
  16. 27 query understanding pipeline spellcheck query tagging vertical intent prediction

    query expansion raw query structured query + annotations
  17. 30 query expansion: signals [jon] [jonathan] CLICK trained using query

    chains: [programmer] [developer] CLICK symmetric but not transitive! [francis] [frank] ⇔ [franklin] [frank] ⇔ [francis] ≠ [franklin] [software engineer] [software developer] CLICK context based! [software engineer] => [software developer] [civil engineer] ≠ [civil developer]
  18. 31 query understanding pipeline spellcheck query tagging vertical intent prediction

    query expansion raw query structured query + annotations