Slide 1

Slide 1 text

Why People Search for Images using Web Search Engines WSDM 2018 ࿦จಡΈձ 20180414 ٠ా ངฏ (@yohei_kikuta) Event URL: https://atnd.org/events/95510, paper: https://arxiv.org/abs/1711.09559

Slide 2

Slide 2 text

·ͱΊ 1. text base ͷ΢Σϒը૾ݕࡧͷҙਤ͸෼ྨՄೳ͔ʁ → YES. 3ͭʹ෼ྨ: Entertain, Explore/Learn, Locate/Acquire 2. औಘՄೳͳಛ௃ྔ͔ΒҙਤΛ൑ผͰ͖Δ͔ʁ → YES. ଺ཹ࣌ؒ΍Ϛ΢εδΣενϟ 3. ηογϣϯॳظͰݕࡧҙਤΛ༧ଌͰ͖Δ͔ → MAYBE. ಛ௃ྔΛ࢖ͬͯϞσϧΛ࡞੒ͯ͠Ұఆ౓ͷੑೳ 2

Slide 3

Slide 3 text

എܠ 3

Slide 4

Slide 4 text

ݕࡧͷҙਤΛ஌Γ͍ͨ Ϣʔβͷݕࡧߦಈͷཪʹ͋ΔҙਤΛ஌Δ͜ͱ͸ॏཁ → Ϣʔβͷຬ଍౓޲্ʢsuggestion, recommendation, ...ʣ ΢Σϒݕࡧͷݚڀ͸ͳ͞Ε͖͕ͯͨɺը૾ݕࡧʹؔͯ͠͸ݶఆత → ΫΤϦϕʔε → ͔͠͠ը૾ݕࡧͷΫΤϦ͸୹͘ͳΓ͕ͪͰෆ࣮֬ੑ͕େ͖͍ ຊ࿦จͰ͸ηογϣϯ৘ใΛѻͬͯը૾ݕࡧͷҙਤΛݚڀ 4

Slide 5

Slide 5 text

ຊ࿦จʹ͓͚ΔϦαʔνΫΤενϣϯ 1. text base ͷ΢Σϒը૾ݕࡧͷҙਤ͸෼ྨՄೳ͔ʁ 2. औಘՄೳͳಛ௃ྔ͔ΒҙਤΛ൑ผͰ͖Δ͔ʁ 3. ηογϣϯॳظͰݕࡧҙਤΛ༧ଌͰ͖Δ͔ 5

Slide 6

Slide 6 text

ઌߦݚڀ 6

Slide 7

Slide 7 text

΢Σϒݕࡧʹ͓͚Δҙਤͷ taxonomy A taxonomy of web search (2002) ͰҙਤΛ3ͭʹ෼ྨ 1. Navigational: ಛఆͷαΠτ΁౸ୡ 2. Informational: ৘ใͷऔಘ 3. Transactional: ΢ΣϒΛഔհͱͨ͠׆ಈ Ref: https://dl.acm.org/citation.cfm?id=792552 7

Slide 8

Slide 8 text

΢Σϒݕࡧʹ͓͚Δҙਤͷ taxonomy Task Behaviors During Web Search: The Difficulty of Assigning Labels (2009) Ͱ͸ݕࡧλεΫΛ7ͭʹ෼ྨ » Navigate, Find-Simple, Find-Complex, Locate/Acquire, Explore/Learn, Play, Meta ຊ࿦จ͸͜ͷઌߦݚڀΛ౿ऻͭͭ͠ը૾ݕࡧʹൃలͤͨ͞΋ͷɺͱ͍͏ ৭߹͍͕ڧ͍ Ref: http://ieeexplore.ieee.org/document/4755491/ 8

Slide 9

Slide 9 text

ը૾ݕࡧͷҙਤΛ෼ྨ 9

Slide 10

Slide 10 text

Ξϓϩʔν σʔλΛूΊͯͦΕΛجʹ3ਓͷ΢Σϒݚڀऀ͕෼ྨ » ϢʔβͷΞϯέʔτσʔλ » ੑผ৘ใͳͲΛऔಘ » ࠷ۙͷݕࡧʹؔ͢ΔৄࡉʢಈػͳͲʣɺ࢖༻ͨ͠ΫΤϦ » ద੾ͳճ౴Λͨ͠211ਓ͕ର৅ 10

Slide 11

Slide 11 text

Ξϓϩʔν σʔλΛूΊͯͦΕΛجʹ3ਓͷ΢Σϒݚڀऀ͕෼ྨ » ϩάσʔλ » https://www.sogou.com/ ͷϩάσʔλ » 30෼Ҏ಺ʹ࿈ଓతͳΫΤϦΛ༩͍͑ͯΔ475ηογϣϯʢআ͘Ξμ ϧτʣ 11

Slide 12

Slide 12 text

Ξϓϩʔν σʔλΛूΊͯͦΕΛجʹ3ਓͷ΢Σϒݚڀऀ͕෼ྨ » ϩάσʔλʢlength ͸ΫΤϦ਺ʣ Ref: https://arxiv.org/abs/1711.09559 12

Slide 13

Slide 13 text

࡞੒ͨ͠൑அج४ 1. Ϣʔβͷݕࡧߦಈ͸໌֬ͳ໨తʹґΔ΋ͷ͔ʁ 2. ޙͷར༻ͷͨΊʹը૾Λμ΢ϯϩʔυ͢Δඞཁ͕͋Δ͔ʁ 13

Slide 14

Slide 14 text

3ͭͷݕࡧҙਤ 1. Explore/Learn (1-yes, 2-no) ྫʣΰϦϥͱϘϊϘͷݟͨ໨ͷҧ͍ΛνΣοΫ 2. Locate/Acquire (1-yes, 2-yes) ྫʣϨϙʔτ࡞੒Ͱ࢖͏ΰϦϥͷը૾Λ୳ͯ͠μ΢ϯϩʔυ 3. Entertain (1-no, 2-yes or no) ྫʣΰϦϥͷ໘നը૾ΛோΊΔ 14

Slide 15

Slide 15 text

3ͭͷݕࡧҙਤʢྫʣ Ref: https://arxiv.org/abs/1711.09559 15

Slide 16

Slide 16 text

ଥ౰ੑͷݕূʢ3ਓͷେֶӃੜʹΑΔҙਤ෼ྨʣ » ϢʔβͷΞϯέʔτσʔλ Fleiss' kappa: 0.673 Explore/Learn: 27%, Locate/Acquire: 66%, Entertain: 7% » ϩάσʔλʢΫΤϦͷΈΛ࢖༻ʣ Fleiss' kappa: 0.375 Explore/Learn: 56%, Locate/Acquire: 39%, Entertain: 5% ͏·͘෼͚Εͦ͏͕ͩΫΤϦͷΈͰ͸ҙਤΛ἞Ήͷ͸೉͍͠ 16

Slide 17

Slide 17 text

औಘՄೳͳಛ௃ྔͰҙਤΛ൑ผ 17

Slide 18

Slide 18 text

35ਓͷֶ෦ੜʹΑΔ12ݸͷը૾ݕࡧλεΫ ྫʣPCͷഎܠΛ੨ۭͱ৿ͷը૾ʹมߋʢLocate/Acquireʣ ͦͷࡍʹҎԼͷಛ௃ྔΛऔಘ Ref: https://arxiv.org/abs/1711.09559 18

Slide 19

Slide 19 text

ҙਤʹΑͬͯ༗ҙͳ͕ࠩग़ΔͷͰ൑ผՄೳ ఀཹ࣌ؒ͸ E/L ͕ଟ͍ɺϚ΢εΫϦοΫ਺͸ E/L < L/A < EɺͳͲ ʢৄࡉ͸࿦จΛࢀরʣ Ref: https://arxiv.org/abs/1711.09559 19

Slide 20

Slide 20 text

ηογϣϯॳظͰͷҙਤͷ༧ଌ 20

Slide 21

Slide 21 text

໰୊ઃఆ ηογϣϯॳظͱ͸ʮ࠷ॳͷϚ΢εεΫϩʔϧ͕͋Δ·Ͱʯ ༧ଌͰ࢖͏ feature ͱͯ͠ҎԼͷ஫ҙ - ΫϦοΫ਺ͱ࠷ॳͷϚ΢εΦʔόʔ࣌ؒ͸࢖Θͳ͍ - ΫΤϦϕʔεͰ΍Γ͍ͨͷͰ query reformulation ͸࢖Θͳ͍ ֶ෦ੜʹղ͔ͤͨը૾ݕࡧλεΫʹରͯ͠ GBDT Ͱ 10-fold CV 21

Slide 22

Slide 22 text

༧ଌੑೳ͸ߴ͘͸ͳ͍͕ෆՄೳͰ͸ͳͦ͞͏ Baseline ͸ majority ʹશ෦دͤΔͱ͍͏΋ͷ Ref: https://arxiv.org/abs/1711.09559 22

Slide 23

Slide 23 text

·ͱΊͱॴײ 23

Slide 24

Slide 24 text

·ͱΊʢ࠶ܝʣ 1. text base ͷ΢Σϒը૾ݕࡧͷҙਤ͸෼ྨՄೳ͔ʁ → YES. 3ͭʹ෼ྨ: Entertain, Explore/Learn, Locate/Acquire 2. औಘՄೳͳಛ௃ྔ͔ΒҙਤΛ൑ผͰ͖Δ͔ʁ → YES. ଺ཹ࣌ؒ΍Ϛ΢εδΣενϟ 3. ηογϣϯॳظͰݕࡧҙਤΛ༧ଌͰ͖Δ͔ → MAYBE. ಛ௃ྔΛ࢖ͬͯϞσϧΛ࡞੒ͯ͠Ұఆ౓ͷੑೳ 24

Slide 25

Slide 25 text

ॴײ » γϯϓϧͳج४Ͱ෼ྨΛ͍ͯ͠Δͱ͍͏ͷ͸ྑ͍ » ࢐৽ͱ͍͏Θ͚Ͱ͸ͳ͍͕ҰͭҰͭ͸͔ͬ͠Γௐ΂͍ͯΔ » ࣮αʔϏε΁ͷԠ༻ʹ͸Ұาඈ༂͕ඞཁͦ͏ʢ༧ଌੑೳͳͲʣ » ٱʑʹ਺ࣜΛશવ࢖Θͳ͍࿦จΛಡΜͰ৽઱ͩͬͨ 25