Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
Leveraging Crowdsourcing for Paraphrase Recogni...
Search
Sponsored
·
SiteGround - Reliable hosting with speed, security, and support you can count on.
→
kakubari
November 28, 2017
Technology
110
0
Share
Embed
Copy iframe code
Copy JS code
Copy link
Start on current slide
Leveraging Crowdsourcing for Paraphrase Recognition
長岡技術科学大学
自然言語処理研究室
角張竜晴
kakubari
November 28, 2017
More Decks by kakubari
See All by kakubari
動詞クエリの語間の関係性に基づくクエリマイニング
kakubari
0
130
Neural Modeling of Multi-Predicate Interactions for Japanese Predicate Argument Structure Analysis
kakubari
1
190
Automatically Acquired Lexical Knowledge Improves Japanese Joint Morphological and Dependency Analysis
kakubari
0
110
Labeling the Semantic Roles of Commas
kakubari
0
97
Integrating Case Frame into Japanese to Chinese Hierarchical Phrase-based Translation Model
kakubari
0
130
Improving Chinese Semantic Role Labelingusing High-quality Surface and Deep Case Frames
kakubari
0
98
Exploring Verb Frames for Sentence Simplification in Hindi
kakubari
0
150
述語項構造と照応関係のアノテーション
kakubari
0
260
用言と直前の格要素の組を単位とする格フレームの自動構築
kakubari
0
230
Other Decks in Technology
See All in Technology
エンジニアリング戦略の作り方 / Crafting Engineering Strategy
iwashi86
7
1.4k
サイバーセキュリティ概論 / Introduction to Cybersecurity
ks91
PRO
0
170
Ruby::Boxでできること、Refinementsでできること
joker1007
3
400
AI Testing Talks: Challenges of Applying AI in Software Testing: From Hype to Practical Use
exactpro
PRO
1
140
2026.06.13_AI時代に事業会社が「SIer出身エンジニア」を求める理由 / Why Businesses Seek Engineers with a System Integrator Background in the AI Era
jumtech
0
890
Diagnosing performance problems without the guesswork
elenatanasoiu
0
170
ABEMA の Datadog × OTel 基盤、 中から見るか? 外から見るか?
tetsuya28
0
110
会社紹介資料 / Sansan Company Profile
sansan33
PRO
18
420k
AI-DLCを活用した高品質・安全なAI駆動開発実践 / AI Driven Development with AI-DLC
yoshidashingo
0
150
EventBridge Connection
_kensh
5
660
10倍の生産性を実現するAI駆動並列エージェントのすべて
kumaiu
4
990
protovalidate-es を導入してみた
bengo4com
0
160
Featured
See All Featured
Abbi's Birthday
coloredviolet
2
8k
Money Talks: Using Revenue to Get Sh*t Done
nikkihalliwell
0
240
What’s in a name? Adding method to the madness
productmarketing
PRO
24
4.1k
The Curse of the Amulet
leimatthew05
1
13k
[RailsConf 2023 Opening Keynote] The Magic of Rails
eileencodes
31
10k
How People are Using Generative and Agentic AI to Supercharge Their Products, Projects, Services and Value Streams Today
helenjbeal
1
200
Design of three-dimensional binary manipulators for pick-and-place task avoiding obstacles (IECON2024)
konakalab
0
450
The innovator’s Mindset - Leading Through an Era of Exponential Change - McGill University 2025
jdejongh
PRO
1
190
Between Models and Reality
mayunak
4
330
How to Think Like a Performance Engineer
csswizardry
28
2.6k
Designing Experiences People Love
moore
143
24k
コードの90%をAIが書く世界で何が待っているのか / What awaits us in a world where 90% of the code is written by AI
rkaga
62
44k
Transcript
Ԭ ٕ ज़ Պ ֶ େ ֶ ࣗ વ ݴ ޠ ॲ ཧ ݚ ڀ ࣨ ֶ ෦ ̐ ֯ ு ཽ Leveraging Crowdsourcing for Parapharase Recognition Martin Tschirsich, Gerold Hintz Proceedings of the 7th Linguistic Annotation Workshop & Interoperability with Discourse, pages 205–213, Sofia, Bulgaria, August 8-9, 2013. ਤදจΑΓҾ༻ 1
概要 2 ʲௐࠪɾఏҊʳ ύϥϑϨʔζೝࣝͷͨΊͷΫϥυιʔγϯάํ๏ ଟஈ֊ͷΫϥυιʔγϯάख๏Λఏࣔɻ ʲ݁Ռʳ จ຺తͳݴ͍͑ͷੜΛͰ͖Δɻ
ݴ͍͑ίʔύεͷίετΛେ෯ʹݮͰ͖Δɻ
はじめに 3 ʲύϥϑϨʔζೝࣝʳ %SBT ަ͕ՄೳͳςΩετͷϖΞΛੳ͢Δ͜ͱ ˔ྫ͑ɾɾɾใݕࡧͷ ʮୈถࠃେ౦ྗͷࡴਓʯ
ɹʹʮδϣϯ'ɾέωσΟͷ҉ࡴʯ
はじめに 4 ʲύϥϑϨʔζೝࣝͷݚڀʳ Φʔϓϯͳݚڀ՝ ۙɺٸܹʹਐา͍ͯ͠Δ 4PDIFSFUBM
͔͠͠ɺਫ਼ະͩʹ্͍ͯ͠ͳ͍ɻ
パラフレーズの定義 5 ʲύϥϑϨʔζͷ֓೦ʳ %PMBOBOE#SPDLFUU ҙຯతྨࣅੑͱ୯ޠΦϯτϩδʔͷ֓೦ʹີʹ ؔ࿈͍ͯ͠Δɻ
ਖ਼֬ͳఆٛͳ͘ɺෳࡶͳΨΠυϥΠϯͰܾఆɻ ΫϥυιʔγϯάΛߦ͏ࡍʹɾɾɾ ࡞ۀऀͷײʹཔ͍ͬͯΔ͜ͱʹҙ͢Δඞཁ͕͋Δɻ ྫจ͕ॏཁͰ͋Δ
パラフレーズの認識 6 ʲύϥϑϨʔζͷೝࣝʳ 4PDIFSFUBM ʮҙͷ͞ͱܗͰ͋ΔͭͷϑϨʔζ͕ಉ͡ҙຯͰ͋ Δ͔Ͳ͏͔ʯΛܾఆ͢Δ ʲઌߦݚڀʳ
/άϥϜͷॏͳΓ ґଘؔπϦʔͷॏͳΓฤूڑ Ͱܾఆ͍ͯ͠Δɻ ಉٛޠҙຯʹ͍͔͠ΛࣝผͰ͖ͳ͍ɻ ݴ͍͑ΒΕͨจষΛֶशͯࣗ͠ಈࣝผ͢Δඞཁ͕͋Δɻ
先行研究 7 ;IPVFUBM ύϥϨϧίʔύεΛ༻͍ͨ༁ϕʔεͷख๏ .BEOBOJFUBM
ҙຯྨࣅੑͷධՁ ୯७ͳ̎Ͱͳ͘ɺ࿈ଓతͳͰϖΞͷྨࣅͯ͠ ͍ΔఔΛࣔ͢ɻ ςΩετͷྨࣅੑɺଟͷΫϥυϫʔΧʔͷ அΛฏۉԽ͢Δɻ
クラウドソーシング 8 ʲ$SPXE'MPXFSʳ ΫϥυιʔγϯάΛߦ͏8FCαʔϏε ࡞ۀऀͷλεΫډॅΛ੍ݶͰ͖Δ ऩू͞Εͨσʔλͷਖ਼ੑΛݕূ͢ΔγεςϜ ¡
࣮ࡍͷσʔλΛॲཧ͢Δલʹɺ࡞ۀऀ͕ਖ਼͘͠ճ͢ΔΑ͏ ʹνΣοΫΛ͢Δɻ ¡ ࡞ۀதʹɺਖ਼͘͠ճ͍ͯ͠Δ͔νΣοΫ͢Δɻ ਖ਼ղσʔλͱ࡞ۀ݁ՌΛൺֱ͠ɺ৴པੑΛ୲อ͢Δɻ
クラウドソーシング 9 ʲࣄ༰ͷσβΠϯʳ ࣄ༰ͷσβΠϯɺऩू͞Εͨσʔλͷ࣭ʹ ࠷େ͖ͳӨڹΛ༩͑Δɻ ਖ਼͍͠ࢦࣔͱΘ͔Γ͍͢ྫ͕ॏཁ $SPXE'MPXFSʹɺ$.-ݕূػೳ͕͋Δɻ
ෆਖ਼ͳϢʔβͷೖྗΛऩू͠ͳ͍
フレーズ−パラフレーズ生成 10 ݴ͍͑ͷϕʔεϥΠϯ ूஂ࡞ۀऀʹɺϑϨʔζQ Λఏࣔ͠ɺॻ͖͑Q Λ
ಘΔ
2段階の言い換えの生成 11 ࡞ۀऀʹϑϨʔζQ Λఏࣔ͠ɺݴ͍͑Q ΛಘΔɻ ̎ɺ̏ਓͷ࡞ۀऀ͕ͦΕͧΕͷੜ͞Εͨݴ͍
͑ͷϖΞΛݕূ͢Δɻ ઐՈͷධՁऀͱ܈ऺͷஅͷ߹ҙ /FHSJFUBM
多段階の言い換えの生成 12 ࡞ۀऀʹϑϨʔζQ Λఏࣔ͠ɺݴ͍͑Q ΛಘΔɻ ଞͷ࡞ۀऀʹϑϨʔζQ
Λఏࣔ͠ɺQ Λݕূ͠ɺ ݴ͍͑Q ΛಘΔɻ ޡͬͨݴ͍͑ΛݮΒ͠ɺ ɹΑΓଟ͘ͷݴ͍͕͑ಘΒΕΔɻ
多段階の言い換えの生成 13 ஈ֊ͷݴ͍͑ɿऩूͨ͠ϖΞͷΛݕূ ଟஈ֊ͷݴ͍͑ ஈ ɿऩूͨ͠ϖΞͷΛݕূ
まとめ 14 Ϋϥυιʔγϯάͷํ๏ͱͯ͠ɺ ɹଟஈ֊ͷݴ͍͑ΛఏҊɻ ଟஈతʹύϥϑϨʔζΛߦ͏͜ͱ͕༗ޮ ¡ ݕূͱݴ͍͑Λߦ͏͜ͱͰɺҙຯͷΒ͖͕ͭগͳ͘ͳΔ