Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
2017_B3_Seminar_3
Search
Sponsored
·
SiteGround - Reliable hosting with speed, security, and support you can count on.
→
kakubari
February 10, 2017
Technology
0
84
2017_B3_Seminar_3
長岡技術科学大学
自然言語処理研究室
角張竜晴
kakubari
February 10, 2017
Tweet
Share
More Decks by kakubari
See All by kakubari
動詞クエリの語間の関係性に基づくクエリマイニング
kakubari
0
120
Neural Modeling of Multi-Predicate Interactions for Japanese Predicate Argument Structure Analysis
kakubari
1
180
Leveraging Crowdsourcing for Paraphrase Recognition
kakubari
0
100
Automatically Acquired Lexical Knowledge Improves Japanese Joint Morphological and Dependency Analysis
kakubari
0
110
Labeling the Semantic Roles of Commas
kakubari
0
94
Integrating Case Frame into Japanese to Chinese Hierarchical Phrase-based Translation Model
kakubari
0
120
Improving Chinese Semantic Role Labelingusing High-quality Surface and Deep Case Frames
kakubari
0
95
Exploring Verb Frames for Sentence Simplification in Hindi
kakubari
0
140
述語項構造と照応関係のアノテーション
kakubari
0
250
Other Decks in Technology
See All in Technology
Datadog Cloud Cost Management で実現するFinOps
taiponrock
PRO
0
140
Kiro のクレジットを使い切る!
otanikohei2023
0
110
Serverless Agent Architecture on Azure / serverless-agent-on-azure
miyake
1
150
Eight Engineering Unit 紹介資料
sansan33
PRO
1
6.9k
トップマネジメントとコンピテンシーから考えるエンジニアリングマネジメント
zigorou
4
550
Sansan Engineering Unit 紹介資料
sansan33
PRO
1
4k
自動テストが巻き起こした開発プロセス・チームの変化 / Impact of Automated Testing on Development Cycles and Team Dynamics
codmoninc
1
1.1k
OpenClawで回す組織運営
jacopen
2
560
白金鉱業Meetup_Vol.22_Orbital Senseを支える衛星画像のマルチモーダルエンベディングと地理空間のあいまい検索技術
brainpadpr
2
220
20260305_【白金鉱業】分析者が地理情報を武器にするための軽量なアドホック分析環境
yucho147
1
180
作りっぱなしで終わらせない! 価値を出し続ける AI エージェントのための「信頼性」設計 / Designing Reliability for AI Agents that Deliver Continuous Value
aoto
PRO
1
150
LLM活用の壁を超える:リクルートR&Dの戦略と打ち手
recruitengineers
PRO
1
250
Featured
See All Featured
JAMstack: Web Apps at Ludicrous Speed - All Things Open 2022
reverentgeek
1
380
Efficient Content Optimization with Google Search Console & Apps Script
katarinadahlin
PRO
1
360
The Illustrated Children's Guide to Kubernetes
chrisshort
51
52k
AI: The stuff that nobody shows you
jnunemaker
PRO
3
350
The Limits of Empathy - UXLibs8
cassininazir
1
240
The Curious Case for Waylosing
cassininazir
0
260
Design of three-dimensional binary manipulators for pick-and-place task avoiding obstacles (IECON2024)
konakalab
0
370
Self-Hosted WebAssembly Runtime for Runtime-Neutral Checkpoint/Restore in Edge–Cloud Continuum
chikuwait
0
380
Money Talks: Using Revenue to Get Sh*t Done
nikkihalliwell
0
170
Being A Developer After 40
akosma
91
590k
GitHub's CSS Performance
jonrohan
1032
470k
Ethics towards AI in product and experience design
skipperchong
2
210
Transcript
Ԭٕज़Պֶେֶ ిؾిࢠใֶ՝ఔ ֶ෦ɹ֯ுཽ ࣗવݴޠݚڀࣨ ɹ#̏θϛ ʙୈճʙ ϏοΫσʔλղੳೖ
目次 ˔ϏοΫσʔλͱ ˔ϏοΫσʔλͷཧ 6/*9 ˔౷ܭॲཧͷܭࢉޡࠩ
ビックデータとは ˔ϏοΫσʔλͱ ɹʮ̏7ʯଈͪɺ7PMVNF ༰ྔ 7BSJFUZ
छྨ 7FMPDJUZ ੵස Ὃ σʔλͷ༰ྔ͕େ͖͘ɺσʔλͷछྨ͕ଟ༷Ͱɺ σʔλͷੵස͕ଟ͍ɻ
ビックデータの管理(UNIX) ˔ϏοΫσʔλͷ༰Λ֬ೝ͢Δ σʔλͷ༰ΛΔ͜ͱͰɺߦ͏͖ղੳํ๏͕ܾ ·Δ ɹ⾣DBUɿϑΝΠϧͷதΛશͯදࣔ ɹ⾣MFTTɿදࣔϞʔυͱͳΓɺϑΝΠϧͷதΛදࣔ ɹɹRΛԡ͢͜ͱͰऴྃ͢Δ
DBUpMFDTW MFTTpMFDTW
ビックデータの管理(UNIX) ɹ⾣IFBEOɿϑΝΠϧͷઌ಄͔ΒOߦ͚ͩදࣔ ɹ⾣dcIFBEɿʮʛʢύΠϓʣʯΑΓલͷ࣮ߦ݁Ռ ɹɹɹɹɹɹɹΛ࠷ॳͷߦ͚ͩදࣔ ɹ⾣UBJMrOɿϑΝΠϧͷ࠷ޙ͔ΒOߦ͚ͩදࣔ
IFBErOpMFDTW TBNQMFQZpMFDTWcIFBE UBJMrOpMFDTW
ビックデータの管理(UNIX) ˔ϏοΫσʔλΛཧ͢Δ ղੳ͕͍͢͠Α͏ʹɺσʔλͷஔநग़Λߦ͏ ɹ⾣TFEɿσʔλͷஔ ɹɹTͷ࣍ͷͷؒͷจࣈ͕ɺ ɹɹɹHͷલͷͷؒͷจࣈʹஔ͞ΕΔ TFElT HzpMFDTW
ビックデータの管理(UNIX) ɹ⾣zT @zɿҰߦͷΧϯϚΛ@ʹஔ ɹ⾣TFElddzɿηϛίϩϯͰ۠Δ͜ͱͰɺ ɹɹɹɹɹɹɹɹͭͳ͛ͯॲཧΛهड़Ͱ͖Δ ɹ⾣ ϦμΠϨΫτ ɿग़ྗ݁ՌΛϑΝΠϧʹॻࠐΈ
TFElT @T HzpMFDTWPVUQVUUYU
ビックデータの管理(UNIX) ɹ⾣BXLɿඞཁͳ߲ͷநग़ ɹɹɹɹҰߦͣͭ۠Γ͋ΔจࣈྻΛॲཧ͢Δ ɹɹ'
ɿΧϯϚ۠ΓͰॲཧ͢Δ͜ͱΛ໌ࣔ ɹɹɹɹʢσϑΥϧτͰεϖʔε۠Γʣ BXLr' b\QSJOU^`pMFDTWPVUQVUUYU pMFDTW BQQMF PSBOHF CBOBOB PVUQVUUYU
ビックデータの管理(UNIX) ɹ⾣BXLϓϩάϥϛϯάݴޠ ɹɹQSJOUG GPSจ JGจͳͲ͕͑Δ BXLr'b\QSJOUG lTEݸaOz ^`pMFDTWPVUQVUUYU
pMFDTW BQQMF PSBOHF CBOBOB PVUQVUUYU BQQMFݸ PSBOHFݸ CBOBOBݸ
ビックデータの管理(UNIX) ɹ⾣HSFQldzɿσʔλͷத͔ΒdΛؚΉߦΛݕࡧ ɹ ɹ⾣HSFQrWldzɿσʔλͷத͔ΒdΛؚ·ͳ͍ߦΛݕࡧ HSFQlFzpMFDTWPVUQVUUYU pMFDTW
BQQMF PSBOHF CBOBOB PVUQVUUYU BQQMF PSBOHF HSFQrWlFzpMFDTWPVUQVUUYU PVUQVUUYU CBOBOB
ビックデータの管理(UNIX) ɹ⾣TPSUɿσʔλͷฒͼସ͑ ɹɹྦྷੵؔσʔλͷ࠷খɾ࠷େ ɹɹɹΛ֬ೝ͢Δ߹ʹ༻͍Δ ɹLҰྻʹண͠ɺHࣈͰฒͼସ͑Δɻ
ɹU ΧϯϚ۠ΓͷϑΝΠϧͰ͋Δ͜ͱΛද͢ɻ TPSUrLHrU pMFDTW pMFDTW BQQMF PSBOHF CBOBOB PVUQVUUYU CBOBOB BQQMF PSBOHF
ビックデータの管理(UNIX) ɹ⾣VOJRɿಉ͡σʔλΛ·ͱΊΔ ɹ⾣VOJRrDɿಉ͡σʔλΛ·ͱΊɺΧϯτ͢Δ
ɹ TPSUrLHpMFDTWcVOJR pMFDTW PVUQVUUYU TPSUrLHVpMFDTW TPSUrLHpMFDTWcVOJRD PVUQVUUYU
統計処理の計算誤差 ˔ޡࠩ ɹϏοΫσʔλղੳΛ͢Δ্Ͱɺʮظ͞ΕΔͱɺ ଌఆܭࢉͳͲͰಘΒΕͨͱͷࠩʯ ˔ޡࠩͷྨ ɹ⾣ϞσϧԽޡࠩ ɹ⾣ۙࣅޡࠩ ɹ⾣ܥ౷ޡࠩɺۮવޡࠩ ɹ⾣ܭࢉޡࠩ
統計処理の計算誤差 ⾣ϞσϧԽޡࠩ ɹϞσϧԽͷࡍʹෳࡶԽΛආ͚ΔͨΊʹແࢹ͞Εͨཁ ૉ͕ͨΒ͢ޡࠩ ɹྫʣৼΓࢠͷӡಈํఔࣜ ɹɹɹɾۭؾ߅Λແࢹ ɹɹɹɾมҟ͕ඍখͳͷ
統計処理の計算誤差 ⾣ۙࣅޡࠩ ɹܭࢉͷ؆ૉԽͷͨΊʹۙࣅࣜΛ༻͍Δ͜ͱʹΑΔޡࠩ ɹྫʣTJO Λܭࢉ ɹɹɹTJO Y dYͰۙࣅ͢Δͱɺ
TJO d ɹɹɹ࣮ࡍʹɺ TJO ɹɹɹΑͬͯɺ͕ۙࣅޡࠩ
統計処理の計算誤差 ⾣ܥ౷ޡࠩ ɹଌఆͷํ๏ʹΑͬͯݱΕͯ͠·͏ޡࠩ ɹҰͭͷଌఆํ๏Ͱଌఆ͢ΔݶΓɺࢼߦճΛॏͶ ͯऔΓআ͔Εͣɺ౷ܭతʹͣΕ͕؍ଌ͞ΕΔ ⾣ۮવޡࠩ ɹଌఆ͝ͱʹଌఆʹΒ͖͕ͭੜ·ΕΔ͜ͱʹΑΔޡࠩ
統計処理の計算誤差 ⾣ܭࢉޡࠩ ɹ1$ͰܭࢉΛߦ͏ࡍʹɺແݶΛѻ͑ͳ͍͜ͱʹΑΔޡࠩ ɹʙܭࢉޡࠩͷछྨʙ ɹɹ˗ؙΊޡࠩ ɹɹ˗ଧͪΓޡࠩ ɹɹ˗ใམͪ
統計処理の計算誤差 ˗ؙΊޡࠩ ɹ༗ޮࣈൣғ֎ͷ෦͕ࣺͯΒΕΔ͜ͱʹΑΔޡࠩ ɹྫʣʜ ɹɹɹ༗ޮࣈ͕ܻ̏ͷ߹ɺ ɹɹɹ͜ͷͱ͖ɺʜ
統計処理の計算誤差 ˔ଧͪΓޡࠩ ɹֶͰ༻͍ΔࣜͰɺཧతͳͱͯ͠ແݶݸͷΛ ߟ͑Δɻेʹۃݶڃʹ͍ۙͱஅܻͨ͠ Ͱଧͪͬͨͱ͖ͷޡࠩɹ ɹྫʣ
ʜ ɹ
統計処理の計算誤差 ˔ใམͪɹ ɹઈରʹେ͖ͳ͕ࠩ͋ΔೋͭͷͷՃݮΛߦͬͨ ߹ʹɺܭࢉ͕·ͱʹߦΘΕͳ͍͜ͱʹΑΔޡࠩ ɹྫʣ༗ޮࣈܻ̑ͷ Y Zº Y Z
ɹɹɹ༗ޮࣈܻ̑ͰΓམͱ͢ͱ
参考文献 ˔ߴ҆ඒࠤࢠฤஶɺాଜޫଠɾࡾӜߤஶɺ ɹʮֶੜɾٕज़ऀͷͨΊͷϏοΫσʔλղੳೖʯ ʢୈ̍ষʙୈ̏ষʣɺ ɹגࣜձࣾຊධࣾɺ݄