2017_B3_Seminar_3
Nagaoka University of Technology
Natural Language Processing Laboratory
角張竜晴
kakubari
February 10, 2017
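Transcript
Nagaoka University of Technology, Electrical, Electronics and Information Engineering Course, undergraduate, 角張竜晴
Natural Language Processing Laboratory
B3 Seminar, session …: Introduction to Big Data Analysis
Contents
● What is big data?
● Managing big data (UNIX)
● Computational errors in statistical processing
What is big data?
● Big data is characterized by the "3 Vs": Volume (capacity), Variety (types), and Velocity (accumulation frequency).
  ⇒ The data volume is large, the data types are diverse, and data accumulates frequently.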
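Managing big data (UNIX)
● Check the contents of the big data
  Knowing what the data contains determines which analysis method should be used.
  - cat: prints the entire contents of a file
  - less: opens a viewer mode that displays the contents of a file; press q to quit
    cat file.csv
    less file.csv
Managing big data (UNIX)
  - head -n: prints only the first n lines of a file
  - ~ | head: prints only the first lines of the output of the command before the pipe (|)
  - tail -n: prints only the last n lines of a file
    head -n … file.csv
    sample.py file.csv | head
    tail -n … file.csv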
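As a minimal sketch of the viewing commands above, the following builds a small comma-separated file and inspects it with cat, head, and tail. The file contents and the line counts are assumptions made for illustration, since the values on the original slides are not recoverable. less is left out because it is interactive.

```shell
# Hypothetical sample data (not taken from the slides).
workdir=$(mktemp -d)
cd "$workdir" || exit 1
printf '%s\n' 'apple,3' 'orange,5' 'banana,1' 'grape,2' > file.csv

cat file.csv                          # print the whole file
first_two=$(head -n 2 file.csv)       # first 2 lines of the file
last_one=$(tail -n 1 file.csv)        # last line of the file
piped=$(cat file.csv | head -n 1)     # first line of another command's output

printf '%s\n' "$first_two" "$last_one" "$piped"
```

Piping into head is useful when the producing command would otherwise flood the terminal with output.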
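Managing big data (UNIX)
● Manage the big data
  Replace and extract data so that it is easier to analyze.
  - sed: replaces data
    The text between the slashes that follow s is replaced with the text between the slashes that precede g.
    sed 's/…/…/g' file.csv
Managing big data (UNIX)
  - 's/,/_/': replaces a comma on each line with _ (without g, only the first comma on the line)
  - sed '~;~': separating expressions with a semicolon lets several operations be chained
  - > (redirect): writes the output to a file
    sed 's/,/_/;s/…/…/g' file.csv > output.txt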
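The sed behaviour described above can be sketched as follows. The sample rows and the second substitution (s/e/E/g) are hypothetical and chosen only to make the effect of g and of the semicolon visible.

```shell
# Hypothetical sample data (not taken from the slides).
workdir=$(mktemp -d)
cd "$workdir" || exit 1
printf '%s\n' 'apple,3,red' 'orange,5,orange' > file.csv

# Without g, only the first comma on each line is replaced.
first_only=$(sed 's/,/_/' file.csv)
# With g, every comma on each line is replaced.
all_commas=$(sed 's/,/_/g' file.csv)
# Two substitutions chained with a semicolon; the result is redirected to a file.
sed 's/,/_/;s/e/E/g' file.csv > output.txt

cat output.txt
```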
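Managing big data (UNIX)
  - awk: extracts the required fields
    Processes delimited strings one line at a time.
    -F,: states explicitly that fields are comma-separated (the default is space-separated)
    awk -F, '{print $…}' file.csv > output.txt
    file.csv:
      apple,…
      orange,…
      banana,…
    output.txt:
      …
Managing big data (UNIX)
  - awk is a programming language
    printf, for statements, if statements, and so on are available.
    awk -F, '{printf("%s%d個\n", …)}' file.csv > output.txt
    file.csv:
      apple,…
      orange,…
      banana,…
    output.txt:
      apple…個
      orange…個
      banana…個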
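A small sketch of the awk usage above, assuming a two-column file of names and counts. The counts, the field numbers, and the output file names counts.txt and large.txt are illustrative assumptions, not values from the slides.

```shell
# Hypothetical sample data (not taken from the slides).
workdir=$(mktemp -d)
cd "$workdir" || exit 1
printf '%s\n' 'apple,3' 'orange,5' 'banana,1' > file.csv

# -F, sets the field separator to a comma; $2 is the second field.
awk -F, '{print $2}' file.csv > counts.txt
# printf formats each record; %s is a string and %d an integer.
awk -F, '{printf("%s: %d items\n", $1, $2)}' file.csv > output.txt
# An if statement keeps only the rows whose count is greater than 2.
awk -F, '{ if ($2 > 2) print $1 }' file.csv > large.txt

cat counts.txt output.txt large.txt
```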
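Managing big data (UNIX)
  - grep "~": searches the data for lines that contain ~
  - grep -v "~": searches the data for lines that do not contain ~
    grep "e" file.csv > output.txt
    file.csv:
      apple…
      orange…
      banana…
    output.txt:
      apple…
      orange…
    grep -v "e" file.csv > output.txt
    output.txt:
      banana…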
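The two grep forms above can be tried on a small file as in this sketch. The numeric column is an assumption, and the second result is written to a hypothetical rest.txt so that both outputs can be compared side by side.

```shell
# Hypothetical sample data (not taken from the slides).
workdir=$(mktemp -d)
cd "$workdir" || exit 1
printf '%s\n' 'apple,3' 'orange,5' 'banana,1' > file.csv

grep 'e' file.csv > output.txt     # lines that contain the letter e
grep -v 'e' file.csv > rest.txt    # lines that do not contain the letter e

cat output.txt rest.txt
```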
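Managing big data (UNIX)
  - sort: reorders data
    Used when checking cumulative distribution functions or the minimum and maximum of the data.
    -k: selects the column to sort on; g sorts by numeric value.
    -t,: indicates that the file is comma-separated.
    sort -k…g -t, file.csv
    file.csv:
      apple…
      orange…
      banana…
    output.txt:
      banana…
      apple…
      orange…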
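A sketch of the numeric sort described above, assuming the numbers sit in the second column. The values are hypothetical but chosen so that the resulting order (banana, apple, orange) matches the slide. The key is written as -k2,2g so that only field 2 is compared; this is a choice made here, not necessarily the exact option string on the slide.

```shell
# Hypothetical sample data (not taken from the slides).
workdir=$(mktemp -d)
cd "$workdir" || exit 1
printf '%s\n' 'apple,3' 'orange,5' 'banana,1' > file.csv

# -t, : fields are comma-separated
# -k2,2g : sort on field 2 only, comparing as general numeric values
sort -t, -k2,2g file.csv > output.txt

smallest=$(head -n 1 output.txt)   # row with the minimum value
largest=$(tail -n 1 output.txt)    # row with the maximum value
cat output.txt
```

Once the rows are ordered, the minimum and maximum are simply the first and last lines.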
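Managing big data (UNIX)
  - uniq: merges identical adjacent lines into one
  - uniq -c: merges identical adjacent lines and counts them
    sort -k…g file.csv | uniq
    file.csv:
      …
    output.txt:
      …
    sort -k…g -u file.csv
    sort -k…g file.csv | uniq -c
    output.txt:
      …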
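The following sketch shows why uniq is normally fed sorted input: it only merges adjacent duplicates. The single-column data and the output file names are assumptions, and the awk step at the end is added here only to normalize the padded output of uniq -c.

```shell
# Hypothetical sample data (not taken from the slides).
workdir=$(mktemp -d)
cd "$workdir" || exit 1
printf '%s\n' 'apple' 'orange' 'apple' 'banana' 'apple' 'orange' > file.csv

sort file.csv | uniq > output.txt     # one line per distinct value
sort -u file.csv > output_u.txt       # same result using sort alone
# uniq -c prefixes each value with its count; awk rewrites it as value,count.
sort file.csv | uniq -c | awk '{print $2 "," $1}' > counts.txt

cat output.txt counts.txt
```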
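Computational errors in statistical processing
● Error
  In big data analysis, "the difference between the expected value and the value obtained by measurement, calculation, and so on."
● Types of error
  - Modeling error
  - Approximation error
  - Systematic error and random error
  - Computational error
Computational errors in statistical processing
- Modeling error
  Error introduced by factors that were ignored during modeling in order to avoid complexity.
  Example: the equation of motion of a pendulum
    ・Air resistance is ignored
    ・The displacement is assumed to be small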
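Computational errors in statistical processing
- Approximation error
  Error caused by using an approximate formula to simplify a calculation.
  Example: computing sin(…)
    Approximating with sin(x) ≈ x gives sin(…) ≈ …
    The actual value is sin(…) = …
    Therefore … is the approximation error.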
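To make the approximation error concrete, this sketch evaluates the difference between x and sin(x) with awk. The input x = 0.1 is an assumed value, not the one used on the slide.

```shell
# x = 0.1 is an assumed input (the slide's value is not recoverable).
# The approximation error of sin(x) ~ x is x - sin(x).
approx_error=$(awk 'BEGIN { x = 0.1; printf("%.6f", x - sin(x)) }')
echo "approximation error at x = 0.1: $approx_error"
```

The error grows quickly as x moves away from 0, because the first neglected term of the series is x^3/6.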
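Computational errors in statistical processing
- Systematic error
  Error that arises from the measurement method. As long as a single measurement method is used, it is not removed by repeating the trial, and a statistical bias is observed.
- Random error
  Error caused by scatter in the measured value from one measurement to the next.
Computational errors in statistical processing
- Computational error
  Error caused by the fact that a PC cannot handle infinite quantities when it performs calculations.
  Types of computational error:
    ◇ Rounding error
    ◇ Truncation error
    ◇ Loss of trailing digits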
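Computational errors in statistical processing
◇ Rounding error
  Error caused by discarding the part that falls outside the range of significant digits.
  Example: …
    With 3 significant digits, …
    In this case, …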
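A minimal sketch of rounding error with 3 significant digits. The value 3.14159 is an assumption chosen for illustration. Note that awk's %.3g rounds to nearest rather than strictly discarding digits; for this value both give 3.14.

```shell
# x = 3.14159 is an assumed example value.
# %.3g keeps 3 significant digits.
rounded=$(awk 'BEGIN { x = 3.14159; printf("%.3g", x) }')
# The rounding error is the difference between the original and the rounded value.
round_error=$(awk 'BEGIN { x = 3.14159; printf("%.5f", x - sprintf("%.3g", x)) }')
echo "rounded: $rounded, rounding error: $round_error"
```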
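Computational errors in statistical processing
◇ Truncation error
  Formulas used in mathematics treat the theoretical value as a sum of infinitely many terms. Truncation error is the error incurred when the series is cut off at the point judged to be sufficiently close to the limit.
  Example: …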
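As an illustration of truncation error, this sketch sums the first few terms of the Taylor series of e^x and compares the result with awk's built-in exp. The choice of series, x = 1, and 5 terms are all assumptions, since the slide's own example is not recoverable.

```shell
# Assumed example: e^x = sum over k of x^k / k!, cut off after 5 terms at x = 1.
result=$(awk 'BEGIN {
  x = 1; terms = 5
  term = 1; sum = 0
  for (k = 0; k < terms; k++) {
    sum += term             # add x^k / k!
    term *= x / (k + 1)     # next term: x^(k+1) / (k+1)!
  }
  # Print the partial sum and the truncation error against exp(x).
  printf("%.6f %.6f", sum, exp(x) - sum)
}')
echo "partial sum and truncation error: $result"
```

Raising terms shrinks the error rapidly, which is the trade-off between cost and accuracy that the slide describes.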
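Computational errors in statistical processing
◇ Loss of trailing digits
  Error that occurs because the calculation is not carried out properly when two numbers whose absolute values differ greatly are added or subtracted.
  Example: with 5 significant digits, x = …, y = …, x + y = …
    Cutting the result off at 5 significant digits gives …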
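The effect can be reproduced by limiting a sum to 5 significant digits, as in this sketch. The operands x = 12345 and y = 0.12345 are assumed values, not the ones from the slide.

```shell
# Assumed operands whose magnitudes differ greatly.
# The exact sum is 12345.12345, but only 5 significant digits are kept.
sum5=$(awk 'BEGIN { x = 12345; y = 0.12345; printf("%.5g", x + y) }')
echo "x + y kept to 5 significant digits: $sum5"
```

The result equals x, so the contribution of y has disappeared entirely.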
References
● 高安美佐子 (ed.), 田村光太郎 and 三浦航, 「学生・技術者のためのビッグデータ解析入門」 (Introduction to Big Data Analysis for Students and Engineers), Chapters 1 to 3, 株式会社日本評論社, …