The Story of a Hadoop Beginner Who Wanted to Stop Being a Beginner
onomotoharu
December 22, 2016
#CyberZ #Hadoop #SkillWednesday
Transcript
The story of a Hadoop beginner who wanted to stop being a beginner. SkillWednesday #20161221, onomotoharu
Who am I? onomotoharu, aka [...]. ID extraction platform team. Whisky, gambling, Minecraft.
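Why Hadoop?
The existing cluster is in bad shape, so we have to migrate.
It looks like a good technical goal, since it is an area I know nothing about.
Today I will not be talking about Hadoop itself.
What I did (in chronological order)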
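• Read online articles
• Read books
• Just try building a cluster
• Touch something that is already running
• Ask a veteran
Recommendation rating
• Read online articles ★☆☆
• Read books ★☆☆
• Just try building a cluster ☆☆☆
• Touch something that is already running ★★☆
• Ask a veteran ★★★
Read online articles ★☆☆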
Articles that were especially helpful
• Introduction to YARN: https://www.ibm.com/developerworks/jp/analytics/library/bd-yarn-intro/
• How Hadoop works: http://gihyo.jp/admin/serial/01/how_hadoop_works
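How did it go?
• First I googled things like "Hadoop introduction" and read the SlideShare decks that came up
• Honestly, I only got a fuzzy understanding
• Jumping straight into a complete body of material is a high hurdle
• It is a chicken-and-egg situation
• It takes a great deal of time
• You cannot judge for yourself whether a resource suits your current level of understanding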
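Read books ★☆☆
The book I read
• Hadoopオペレーション: システム運用管理ガイド (Hadoop Operations: a system operations and administration guide)
How did it go?
• Where should I even start reading...?
• It was too far removed from anything I had touched at work
• I could not picture what makes it useful
• Honestly, I do not remember much of the content
...so this was the only book I managed to read.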
Just try building a cluster ☆☆☆
What I did
• QuickStarts for CDH 5.8
• Run a shell script and you get a cluster
• CM suddenly pops up in front of you
• http://www.cloudera.com/downloads/quickstart_vms/5-8.html
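How did it go?
• The cluster came up! I loaded an arbitrary CSV and queried it with Hive!
• ...So what am I supposed to do from here?
• It seems to be meant for comparing against other distributions in the first place
• There is no part where you work things out by trial and error yourself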
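Touch something that is already running ★★☆
What I did
• Read (probably) all of Cosmo's EMR configuration
• Once I understood it in my own way, created a cluster with the same configuration
• ...but before that, built a plain cluster while reading the AWS documentation
How did it go?
• Standing up a cluster while reading the AWS documentation
• I got a fuzzy sense of EMR's features and design philosophy
• Standing up a cluster while reading the existing configuration
• Wonder why a setting is needed, google it, understand the intent → satisfying, and it sticks
• Applying this to my own project still seems a long way off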
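Ask a veteran ★★★
What I did
• What I asked
• Things that were unclear while I was actually working
• Things I was not convinced by
• Cases where I could not think of a concrete use case
• How I asked
• Set up a Slack channel (#hadoop)
• Asked there
How did it go?
• It plugs the gaps in "I think I have the big picture... but there are holes"
• My level of understanding and my project's requirements were already shared to some extent, so the conversation matched my understanding (interactive!)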
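That said...
Summary and impressions
• Read online articles
• Read books
• Just try building a cluster
• Touch something that is already running
• Ask a veteran
None of the steps was a waste.
Even if nothing makes sense at first, you just have to keep reading.
The frequency of "Ah, right, right, I see. I have completely understood it" moments should gradually go up.
The picture I have is that your mesh of knowledge gradually grows both larger and finer, which makes new knowledge easier to absorb.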
What I learned: impressions from trying Hadoop
Our battle has only just begun...!