Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
Hadoop初心者が脱初心者したかった話
Search
onomotoharu
December 22, 2016
0
380
Hadoop初心者が脱初心者したかった話
#CyberZ #Hadoop #SkillWednesday
onomotoharu
December 22, 2016
Tweet
Share
More Decks by onomotoharu
See All by onomotoharu
もし僕らのことばがAPIであったなら
onomotoharu
1
310
Featured
See All Featured
Building a Modern Day E-commerce SEO Strategy
aleyda
42
7.4k
The Language of Interfaces
destraynor
158
25k
Fireside Chat
paigeccino
37
3.5k
Fashionably flexible responsive web design (full day workshop)
malarkey
407
66k
Thoughts on Productivity
jonyablonski
69
4.7k
The Cost Of JavaScript in 2023
addyosmani
51
8.5k
Sharpening the Axe: The Primacy of Toolmaking
bcantrill
44
2.4k
The Invisible Side of Design
smashingmag
301
51k
Principles of Awesome APIs and How to Build Them.
keavy
126
17k
Six Lessons from altMBA
skipperchong
28
3.9k
How to train your dragon (web standard)
notwaldorf
96
6.1k
The Success of Rails: Ensuring Growth for the Next 100 Years
eileencodes
45
7.5k
Transcript
Hadoop ॳ৺ऀ͕ॳ৺ऀ͔ͨͬͨ͠ SkillWednesday#20161221 খݩय़
୭ খݩय़ BLBೋ *%நग़ج൫νʔϜ ΠεΩʔɹ Ṍ͚ࣄ ϚΠϯΫϥϑτ
ͳͥ)BEPPQ
طଘΫϥελ͕Ϡό͍ Ҡߦ͠ͳ͖Ό
ٕज़ඪʹྑͦ͞͏ શવ͠Βͳ͍ྖҬͩ͠
ࠓ)BEPPQͷ͠·ͤΜ
ͬͨ͜ͱ ࣌ظॱ
ωοτهࣄΛಡΉ ຊΛಡΉ ͱΓ͋͑ͣ࡞ͬͯΈΔ ಈ͍͍ͯΔͷΛ৮Δ ϕςϥϯʹฉ͘
ωοτهࣄΛಡΉ ˒ˑˑ ຊΛಡΉ ˒ˑˑ ͱΓ͋͑ͣ࡞ͬͯΈΔ ˑˑˑ ಈ͍͍ͯΔͷΛ৮Δ ˒˒ˑ ϕςϥϯʹฉ͘ ˒˒˒
Φεεϝ
ωοτهࣄΛಡΉ ˒ˑˑ
ಛʹࢀߟʹͳͬͨهࣄ • YARNͷհ • https://www.ibm.com/developerworks/jp/ analytics/library/bd-yarn-intro/ • HadoopͲͷΑ͏ʹಈ͘ͷ͔ • http://gihyo.jp/admin/serial/01/
how_hadoop_works
Ͳ͏ͩͬͨͷ • ·ͣ“Hadoop ೖ”ͳͲͰάάͬͯग़ͯདྷΔ SlideshareΛಡΜͰΈΔ • ຊʹ;ΜΘΓͱ͔͠Θ͔Βͳ͍ • ͻͱ·ͱ·Γͷࢿྉ͍͖ͳΓಡΉͷϋʔυϧ ͕ߴ͍
• ܲͱཛঢ়ଶ • ඇৗʹ͕͔͔࣌ؒΔ • ࠓͷཧղʹదͨ͠ࢿྉ͔ࣗͰஅෆՄ
ຊΛಡΉ ˒ˑˑ
ಡΜͩຊ • HadoopΦϖϨʔγϣϯ -γεςϜӡ༻ཧΨΠυ
Ͳ͏ͩͬͨͷ • Ͳ͔͜ΒಡΊ͍͍ͷ… • ۀͳͲͰ৮Εͨ͜ͱͱΕ͍͗ͯ͢Δ • “Ͳ͏خ͍͠”ͷ͔Πϝʔδ͕͔ͳ͍ • ਖ਼͋·Γ༰͍֮͑ͯͳ͍Ͱ͢ …ͷͰ͜ͷ͔̍͠ಡΊͳ͔ͬͨ
ͱΓ͋͑ͣͬͯΈΔ ˑˑˑ
ͬͨ͜ͱ • QuickStarts for CDH 5.8 • shྲྀ͢ͱΫϥελ͕Ͱ͖Δ • ͍͖ͳΓCMυϯͬͯͳΔ
• http://www.cloudera.com/ downloads/ quickstart_vms/5-8.html
Ͳ͏ͩͬͨͷ • ΫϥελཱͬͨʂదͳCSVΛೖΕͯHive Ͱୟ͍ͯΈͨʂ • …͔͜͜ΒԿΛ͢Ε͍͍ͷ͔ʁ • ͦͦଞσΟετϦϏϡʔγϣϯͳͲͱൺ ֱ͢ΔͨΊͷͬΆ͍ •
ࣗͰࢼߦࡨޡ͢Δ෦͕ͳ͍
ಈ͍͍ͯΔͷΛ৮Δ ˒˒ˑ
ͬͨ͜ͱ • CosmoͷEMRͷઃఆΛ(ͨͿ Μ)શ෦ಡΜͩ • ࣗͳΓʹཧղ্ͨ͠Ͱಉ͡ ઃఆͰΫϥελ࡞ • …ͷલʹAWSͷࢿྉಡΈ ͭͭϓϨʔϯͳͷΛ࡞
Ͳ͏ͩͬͨͷ • AWSͷࢿྉΛಡΈͳ͕ΒཱͯͯΈΔ • EMRͷػೳࢥ͕;ΜΘΓ͔Δ • طଘͷઃఆΛಡΈͳ͕ΒཱͯΔ • ͳͥ͜ͷઃఆ͕͍Δͷ͔ɺάάΔɺҙਤ͕ ͔ΔˠεοΩϦ&Εͳ͍
• ࣗͷPJͰͷԠ༻·ͩԕͦ͏
ϕςϥϯʹฉ͘ ˒˒˒
ͬͨ͜ͱ • ༰ • ࣮ࡍʹ࡞ۀ͍ͯ͠Δͱ͖ʹෆ໌ • ೲಘ͍͔ͳ͍ࣄ • ۩ମతͳϢʔεέʔε͕ු͔ͳ͍ύλʔϯ •
ํ๏ • slackͰ෦ཱͯ(#hadoop) • ͦͷͰ͏
Ͳ͏ͩͬͨͷ • “શମײ͔ͭΊͨͭΓ…͚ͩͲ͕݀͋Δ”ͷ ݀Λ࠹͛Δ • ࣗͷཧղ/PJͷཁ͕݅͋Δఔڞ༗ࡁΈ ཧղʹ߹Θͤͨձ(ΠϯλϥΫςΟϒ!)
ͱݴ͍ͭͭ ·ͱΊͱײ
ωοτهࣄΛಡΉ ຊΛಡΉ ͱΓ͋͑ͣ࡞ͬͯΈΔ ಈ͍͍ͯΔͷΛ৮Δ ϕςϥϯʹฉ͘
ͲͷεςοϓແବͰͳ͔ͬͨ
࠷ॳҙຯෆ໌Ͱ ͱΓ͋͑ͣಡΈ·͘Δ͔͠ͳ͍
͋ʙɹ͍͍ ͳΔ΄ͲͶ શʹཧղͨ͠ l z
ͷස͕ͩΜͩΜ্͕Δͣ ͩΜͩΜࣝϝογϡ͕େ͖͔ͭ͘ࡉ͔͘ͳͬͯ৽͍ࣝ͠Λٵऩ͘͢͠ͳΔΑ͏ͳΠϝʔδ
Θ͔ͬͨ͜ͱ Hadoopʹ৮ͬͯΈͨײ
Զୡͷઓ͍ɺ ࠓ࢝·͔ͬͨΓͩ…! l