The Story of a Hadoop Beginner Who Wanted to Stop Being a Beginner
onomotoharu
December 22, 2016
#CyberZ #Hadoop #SkillWednesday
Transcript
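The Story of a Hadoop Beginner Who Wanted to Stop Being a Beginner. SkillWednesday #20161221, onomotoharu
Who am I? onomotoharu, aka 二. ID extraction platform team. Likes: whisky, gambling, Minecraft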
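Why Hadoop?
The existing cluster is in bad shape, so we have to migrate
It looked like a good technical goal, and it is an area I knew nothing about
Today I will not be talking about Hadoop itself
What I did (in chronological order)
• Read online articles
• Read a book
• Just try building one
• Work with one that is already running
• Ask veterans
Recommendation rating
• Read online articles ★☆☆
• Read a book ★☆☆
• Just try building one ☆☆☆
• Work with one that is already running ★★☆
• Ask veterans ★★★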
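Read online articles ★☆☆
Articles that were especially helpful
• Introduction to YARN
• https://www.ibm.com/developerworks/jp/analytics/library/bd-yarn-intro/
• How Hadoop works
• http://gihyo.jp/admin/serial/01/how_hadoop_works
How did it go?
• Started by reading the SlideShare decks that come up when you search for things like "Hadoop 入門" (Hadoop introduction)
• Honestly, I only got a fuzzy understanding
• Jumping straight into a complete body of material is a high hurdle
• It is a chicken-and-egg situation
• It takes a very long time
• You cannot judge for yourself whether a given resource suits your current level of understanding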
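Read a book ★☆☆
The book I read
• Hadoopオペレーション ―システム運用管理ガイド (Hadoop Operations: a system operations and management guide)
How did it go?
• Where should I even start reading…
• It was too far removed from anything I had touched at work
• I could not picture what makes it useful
• Honestly, I do not remember much of the content
…so this one book was all I managed to read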
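Just try building one ☆☆☆
What I did
• QuickStarts for CDH 5.8
• Run a shell script and you get a cluster
• CM suddenly pops up in front of you
• http://www.cloudera.com/downloads/quickstart_vms/5-8.html
How did it go?
• The cluster came up! I loaded an arbitrary CSV and queried it with Hive!
• …so what am I supposed to do from here?
• It seems to be meant for comparing against other distributions in the first place
• There is nothing left to work out by trial and error yourself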
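Work with one that is already running ★★☆
What I did
• Read (probably) all of Cosmo's EMR configuration
• After understanding it in my own way, built a cluster with the same configuration
• …but before that, built a plain one while reading the AWS documentation
How did it go?
• Standing one up while reading the AWS documentation
• I got a fuzzy sense of EMR's features and design philosophy
• Standing one up while reading the existing configuration
• Ask why a setting is needed, search for it, understand the intent → satisfying, and it sticks
• Applying this to my own project still seems a long way off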
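Ask veterans ★★★
What I did
• Topics
• Things that were unclear while I was actually doing the work
• Things I was not convinced by
• Cases where I could not think of a concrete use case
• Method
• Set up a channel on Slack (#hadoop)
• Ask right there
How did it go?
• It fills the holes in "I think I have the big picture… but there are holes"
• My level of understanding and the project requirements are already shared to some extent
• The conversation is matched to my understanding (interactive!)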
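That said: summary and impressions
• Read online articles
• Read a book
• Just try building one
• Work with one that is already running
• Ask veterans
None of the steps was a waste
At first nothing makes sense, but all you can do is keep reading anyway
"Ah, right, right, I see. I understand completely."
Moments like that should gradually become more frequent. The image is of a mesh of knowledge growing both larger and finer, so that new knowledge becomes easier to absorb
What I learned: impressions from working with Hadoop
"Our battle has only just begun…!"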