Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
mobilemethod-2-about-analytic-data.pdf
Search
Yoshihito
September 14, 2018
0
1.1k
mobilemethod-2-about-analytic-data.pdf
モバイルメソッド大阪 第2回
モバイルアプリの裏側 どうやって分析用のデータを集めているか のスライドです
Yoshihito
September 14, 2018
Tweet
Share
More Decks by Yoshihito
See All by Yoshihito
TUI App in Rust
yoshihitoh
0
210
Custom Runtime Lambda empowered by Rust
yoshihitoh
0
3k
Rust tutorial - implement a cli tool.
yoshihitoh
0
260
introduce-rust.pdf
yoshihitoh
2
530
regrowth2018-introduce-reinvent-sessions
yoshihitoh
0
970
cpp-library-on-browse-nodejs
yoshihitoh
0
2.6k
Featured
See All Featured
The Myth of the Modular Monolith - Day 2 Keynote - Rails World 2024
eileencodes
26
2.9k
Building Better People: How to give real-time feedback that sticks.
wjessup
367
19k
Cheating the UX When There Is Nothing More to Optimize - PixelPioneers
stephaniewalter
281
13k
Why Our Code Smells
bkeepers
PRO
337
57k
Design and Strategy: How to Deal with People Who Don’t "Get" Design
morganepeng
130
19k
Code Reviewing Like a Champion
maltzj
524
40k
Dealing with People You Can't Stand - Big Design 2015
cassininazir
367
26k
実際に使うSQLの書き方 徹底解説 / pgcon21j-tutorial
soudai
PRO
181
53k
How GitHub (no longer) Works
holman
314
140k
Sharpening the Axe: The Primacy of Toolmaking
bcantrill
44
2.4k
KATA
mclloyd
30
14k
CSS Pre-Processors: Stylus, Less & Sass
bermonpainter
357
30k
Transcript
ϞόΠϧΞϓϦͷཪଆͰ Ͳ͏ͬͯੳ༻ͷσʔλΛूΊ͍ͯΔ͔ ɹɹɹɹɹɹɹɹɹɹɹɹɹɹɹɹɹɹΫϥεϝιουגࣜձࣾ ɹɹɹɹɹɹɹɹɹɹɹɹɹɹɹɹɹɹɹɹɹɹɹɹ ɹɹฏ 1
About me • ԬΦϑΟεॴଐ • ϞόΠϧΞϓϦαʔϏε෦ • αʔόʔαΠυΤϯδχΞ • ೖࣾͯ͠1ͪΐͬͱ
• ͬͯΔ͜ͱ • AWSͷΠϯϑϥߏங • ϞόΠϧΞϓϦؔ࿈ͷαʔό։ൃ • ੳ༻σʔλͷऩू/Ճ (ETL) 2
͢͜ͱ • όοΫΤϯυΑΓͷ • ϞόΠϧγεςϜͰѻ͏σʔλͷछྨ • ूΊͨσʔλͷྲྀΕ • ͬͯΔٕज़֓ཁ •
࣮ࡍʹ։ൃɾӡ༻ͯ͠ײͨ͜͡ͱ 3
͞ͳ͍͜ͱ • όοΫΤϯυͷΞʔΩςΫνϟͷৄࡉ • ΞϓϦͷτϥοΩϯάख๏ • σʔλੳͷख๏ 4
σʔλͷछྨ ϞόΠϧΞϓϦͷߏཁૉ • ΞϓϦ • αʔό • ϓϥοτϑΥʔϜ ͦΕͧΕ৭ΜͳσʔλΛ࣋ͬͯΔ 5
ϞόΠϧγεςϜͷΞʔΩςΫνϟྫ 6
αʔόͷσʔλ • Ϣʔβݻ༗ͷσʔλ • ӡ༻ίϯςϯπ • ϩά • ΠϯϑϥͷϝτϦΫε 7
ΞϓϦͷσʔλ • ΞϓϦͰͷߦಈཤྺ • ΞϓϦΛىಈͨ͠ • ϘλϯΛԡͨ͠ • λϒΛΓସ͑ͨ •
ىಈ࣌ؒཹ࣌ؒͳͲͷ౷ܭใ • ΫϥογϡϨϙʔτ 8
ूΊͨσʔλͷྲྀΕ 9
10
11
12
13
14
15
16
17
ͬͯΔٕज़ཁૉ • luigi: όονδϣϒͷϫʔΫϑϩʔཧ • Apache Hadoop: Ϗοάσʔλͷࢄॲཧج൫ • EMR:
AWSͷϏοάσʔλ༻ΫϥελϓϥοτϑΥʔϜ 18
luigi • h#ps:/ /github.com/spo1fy/luigi • PythonͷϫʔΫϑϩʔཧγεςϜ • ґଘؔʹैͬͯλεΫΛ࣮ߦ͢Δ • λεΫͷঢ়ଶΛཧͯ͠͏·͍͜ͱ࣮ߦ੍ޚ͢Δ
• ະྃͷͷ͚࣮ͩߦ • ྃͯ͠ΔͷεΩοϓ 19
20
21
22
23
24
25
26
27
Apache Hadoop େنσʔλΛฒྻࢄॲཧ͢ΔͨΊͷϑϨʔϜϫʔΫ େنσʔλͷՃɾੳγεςϜΛࣗ࡞͢Δͷେม • Ͳ͏ͬͯେྔͷσʔλΛޮΑ͘ࡹ͔͘ • 1Ͱฒྻॲཧͯ͠Ϛγϯੑೳͷ্ݶͰ಄ଧͪ͢Δ • CPU
• σΟεΫIO • ωοτϫʔΫ௨৴ • ࢄॲཧࢄঢ়گͷཧΫϥελͷߏཧ͕ඞཁ 28
Apache Hadoop ฒྻॲཧɾࢄॲཧͷ໘ͳͱ͜Ζͷ໘ΛΈͯ͘ΕΔ • ຊདྷ࡞Γ͍ͨॲཧʹྗͰ͖Δ • ΤίγεςϜ͕༏ल • ؆୯ͳՃɾੳͷ߹ΞϓϦΛ࡞Βͳ͍ͰରԠͰ͖Δ •
HiveɾPrestoΛ׆༻ͯ͠ΫΤϦΛॻ͚ͩ͘ • ෳࡶԽ͢Δ߹ΞϓϦʹΓସ͑Δ͜ͱͰ͖Δ 29
30
31
32
33
34
35
36
EMR (Amazon Elas.c MapReduce) AWS͕༻ҙ͢ΔϏοάσʔλ༻ΫϥελϓϥοτϑΥʔϜ • HadoopSparkΛ͏ͨΊͷڥΛखܰʹߏஙͰ͖Δ • όʔδϣϯͷΓସ͕͑؆୯ •
AMIͷࢦఆΛม͑Δ͚ͩ • ઃఆΛม͑Δ͚ͩͰΫϥελߏΛมߋͰ͖Δ • - ༻్ʹԠͨ͡ΠϯελϯεɾΠϯελϯεछผ • AWSͷαʔϏεͱ؆୯ʹ࿈ܞͰ͖Δ • S3DynamoDBͷσʔλΛಡΈॻ͖Ͱ͖Δ 37
38
39
40
41
42
ଞʹϏοάσʔλؔ࿈ͷαʔϏε͕༻ҙ͞ Ε͍ͯΔ • Redshi(: σʔλΣΞϋε • Athena: S3ͷσʔλΛΞυϗοΫʹੳͰ͖Δ • Glue:
ϚωʔδυͳETLαʔϏε 43
࣮ࡍʹ։ൃɾӡ༻ͯ͠ײͨ͜͡ͱ • σʔλੳΛ҆ఆՔಇͤ͞Δͷ͍͠ • ΠϨΪϡϥʔͳσʔλ͕ૹΒΕ͖ͯͨ • ֎෦αʔϏεͷোͰ࣮ߦͰ͖ͳ͔ͬͨ • ঢ়گ͕มԽ͢Δͱσʔλੳͷཁ݅มΘͬͯ͘Δ •
Կ͔͕ىͬͨ͜߹ʹΓ͍͢͠Α͏ʹ͓ͯ͘͠ͱ҆৺ • HiveɾPrestoศར͗͢ 44
͓͠·͍ 45
ࢀߟจݙ • Apache Hadoop - Wikipedia • h0ps:/ /ja.wikipedia.org/wiki/Apache_Hadoop •
Amazon EMR • h0ps:/ /aws.amazon.com/jp/emr/ 46