Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
mobilemethod-2-about-analytic-data.pdf
Search
Yoshihito
September 14, 2018
0
1.1k
mobilemethod-2-about-analytic-data.pdf
モバイルメソッド大阪 第2回
モバイルアプリの裏側 どうやって分析用のデータを集めているか のスライドです
Yoshihito
September 14, 2018
Tweet
Share
More Decks by Yoshihito
See All by Yoshihito
TUI App in Rust
yoshihitoh
0
210
Custom Runtime Lambda empowered by Rust
yoshihitoh
0
3.1k
Rust tutorial - implement a cli tool.
yoshihitoh
0
260
introduce-rust.pdf
yoshihitoh
2
530
regrowth2018-introduce-reinvent-sessions
yoshihitoh
0
980
cpp-library-on-browse-nodejs
yoshihitoh
0
2.7k
Featured
See All Featured
Creating an realtime collaboration tool: Agile Flush - .NET Oxford
marcduiker
31
2.2k
実際に使うSQLの書き方 徹底解説 / pgcon21j-tutorial
soudai
PRO
183
54k
Cheating the UX When There Is Nothing More to Optimize - PixelPioneers
stephaniewalter
283
13k
How to Think Like a Performance Engineer
csswizardry
25
1.8k
CSS Pre-Processors: Stylus, Less & Sass
bermonpainter
358
30k
Intergalactic Javascript Robots from Outer Space
tanoku
272
27k
Sharpening the Axe: The Primacy of Toolmaking
bcantrill
44
2.4k
Faster Mobile Websites
deanohume
309
31k
Six Lessons from altMBA
skipperchong
28
4k
Unsuck your backbone
ammeep
671
58k
Understanding Cognitive Biases in Performance Measurement
bluesmoon
29
1.8k
The Power of CSS Pseudo Elements
geoffreycrofte
77
5.9k
Transcript
ϞόΠϧΞϓϦͷཪଆͰ Ͳ͏ͬͯੳ༻ͷσʔλΛूΊ͍ͯΔ͔ ɹɹɹɹɹɹɹɹɹɹɹɹɹɹɹɹɹɹΫϥεϝιουגࣜձࣾ ɹɹɹɹɹɹɹɹɹɹɹɹɹɹɹɹɹɹɹɹɹɹɹɹ ɹɹฏ 1
About me • ԬΦϑΟεॴଐ • ϞόΠϧΞϓϦαʔϏε෦ • αʔόʔαΠυΤϯδχΞ • ೖࣾͯ͠1ͪΐͬͱ
• ͬͯΔ͜ͱ • AWSͷΠϯϑϥߏங • ϞόΠϧΞϓϦؔ࿈ͷαʔό։ൃ • ੳ༻σʔλͷऩू/Ճ (ETL) 2
͢͜ͱ • όοΫΤϯυΑΓͷ • ϞόΠϧγεςϜͰѻ͏σʔλͷछྨ • ूΊͨσʔλͷྲྀΕ • ͬͯΔٕज़֓ཁ •
࣮ࡍʹ։ൃɾӡ༻ͯ͠ײͨ͜͡ͱ 3
͞ͳ͍͜ͱ • όοΫΤϯυͷΞʔΩςΫνϟͷৄࡉ • ΞϓϦͷτϥοΩϯάख๏ • σʔλੳͷख๏ 4
σʔλͷछྨ ϞόΠϧΞϓϦͷߏཁૉ • ΞϓϦ • αʔό • ϓϥοτϑΥʔϜ ͦΕͧΕ৭ΜͳσʔλΛ࣋ͬͯΔ 5
ϞόΠϧγεςϜͷΞʔΩςΫνϟྫ 6
αʔόͷσʔλ • Ϣʔβݻ༗ͷσʔλ • ӡ༻ίϯςϯπ • ϩά • ΠϯϑϥͷϝτϦΫε 7
ΞϓϦͷσʔλ • ΞϓϦͰͷߦಈཤྺ • ΞϓϦΛىಈͨ͠ • ϘλϯΛԡͨ͠ • λϒΛΓସ͑ͨ •
ىಈ࣌ؒཹ࣌ؒͳͲͷ౷ܭใ • ΫϥογϡϨϙʔτ 8
ूΊͨσʔλͷྲྀΕ 9
10
11
12
13
14
15
16
17
ͬͯΔٕज़ཁૉ • luigi: όονδϣϒͷϫʔΫϑϩʔཧ • Apache Hadoop: Ϗοάσʔλͷࢄॲཧج൫ • EMR:
AWSͷϏοάσʔλ༻ΫϥελϓϥοτϑΥʔϜ 18
luigi • h#ps:/ /github.com/spo1fy/luigi • PythonͷϫʔΫϑϩʔཧγεςϜ • ґଘؔʹैͬͯλεΫΛ࣮ߦ͢Δ • λεΫͷঢ়ଶΛཧͯ͠͏·͍͜ͱ࣮ߦ੍ޚ͢Δ
• ະྃͷͷ͚࣮ͩߦ • ྃͯ͠ΔͷεΩοϓ 19
20
21
22
23
24
25
26
27
Apache Hadoop େنσʔλΛฒྻࢄॲཧ͢ΔͨΊͷϑϨʔϜϫʔΫ େنσʔλͷՃɾੳγεςϜΛࣗ࡞͢Δͷେม • Ͳ͏ͬͯେྔͷσʔλΛޮΑ͘ࡹ͔͘ • 1Ͱฒྻॲཧͯ͠Ϛγϯੑೳͷ্ݶͰ಄ଧͪ͢Δ • CPU
• σΟεΫIO • ωοτϫʔΫ௨৴ • ࢄॲཧࢄঢ়گͷཧΫϥελͷߏཧ͕ඞཁ 28
Apache Hadoop ฒྻॲཧɾࢄॲཧͷ໘ͳͱ͜Ζͷ໘ΛΈͯ͘ΕΔ • ຊདྷ࡞Γ͍ͨॲཧʹྗͰ͖Δ • ΤίγεςϜ͕༏ल • ؆୯ͳՃɾੳͷ߹ΞϓϦΛ࡞Βͳ͍ͰରԠͰ͖Δ •
HiveɾPrestoΛ׆༻ͯ͠ΫΤϦΛॻ͚ͩ͘ • ෳࡶԽ͢Δ߹ΞϓϦʹΓସ͑Δ͜ͱͰ͖Δ 29
30
31
32
33
34
35
36
EMR (Amazon Elas.c MapReduce) AWS͕༻ҙ͢ΔϏοάσʔλ༻ΫϥελϓϥοτϑΥʔϜ • HadoopSparkΛ͏ͨΊͷڥΛखܰʹߏஙͰ͖Δ • όʔδϣϯͷΓସ͕͑؆୯ •
AMIͷࢦఆΛม͑Δ͚ͩ • ઃఆΛม͑Δ͚ͩͰΫϥελߏΛมߋͰ͖Δ • - ༻్ʹԠͨ͡ΠϯελϯεɾΠϯελϯεछผ • AWSͷαʔϏεͱ؆୯ʹ࿈ܞͰ͖Δ • S3DynamoDBͷσʔλΛಡΈॻ͖Ͱ͖Δ 37
38
39
40
41
42
ଞʹϏοάσʔλؔ࿈ͷαʔϏε͕༻ҙ͞ Ε͍ͯΔ • Redshi(: σʔλΣΞϋε • Athena: S3ͷσʔλΛΞυϗοΫʹੳͰ͖Δ • Glue:
ϚωʔδυͳETLαʔϏε 43
࣮ࡍʹ։ൃɾӡ༻ͯ͠ײͨ͜͡ͱ • σʔλੳΛ҆ఆՔಇͤ͞Δͷ͍͠ • ΠϨΪϡϥʔͳσʔλ͕ૹΒΕ͖ͯͨ • ֎෦αʔϏεͷোͰ࣮ߦͰ͖ͳ͔ͬͨ • ঢ়گ͕มԽ͢Δͱσʔλੳͷཁ݅มΘͬͯ͘Δ •
Կ͔͕ىͬͨ͜߹ʹΓ͍͢͠Α͏ʹ͓ͯ͘͠ͱ҆৺ • HiveɾPrestoศར͗͢ 44
͓͠·͍ 45
ࢀߟจݙ • Apache Hadoop - Wikipedia • h0ps:/ /ja.wikipedia.org/wiki/Apache_Hadoop •
Amazon EMR • h0ps:/ /aws.amazon.com/jp/emr/ 46