Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
mobilemethod-2-about-analytic-data.pdf
Search
Yoshihito
September 14, 2018
0
950
mobilemethod-2-about-analytic-data.pdf
モバイルメソッド大阪 第2回
モバイルアプリの裏側 どうやって分析用のデータを集めているか のスライドです
Yoshihito
September 14, 2018
Tweet
Share
More Decks by Yoshihito
See All by Yoshihito
TUI App in Rust
yoshihitoh
0
180
Custom Runtime Lambda empowered by Rust
yoshihitoh
0
2.7k
Rust tutorial - implement a cli tool.
yoshihitoh
0
220
introduce-rust.pdf
yoshihitoh
2
500
regrowth2018-introduce-reinvent-sessions
yoshihitoh
0
840
cpp-library-on-browse-nodejs
yoshihitoh
0
2.4k
Featured
See All Featured
Optimising Largest Contentful Paint
csswizardry
13
2.4k
Making the Leap to Tech Lead
cromwellryan
125
8.5k
StorybookのUI Testing Handbookを読んだ
zakiyama
13
4.6k
How GitHub Uses GitHub to Build GitHub
holman
468
290k
Fantastic passwords and where to find them - at NoRuKo
philnash
39
2.5k
The Brand Is Dead. Long Live the Brand.
mthomps
49
29k
No one is an island. Learnings from fostering a developers community.
thoeni
16
2.1k
What's new in Ruby 2.0
geeforr
337
31k
How to name files
jennybc
65
93k
Imperfection Machines: The Place of Print at Facebook
scottboms
261
12k
Debugging Ruby Performance
tmm1
70
11k
Being A Developer After 40
akosma
67
580k
Transcript
ϞόΠϧΞϓϦͷཪଆͰ Ͳ͏ͬͯੳ༻ͷσʔλΛूΊ͍ͯΔ͔ ɹɹɹɹɹɹɹɹɹɹɹɹɹɹɹɹɹɹΫϥεϝιουגࣜձࣾ ɹɹɹɹɹɹɹɹɹɹɹɹɹɹɹɹɹɹɹɹɹɹɹɹ ɹɹฏ 1
About me • ԬΦϑΟεॴଐ • ϞόΠϧΞϓϦαʔϏε෦ • αʔόʔαΠυΤϯδχΞ • ೖࣾͯ͠1ͪΐͬͱ
• ͬͯΔ͜ͱ • AWSͷΠϯϑϥߏங • ϞόΠϧΞϓϦؔ࿈ͷαʔό։ൃ • ੳ༻σʔλͷऩू/Ճ (ETL) 2
͢͜ͱ • όοΫΤϯυΑΓͷ • ϞόΠϧγεςϜͰѻ͏σʔλͷछྨ • ूΊͨσʔλͷྲྀΕ • ͬͯΔٕज़֓ཁ •
࣮ࡍʹ։ൃɾӡ༻ͯ͠ײͨ͜͡ͱ 3
͞ͳ͍͜ͱ • όοΫΤϯυͷΞʔΩςΫνϟͷৄࡉ • ΞϓϦͷτϥοΩϯάख๏ • σʔλੳͷख๏ 4
σʔλͷछྨ ϞόΠϧΞϓϦͷߏཁૉ • ΞϓϦ • αʔό • ϓϥοτϑΥʔϜ ͦΕͧΕ৭ΜͳσʔλΛ࣋ͬͯΔ 5
ϞόΠϧγεςϜͷΞʔΩςΫνϟྫ 6
αʔόͷσʔλ • Ϣʔβݻ༗ͷσʔλ • ӡ༻ίϯςϯπ • ϩά • ΠϯϑϥͷϝτϦΫε 7
ΞϓϦͷσʔλ • ΞϓϦͰͷߦಈཤྺ • ΞϓϦΛىಈͨ͠ • ϘλϯΛԡͨ͠ • λϒΛΓସ͑ͨ •
ىಈ࣌ؒཹ࣌ؒͳͲͷ౷ܭใ • ΫϥογϡϨϙʔτ 8
ूΊͨσʔλͷྲྀΕ 9
10
11
12
13
14
15
16
17
ͬͯΔٕज़ཁૉ • luigi: όονδϣϒͷϫʔΫϑϩʔཧ • Apache Hadoop: Ϗοάσʔλͷࢄॲཧج൫ • EMR:
AWSͷϏοάσʔλ༻ΫϥελϓϥοτϑΥʔϜ 18
luigi • h#ps:/ /github.com/spo1fy/luigi • PythonͷϫʔΫϑϩʔཧγεςϜ • ґଘؔʹैͬͯλεΫΛ࣮ߦ͢Δ • λεΫͷঢ়ଶΛཧͯ͠͏·͍͜ͱ࣮ߦ੍ޚ͢Δ
• ະྃͷͷ͚࣮ͩߦ • ྃͯ͠ΔͷεΩοϓ 19
20
21
22
23
24
25
26
27
Apache Hadoop େنσʔλΛฒྻࢄॲཧ͢ΔͨΊͷϑϨʔϜϫʔΫ େنσʔλͷՃɾੳγεςϜΛࣗ࡞͢Δͷେม • Ͳ͏ͬͯେྔͷσʔλΛޮΑ͘ࡹ͔͘ • 1Ͱฒྻॲཧͯ͠Ϛγϯੑೳͷ্ݶͰ಄ଧͪ͢Δ • CPU
• σΟεΫIO • ωοτϫʔΫ௨৴ • ࢄॲཧࢄঢ়گͷཧΫϥελͷߏཧ͕ඞཁ 28
Apache Hadoop ฒྻॲཧɾࢄॲཧͷ໘ͳͱ͜Ζͷ໘ΛΈͯ͘ΕΔ • ຊདྷ࡞Γ͍ͨॲཧʹྗͰ͖Δ • ΤίγεςϜ͕༏ल • ؆୯ͳՃɾੳͷ߹ΞϓϦΛ࡞Βͳ͍ͰରԠͰ͖Δ •
HiveɾPrestoΛ׆༻ͯ͠ΫΤϦΛॻ͚ͩ͘ • ෳࡶԽ͢Δ߹ΞϓϦʹΓସ͑Δ͜ͱͰ͖Δ 29
30
31
32
33
34
35
36
EMR (Amazon Elas.c MapReduce) AWS͕༻ҙ͢ΔϏοάσʔλ༻ΫϥελϓϥοτϑΥʔϜ • HadoopSparkΛ͏ͨΊͷڥΛखܰʹߏஙͰ͖Δ • όʔδϣϯͷΓସ͕͑؆୯ •
AMIͷࢦఆΛม͑Δ͚ͩ • ઃఆΛม͑Δ͚ͩͰΫϥελߏΛมߋͰ͖Δ • - ༻్ʹԠͨ͡ΠϯελϯεɾΠϯελϯεछผ • AWSͷαʔϏεͱ؆୯ʹ࿈ܞͰ͖Δ • S3DynamoDBͷσʔλΛಡΈॻ͖Ͱ͖Δ 37
38
39
40
41
42
ଞʹϏοάσʔλؔ࿈ͷαʔϏε͕༻ҙ͞ Ε͍ͯΔ • Redshi(: σʔλΣΞϋε • Athena: S3ͷσʔλΛΞυϗοΫʹੳͰ͖Δ • Glue:
ϚωʔδυͳETLαʔϏε 43
࣮ࡍʹ։ൃɾӡ༻ͯ͠ײͨ͜͡ͱ • σʔλੳΛ҆ఆՔಇͤ͞Δͷ͍͠ • ΠϨΪϡϥʔͳσʔλ͕ૹΒΕ͖ͯͨ • ֎෦αʔϏεͷোͰ࣮ߦͰ͖ͳ͔ͬͨ • ঢ়گ͕มԽ͢Δͱσʔλੳͷཁ݅มΘͬͯ͘Δ •
Կ͔͕ىͬͨ͜߹ʹΓ͍͢͠Α͏ʹ͓ͯ͘͠ͱ҆৺ • HiveɾPrestoศར͗͢ 44
͓͠·͍ 45
ࢀߟจݙ • Apache Hadoop - Wikipedia • h0ps:/ /ja.wikipedia.org/wiki/Apache_Hadoop •
Amazon EMR • h0ps:/ /aws.amazon.com/jp/emr/ 46