mobilemethod-2-about-analytic-data.pdf
Yoshihito
September 14, 2018
Mobile Method Osaka (モバイルメソッド大阪), 2nd meetup
Slides for "Behind the scenes of mobile apps: how we collect data for analytics."
Transcript
Slide 1. Behind the scenes of mobile apps: how we collect data for analytics
Classmethod, Inc.
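Slide 2. About me
• Based at a regional office (name only partly legible: "岡")
• Mobile App Services department
• Server-side engineer
• Joined the company a little over a year ago
• What I do
  • Building infrastructure on AWS
  • Developing servers for mobile apps
  • Collecting and processing data for analytics (ETL)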
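Slide 3. What I will cover
• A backend-leaning talk
• The kinds of data a mobile system handles
• The flow of the collected data
• An overview of the technologies we use
• What I learned from actually developing and operating the system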
Slide 4. What I will not cover
• Details of the backend architecture
• App tracking techniques
• Data analysis techniques
Slide 5. Kinds of data
Components of a mobile app system
• App
• Server
• Platform
Each of them holds many kinds of data.
Slide 6. Example architecture of a mobile system
Slide 7. Server data
• User-specific data
• Operational content
• Logs
• Infrastructure metrics
Slide 8. App data
• In-app behavior history
  • Launched the app
  • Pressed a button
  • Switched tabs
• Statistics such as launch time and dwell time
• Crash reports
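As a rough illustration of the in-app behavior history listed above, one action could be represented as a small record and serialized to a JSON line before it is sent to the server. The deck does not specify a schema, so the class `AppEvent`, its field names (`user_id`, `event_type`, `occurred_at`, `properties`), and the event names are all hypothetical.

```python
import json
from dataclasses import asdict, dataclass, field
from datetime import datetime, timezone


@dataclass
class AppEvent:
    """One in-app action. Every field name here is a hypothetical choice."""

    user_id: str
    event_type: str   # e.g. "app_launch", "button_tap", "tab_switch"
    occurred_at: str  # ISO 8601 timestamp in UTC
    properties: dict = field(default_factory=dict)


def to_json_line(event: AppEvent) -> str:
    """Serialize one event as a single line of JSON."""
    return json.dumps(asdict(event), ensure_ascii=False, sort_keys=True)


example_event = AppEvent(
    user_id="u-123",
    event_type="tab_switch",
    occurred_at=datetime(2018, 9, 14, 10, 0, tzinfo=timezone.utc).isoformat(),
    properties={"from": "home", "to": "search"},
)
example_line = to_json_line(example_event)
```

One record per line keeps the data easy to append on the device and easy to split across workers later in a batch job.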
Slide 9. Flow of the collected data
Slides 10 to 17: [no text in transcript]
Slide 18. Technologies we use
• luigi: workflow management for batch jobs
• Apache Hadoop: distributed processing platform for big data
• EMR: AWS's cluster platform for big data
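Slide 19. luigi
• https://github.com/spotify/luigi
• A workflow management system written in Python
• Runs tasks according to their dependencies
• Tracks task state and controls execution accordingly
  • Runs only the tasks that have not completed
  • Skips the tasks that are already complete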
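The "run only what is incomplete, skip what is done" behavior described above can be sketched in a few lines of plain Python. This is not luigi's API or implementation: the `Task` class and `build` function below are hypothetical stand-ins, and completeness is a simple flag here, whereas luigi by default treats a task as complete when its output target exists.

```python
class Task:
    """A unit of work with dependencies (hypothetical, not luigi's Task)."""

    def __init__(self, name, requires=()):
        self.name = name
        self.requires = list(requires)
        self.done = False

    def complete(self):
        """Report whether this task has already finished."""
        return self.done

    def run(self):
        """Do the work; here we only record that it happened."""
        self.done = True


def build(task, executed):
    """Run `task` after its dependencies, skipping anything already complete."""
    if task.complete():
        return
    for dependency in task.requires:
        build(dependency, executed)
    task.run()
    executed.append(task.name)


extract = Task("extract")
transform = Task("transform", requires=[extract])
load = Task("load", requires=[transform])

extract.done = True  # pretend an earlier run already finished this step
executed = []
build(load, executed)
```

After a failure, calling `build(load, executed)` again would re-run only the steps that did not finish, which is the property that makes reruns cheap.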
Slides 20 to 27: [no text in transcript]
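Slide 28. Apache Hadoop
A framework for parallel, distributed processing of large-scale data.
Building your own system for processing and analyzing large-scale data is hard.
• How do you process a large volume of data efficiently?
• Parallel processing on a single machine hits the ceiling of that machine's performance
  • CPU
  • Disk I/O
  • Network communication
• Distributed processing requires tracking how work is distributed and managing the cluster configuration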
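Hadoop's classic programming model for this kind of work is MapReduce. The single-process sketch below only illustrates the shape of that model by counting events per type; the function names and the sample records are assumptions for illustration. On a real cluster the map and reduce calls run on many machines, and the framework handles the shuffle, scheduling, and retries.

```python
from collections import defaultdict


def map_phase(record):
    """Emit (key, value) pairs for one input record."""
    yield record["event_type"], 1


def shuffle(pairs):
    """Group values by key, as the framework does between map and reduce."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_phase(key, values):
    """Combine all values for one key into a single result."""
    return key, sum(values)


# Hypothetical sample input: one dict per collected app event.
records = [
    {"event_type": "app_launch"},
    {"event_type": "button_tap"},
    {"event_type": "button_tap"},
    {"event_type": "tab_switch"},
]
pairs = [pair for record in records for pair in map_phase(record)]
counts = dict(reduce_phase(key, values) for key, values in shuffle(pairs).items())
```

Because each `map_phase` call looks at one record and each `reduce_phase` call looks at one key, the work can be split across machines without the functions knowing about each other.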
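Slide 29. Apache Hadoop
It takes care of the tedious parts of parallel and distributed processing.
• You can focus on the processing you actually want to build
• The ecosystem is excellent
  • Simple processing and analysis can be handled without writing an application
    • Just write queries with Hive or Presto
  • When things get more complex, you can switch to an application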
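To give a feel for "just write a query", here is the kind of SQL aggregation one might run in Hive or Presto. To keep the sketch runnable without a cluster it uses Python's built-in `sqlite3` as a stand-in engine; the table `app_events` and its columns are hypothetical, and real Hive or Presto tables would usually be defined over files in storage such as S3.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE app_events (user_id TEXT, event_type TEXT)")
conn.executemany(
    "INSERT INTO app_events VALUES (?, ?)",
    [
        ("u1", "app_launch"),
        ("u1", "button_tap"),
        ("u2", "app_launch"),
        ("u2", "button_tap"),
        ("u2", "button_tap"),
    ],
)

# Count events and distinct users per event type.
query = """
    SELECT event_type,
           COUNT(*) AS events,
           COUNT(DISTINCT user_id) AS users
    FROM app_events
    GROUP BY event_type
    ORDER BY event_type
"""
rows = conn.execute(query).fetchall()
conn.close()
```

The aggregation logic lives entirely in the query, so no application code has to be built or deployed for a simple report like this one.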
Slides 30 to 36: [no text in transcript]
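Slide 37. EMR (Amazon Elastic MapReduce)
The cluster platform for big data provided by AWS.
• Makes it easy to set up an environment for running Hadoop and Spark
• Switching versions is easy
  • Just change the AMI specification
• The cluster layout can be changed just by changing settings
  • Number and type of instances to suit the workload
• Integrates easily with other AWS services
  • Can read and write data in S3 and DynamoDB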
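The idea that versions and cluster layout are "just settings" can be sketched as a function that builds the request for boto3's EMR `run_job_flow` call. This is a sketch under assumptions, not the author's setup: the cluster name, instance types, and release labels are hypothetical, the role names are the AWS defaults, and the request is only constructed here, never sent. The slide speaks of changing the AMI; this sketch uses `ReleaseLabel`, the field that EMR 4.x and later use to select a release.

```python
def build_cluster_request(release_label, core_count, core_type="m4.large"):
    """Build keyword arguments for boto3's `run_job_flow` (request is not sent)."""
    return {
        "Name": "analytics-batch",      # hypothetical cluster name
        "ReleaseLabel": release_label,  # selects the EMR release to run
        "Applications": [{"Name": "Hadoop"}, {"Name": "Hive"}],
        "Instances": {
            "InstanceGroups": [
                {
                    "Name": "master",
                    "InstanceRole": "MASTER",
                    "Market": "ON_DEMAND",
                    "InstanceType": "m4.large",
                    "InstanceCount": 1,
                },
                {
                    "Name": "core",
                    "InstanceRole": "CORE",
                    "Market": "ON_DEMAND",
                    "InstanceType": core_type,
                    "InstanceCount": core_count,
                },
            ],
            # Shut the cluster down once its steps finish.
            "KeepJobFlowAliveWhenNoSteps": False,
        },
        "JobFlowRole": "EMR_EC2_DefaultRole",
        "ServiceRole": "EMR_DefaultRole",
    }


# Same code, two different clusters: only the settings differ.
small = build_cluster_request("emr-5.16.0", core_count=2)
large = build_cluster_request("emr-5.17.0", core_count=10, core_type="m4.xlarge")
```

With credentials configured, such a request would be submitted with `boto3.client("emr").run_job_flow(**small)`.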
Slides 38 to 42: [no text in transcript]
Slide 43. Other big-data services are also available
• Redshift: data warehouse
• Athena: ad hoc analysis of data in S3
• Glue: managed ETL service
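Slide 44. What I learned from developing and operating the system
• Keeping data analysis running stably is hard
  • Irregular data was sent in
  • A job could not run because an external service was down
• When circumstances change, the requirements for data analysis change too
• It is reassuring to set things up so that jobs are easy to rerun when something goes wrong
• Hive and Presto are almost too convenient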
Slide 45. The end
Slide 46. References
• Apache Hadoop - Wikipedia
  • https://ja.wikipedia.org/wiki/Apache_Hadoop
• Amazon EMR
  • https://aws.amazon.com/jp/emr/