Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
Ido Barkan
Search
AppsFlyer
July 27, 2016
Technology
1
140
Ido Barkan
Using Druid Analyzing web access logs for 8 billion events per day
AppsFlyer
July 27, 2016
Tweet
Share
More Decks by AppsFlyer
See All by AppsFlyer
Processing 15 Billion events a day without breaking the bank - ReversimX ILTechTalks
appsflyer
0
480
Journey to the Real-Time Analytics in Extreme Growth
appsflyer
0
290
10 Real problems & solutions in your build and deploy process
appsflyer
0
130
DevOps paradigm in R&D day-to-day
appsflyer
0
130
Building a Mobile Backend to Evolve
appsflyer
0
87
Sometimes, Druid is not the best solution for a business use case
appsflyer
1
420
Processing 8 Billion Daily Events in Real Time!
appsflyer
1
110
React Performance
appsflyer
1
200
Real-time analytics with Druid at Appsflyer
appsflyer
0
350
Other Decks in Technology
See All in Technology
きのこカンファレンス_ランチスポンサーセッション
kabaya
1
390
LangGraph × Bedrock による複数の Agentic Workflow を利用した Supervisor 型のマルチエージェントの実現/langgraph-bedrock-supervisor-agent
ren8k
4
500
書籍『入門 OpenTelemetry』 / Intro of OpenTelemetry book
ymotongpoo
10
660
Microsoft_20250311_できるつくれるAIAgent.pdf
iotcomjpadmin
0
110
VPoEの引き継ぎでやったこと、わかったこと
saitoryc
1
830
株式会社Awarefy(アウェアファイ)会社説明資料 / Awarefy-Company-Deck
awarefy
3
12k
20250304_赤煉瓦倉庫_DeepSeek_Deep_Dive
hiouchiy
2
160
Microsoft_20250311_AzureIoTPortfolio_PDF.pdf
iotcomjpadmin
0
110
Autonomous Database サービス・アップデート (FY25)
oracle4engineer
PRO
1
510
生成AIがローコードツールになる時代の エンジニアの役割を考える
khwada
0
400
エンジニア採用と 技術広報の実践/acaricsummit2025
nishiuma
1
160
Global Databaseで実現するマルチリージョン自動切替とBlue/Greenデプロイ
j2yano
0
230
Featured
See All Featured
Improving Core Web Vitals using Speculation Rules API
sergeychernyshev
11
560
Save Time (by Creating Custom Rails Generators)
garrettdimon
PRO
29
1.1k
Optimising Largest Contentful Paint
csswizardry
34
3.1k
The MySQL Ecosystem @ GitHub 2015
samlambert
251
12k
The Language of Interfaces
destraynor
156
24k
個人開発の失敗を避けるイケてる考え方 / tips for indie hackers
panda_program
101
18k
Building Applications with DynamoDB
mza
93
6.3k
Scaling GitHub
holman
459
140k
[RailsConf 2023] Rails as a piece of cake
palkan
53
5.3k
Done Done
chrislema
182
16k
The Web Performance Landscape in 2024 [PerfNow 2024]
tammyeverts
4
460
Building Flexible Design Systems
yeseniaperezcruz
328
38k
Transcript
Ido Barkan Analyzing web access logs for 8 billion events
per day
5xx Errors
Appsflyer gets around 8B web events per day.
Micro Services Architecture Real Time Attr.
AWS Elastic load balancer Log entry
A Log line 2016-02-06T12:51:54.201846Z Appsflyer-web 139.162.156.169:50435 10.10.8.90:6555 0.000021 0.001916 0.00001
200 200 780 2 "POST https://track. appsflyer.com:443/... HTTP/1.1" "Dalvik/1.6.0 (Linux; U; Android 4.4.4; SM-J110H Build/KTU84P)" ECDHE-RSA-AES128-SHA TLSv1 $ head -1 195229424603_elasticloadbalancing_eu-west-1_appsflyer-web_.log | wc -c 331 Total: 300-1500 bytes =>sub sampling of 1/10 => 223 GB daily approx.
What was missing? No transparency of incoming web requests. ?
# error (400 / 500) responses grouped by app ? # of events grouped by app ? # of events grouped by response code
What wasn’t missing? ! No single event granularity- only analytics
! No fancy enterprise features (role-based access, alerts etc.)
Possible solutions 1. Our own ELK- will not hold the
volume 2. SaaS based ELK (logz.io, loggly...)- expensive and gives more than we want.
Data flow Log to bucket Trigger Lambda Druid sink service
Druid configured naively • 3 data nodes (historical+RT) • 1
master (coordinator) • 1 broker • No data duplication • 7d data retention • Only 5 machines
Basic log processing 2016-02-06T12:51:54.201846Z Appsflyer-web 139.162.156.169:50435 10.10.8.90:6555 0.000021 0.001916 0.00001
200 200 780 2 "POST https://track.appsflyer.com:443/... HTTP/1.1" "Dalvik/1.6.0 (Linux; U; Android 4.4.4; SM-J110H Build/KTU84P)" ECDHE-RSA-AES128-SHA TLSv1
Demo! • druidquery • caravel
Thank you
[email protected]
Questions?
Thank you
[email protected]
We are hiring!