Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
Elasticsearch Aggregations
Search
Javier Rey
August 19, 2015
Technology
0
400
Elasticsearch Aggregations
Meetup Big Data -
http://www.meetup.com/Montevideo-BigData-DataScience-Meetup/events/223833509/
Javier Rey
August 19, 2015
Tweet
Share
More Decks by Javier Rey
See All by Javier Rey
Elasticsearch en producción
vierja
0
300
Tryomeetup - Elasticsearch
vierja
0
39
Tryomeetup - Elasticsearch - English
vierja
0
45
Other Decks in Technology
See All in Technology
インサイト情報からどこまで自動化できるか試してみた
takas0522
0
130
「Verify with Wallet API」を アプリに導入するために
hinakko
1
200
Geospatialの世界最前線を探る [2025年版]
dayjournal
3
450
DataOpsNight#8_Terragruntを用いたスケーラブルなSnowflakeインフラ管理
roki18d
1
310
生成AIを活用したZennの取り組み事例
ryosukeigarashi
0
180
pprof vs runtime/trace (FlightRecorder)
task4233
0
140
Optuna DashboardにおけるPLaMo2連携機能の紹介 / PFN LLM セミナー
pfn
PRO
1
810
Function calling機能をPLaMo2に実装するには / PFN LLMセミナー
pfn
PRO
0
810
OCI Network Firewall 概要
oracle4engineer
PRO
1
7.7k
GC25 Recap+: Advancing Go Garbage Collection with Green Tea
logica0419
1
320
自作LLM Native GORM Pluginで実現する AI Agentバックテスト基盤構築
po3rin
2
220
それでも私はContextに値を詰めたい | Go Conference 2025 / go conference 2025 fill context
budougumi0617
4
1.1k
Featured
See All Featured
Fireside Chat
paigeccino
40
3.7k
Cheating the UX When There Is Nothing More to Optimize - PixelPioneers
stephaniewalter
285
14k
Refactoring Trust on Your Teams (GOTO; Chicago 2020)
rmw
35
3.2k
Practical Orchestrator
shlominoach
190
11k
How to train your dragon (web standard)
notwaldorf
96
6.3k
Reflections from 52 weeks, 52 projects
jeffersonlam
352
21k
Building Adaptive Systems
keathley
43
2.8k
4 Signs Your Business is Dying
shpigford
185
22k
Speed Design
sergeychernyshev
32
1.1k
Keith and Marios Guide to Fast Websites
keithpitt
411
22k
Responsive Adventures: Dirty Tricks From The Dark Corners of Front-End
smashingmag
252
21k
Designing for Performance
lara
610
69k
Transcript
Elasticsearch Aggregations Javier Rey @vierja
Aggregations Información sumarizada a partir de varios documentos o tuplas
para obtener alguna métrica o información del conjunto de datos. SELECT sum(x), count(1) GROUP BY y;
Aggregations en Big Data ◎ Grandes volúmenes de datos se
hace difícil revisar/monitorear la información. ◎ A veces queremos extraer información compleja sin tener que programar (app o jobs). ◎ Métricas en tiempo real.
Buckets terms histogram geo grids ... Aggregations en Elasticsearch Metrics
min - max - avg stats cardinality ... SELECT sum(x), count(1) GROUP BY y; BUCKET METRICS
Nested aggregations Root agg Inner agg ▸ bucket Leaf agg
▸ bucket or metric
Por qué usarlas en Elasticsearch? ◎ Combinables con búsquedas ◦
Ver agregaciones de resultados de búsquedas ◎ Fáciles de usar ◦ JSON based DSL ◎ Rápidas ◦ Estructuras de datos optimizadas ◦ Single pass de datos
Funcionamiento en clusters ◎ Agregación a nivel de shard (distribuido)
◎ Merge de las distintas sub agregaciones ◎ Algoritmos de aproximación: ◦ Cardinalidad ◉ HyperLogLog ◦ Percentiles ◉ TDigest
Aggregations interesantes ◎ Significant Terms aggregations ◦ The uncommonly common
◎ Geohash grid ◎ Terms con scripts
Ejemplos Multas de tránsito de Montevideo catalogodatos.gub.uy github.com/vierja/geojson_montevideo Queries: gist.github.com/vierja/aad3cd56fb11af129fae
SFPD Crime Incident Reporting elasticsearch.demos.tryolabs.com
Gracias! Preguntas?