Elasticsearch Aggregations
Javier Rey
August 19, 2015
Meetup Big Data - http://www.meetup.com/Montevideo-BigData-DataScience-Meetup/events/223833509/
Transcript
Elasticsearch Aggregations Javier Rey @vierja
Aggregations
Summarized information computed from several documents or tuples to obtain a metric or other information about the data set.
SELECT sum(x), count(1) GROUP BY y;
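
As a rough illustration of the definition above, the same grouping can be written out by hand in Python. The sample documents and the helper name `group_sum_count` are made up for this sketch; only the names `x` and `y` come from the SQL on the slide.

```python
from collections import defaultdict

# Hypothetical documents; the fields x and y mirror the SQL on the slide.
docs = [
    {"y": "a", "x": 10},
    {"y": "b", "x": 5},
    {"y": "a", "x": 7},
]


def group_sum_count(documents):
    """Return {y value: {"sum": sum of x, "count": number of documents}}."""
    groups = defaultdict(lambda: {"sum": 0, "count": 0})
    for doc in documents:
        group = groups[doc["y"]]   # one group per distinct value of y
        group["sum"] += doc["x"]   # sum(x)
        group["count"] += 1        # count(1)
    return dict(groups)


summary = group_sum_count(docs)
print(summary)
```

Each key of `summary` corresponds to one GROUP BY group, and the values inside it are the summarized information for that group.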
Aggregations in Big Data
◎ With large volumes of data, it becomes hard to review or monitor the information.
◎ Sometimes we want to extract complex information without having to write code (apps or jobs).
◎ Real-time metrics.
Aggregations in Elasticsearch
Buckets: terms, histogram, geo grids, ...
Metrics: min, max, avg, stats, cardinality, ...
SELECT sum(x), count(1) GROUP BY y;
In SQL terms, GROUP BY y is the bucket and sum(x), count(1) are the metrics.
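
To make the bucket/metric split concrete, here is a minimal sketch of how the SQL above might be expressed as an Elasticsearch search request body, written as a plain Python dict so it can be inspected without a running cluster. The aggregation labels `by_y` and `sum_x` are arbitrary names chosen for this example, and `x` and `y` are the placeholder fields from the slide.

```python
import json

# Request body for the _search endpoint, built as a plain dict.
# "by_y" and "sum_x" are labels invented for this sketch.
request_body = {
    "size": 0,  # return aggregation results only, no search hits
    "aggs": {
        "by_y": {
            # Bucket aggregation: one bucket per distinct value of y.
            "terms": {"field": "y"},
            "aggs": {
                # Metric aggregation computed inside each bucket.
                "sum_x": {"sum": {"field": "x"}}
            },
        }
    },
}

print(json.dumps(request_body, indent=2))
```

No separate aggregation is needed for count(1) here, since each `terms` bucket in the response already reports a `doc_count`.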
Nested aggregations
Root agg
Inner agg ▸ bucket
Leaf agg ▸ bucket or metric
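
The nesting above could be sketched as follows: a bucket aggregation at the root, another bucket inside it, and a metric at the leaf. The helper `nest_aggs`, the aggregation labels, and the fields `category` and `amount` are all hypothetical and exist only for this example.

```python
import json


def nest_aggs(levels):
    """Nest (name, definition) pairs into one "aggs" body, first pair outermost."""
    body = None
    for name, definition in reversed(levels):
        node = dict(definition)
        if body is not None:
            node["aggs"] = body  # attach the deeper level as a sub-aggregation
        body = {name: node}
    return body


aggs = nest_aggs([
    ("by_category", {"terms": {"field": "category"}}),                   # root: bucket
    ("by_amount", {"histogram": {"field": "amount", "interval": 100}}),  # inner: bucket
    ("avg_amount", {"avg": {"field": "amount"}}),                        # leaf: metric
])

request_body = {"size": 0, "aggs": aggs}
print(json.dumps(request_body, indent=2))
```

Only bucket aggregations can hold sub-aggregations, which is why the inner level has to be a bucket while the leaf may be either kind.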
Why use them in Elasticsearch?
◎ Combinable with searches
  ◦ See aggregations over search results
◎ Easy to use
  ◦ JSON-based DSL
◎ Fast
  ◦ Optimized data structures
  ◦ Single pass over the data
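
As a small sketch of the "combinable with searches" point, a single request body can carry both a query and an aggregation, and the aggregation is then computed over the documents that match the query. The fields `description` and `neighborhood`, the search text, and the label `by_neighborhood` are assumptions made up for this example.

```python
import json

# One JSON body holds both the search and the aggregation.
# All field names and the search text below are hypothetical.
request_body = {
    "query": {
        "match": {"description": "speeding"}  # full-text search
    },
    "aggs": {
        "by_neighborhood": {
            # Buckets are built from the documents matching the query above.
            "terms": {"field": "neighborhood"}
        }
    },
}

print(json.dumps(request_body, indent=2))
```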
How they work in clusters
◎ Aggregation at the shard level (distributed)
◎ Merge of the different sub-aggregations
◎ Approximation algorithms:
  ◦ Cardinality ◉ HyperLogLog
  ◦ Percentiles ◉ TDigest
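
The first two bullets can be pictured with a toy simulation: each shard counts its own documents, and a coordinating step merges the partial counts. This is a deliberately simplified sketch and not Elasticsearch's actual implementation; the shard contents and the function names are invented, and it does not model the approximation algorithms listed on the slide.

```python
from collections import Counter

# Hypothetical term values stored on three shards.
shards = [
    ["a", "b", "a"],
    ["b", "c"],
    ["a", "c", "c"],
]


def aggregate_shard(values):
    """Shard-level step: count the terms in this shard's documents only."""
    return Counter(values)


def merge_partials(partials):
    """Coordinating step: add up the per-shard partial counts."""
    total = Counter()
    for partial in partials:
        total.update(partial)
    return total


partials = [aggregate_shard(values) for values in shards]
merged = merge_partials(partials)
print(dict(merged))
```

The merge is exact in this toy because every shard reports all of its terms. Counts and sums merge this way without loss, whereas distinct counts and percentiles do not, which is where sketches such as HyperLogLog and TDigest come in.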
Interesting aggregations
◎ Significant Terms aggregations
  ◦ The uncommonly common
◎ Geohash grid
◎ Terms with scripts
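
A hedged sketch of what the first two of these might look like in one request body follows. The fields `district`, `category` and `location`, the value `Mission`, and the aggregation labels are all hypothetical; `location` is assumed to be mapped as a `geo_point`. Scripted terms are left out because script syntax differs between Elasticsearch versions.

```python
import json

# All field names, values and labels here are invented for illustration.
request_body = {
    "size": 0,
    # The query defines the subset of documents to analyze.
    "query": {"term": {"district": "Mission"}},
    "aggs": {
        # Terms that are unusually frequent in the matching documents
        # compared with the rest of the index ("the uncommonly common").
        "unusual_categories": {
            "significant_terms": {"field": "category"}
        },
        # Groups geo points into grid cells; higher precision means smaller cells.
        "cells": {
            "geohash_grid": {"field": "location", "precision": 5}
        },
    },
}

print(json.dumps(request_body, indent=2))
```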
Examples
Montevideo traffic fines
catalogodatos.gub.uy
github.com/vierja/geojson_montevideo
Queries: gist.github.com/vierja/aad3cd56fb11af129fae
SFPD Crime Incident Reporting
elasticsearch.demos.tryolabs.com
Thanks! Questions?