Time Series Data Models and the future of InfluxDB's Query Language

Time Series Data Models & the Query Languages That Love
Them Paul Dix paul@inﬂuxdb.com @pauldix

Future of InﬂuxDB’s Data Model & Query Language

CTO & co-founder makers of

Founder of NYC Machine Learning Meetup

Editor Addison Wesley’s Data & Analytics

Recovering Rubyist

Data Models

Graphite apps.backend.server_01.counters.requests.count

Graphite apps.backend.server_01.counters.requests.count Hierarchy

Graphite apps.backend.server_01.counters.requests.count Value - double precision ﬂoat Time - second
precision epoch

Graphite apps.backend.server_01.counters.requests.count Value - double precision ﬂoat Time - second
precision epoch regular series only!

Regular time series t0 t1 t2 t3 t4 t6 t7
Samples at regular intervals

Irregular time series t0 t1 t2 t3 t4 t6 t7
Events whenever they come in

OpenTSDB sys.cpu.user host=webserver01,cpu=3 1356998400 1.2

OpenTSDB sys.cpu.user host=webserver01,cpu=3 1356998400 1.2 Metric

OpenTSDB sys.cpu.user host=webserver01,cpu=3 1356998400 1.2 Tags (string key/value pairs)

OpenTSDB sys.cpu.user host=webserver01,cpu=3 1356998400 1.2 millisecond precision epoch

OpenTSDB sys.cpu.user host=webserver01,cpu=3 1356998400 1.2 Value int64 or ﬂoat64 (2.4)

Prometheus http_requests_total{method="post",code="200"} 1027 1395066363000

Prometheus http_requests_total{method="post",code="200"} 1027 1395066363000 Metric

Prometheus http_requests_total{method="post",code="200"} 1027 1395066363000 Labels (string key/value pairs)

Prometheus http_requests_total{method="post",code="200"} 1027 1395066363000 Value - ﬂoat64

Prometheus http_requests_total{method="post",code="200"} 1027 1395066363000 millisecond precision epoch

InﬂuxDB 1.x cpu,host=serverA,region=west user=23.2,system=54.1 1465839830100400200

InﬂuxDB 1.x cpu,host=serverA,region=west user=23.2,system=54.1 1465839830100400200 Measurement

InﬂuxDB 1.x cpu,host=serverA,region=west user=23.2,system=54.1 1465839830100400200 Tags (string key/value pairs)

InﬂuxDB 1.x cpu,host=serverA,region=west user=23.2,system=54.1 1465839830100400200 Fields (key/value pairs)

InﬂuxDB 1.x cpu,host=serverA,region=west user=23.2,system=54.1 1465839830100400200 ﬂoat64 value

InﬂuxDB 1.x cpu,host=serverA,region=west foo=23i 1465839830100400200 int64 value

InﬂuxDB 1.x cpu,host=serverA,region=west bar=t 1465839830100400200 bool value

InﬂuxDB 1.x cpu,host=serverA,region=west line=“some text here” 1465839830100400200 string value

InﬂuxDB 1.x cpu,host=serverA,region=west user=23.2,system=54.1 1465839830100400200 nanosecond precision epoch

Differences organization data types precision Graphite hierarchical float64 seconds OpenTSDB
metric, tags float64 milliseconds Prometheus metric, tags float64 milliseconds InfluxDB 1.x metric, tags, fields float64, int64, bool, string nanoseconds

Querying

Data Exploration what series do I have

Retrieval & Computation raw data, transforms, materialized series, aggregates, samples

Organization matters with thousands of series or more

Hierarchy Tree!

Lookup metrics/measurements OpenTSDB /api/search/lookup?query= (all series) Prometheus {__name__=~“.+”} (all series?)
InﬂuxDB SHOW MEASUREMENTS

Lookup tag/label keys OpenTSDB /api/search/lookup?query= (all series) Prometheus {__name__=~“.+”} (all
series?) InﬂuxDB SHOW TAG KEYS

Lookup tag/label values OpenTSDB /api/search/lookup?query={host=*} (all series with host?) Prometheus
{host=~“.+”} (all series with host?) InﬂuxDB SHOW TAG VALUES WITH KEY = “host” SHOW TAG VAVUES FROM “cpu” with KEY = “host”

Drill Down OpenTSDB /api/search/lookup?query={host=*,service=mysql} (all metrics on the hosts) Prometheus
{host=~“.+”,service=mysql} (all metrics on the hosts) InﬂuxDB SHOW TAG VALUES WITH KEY = “host” WHERE “service” = ‘mysql’

Why to care about drill down (faceted search)

Why to care about drill down (faceted search) select nonstop,
LGA, JFK

Facets __name__ host service region group …

Facets __name__ host service region group … go_goroutines go_memstats_alloc_bytes go_memstats_alloc_bytes_total
go_memstats_gc_sys_bytes go_memstats_other_sys_bytes …

go_memstats_gc_sys_bytes go_memstats_other_sys_bytes … host service region group …

go_memstats_gc_sys_bytes go_memstats_other_sys_bytes … host service region group … dynamic hierarchy! (name already selected)

Labels/Tags > Hierarchy

Up Front Design

Powerful Discovery

Slicing, dicing, grouping

Query Languages Query Language Example Graphite functional target, from, until
sumSeries(summarize(water.level.h2o.feet.*, '1hour', 'max')) OpenTSDB http params startTime, endTime, metric, aggregationFunction, ﬁlter, functions, expressions Prometheus functional-ish increase(http_requests_total{job=“prometheus”}[5m]) InﬂuxDB 1.x SQL-ish select mean(system) from cpu where time > now() - 6h group by time(10m)

Functional > SQL or API

Time series are streams

Apply Functions!

Selection what series (streams) are we working with?

Timing what time range are we interested in?

Merging multiple streams into 1

Joining /, *, +, -, &, |, ^, ﬁlter

Partitioning do we slice the stream into blocks of time?

Sampling ﬁrst, last, min, max, ﬁlters

Transforming time shift, derivative, rate, interpolate

Summarizing count, percentile, mean, median, mode, histogram

Future InﬂuxDB!

Subject to Change! *disclaimer

Requirements • Support InﬂuxDB 1.x Data Model • Support InﬂuxDB
1.x QL • Support Prometheus Data Model • Functional Query Language • Rich Query Builder UI • Query Completion CLI • PromQL?

InﬂuxDB 2.0 Data Model • Tags • non-string values? •
Value • int64 • uint64 • ﬂoat64 • bool • string • bytes • Timestamp (nanosecond)

No More Measurement!

No more ﬁelds?! yep, but remember joining and merging!

SIMPLE

InﬂuxDB 2.0 Line Protocol system:cpu,region:cpu,host:a,metric:user_idle 23.2 1491675816

InﬂuxDB 2.0 Line Protocol system:cpu,region:cpu,host:a,metric:user_idle 23.2 1491675816 Tags

InﬂuxDB 2.0 Line Protocol system:cpu,region:cpu,host:a,metric:user_idle 23.2 1491675816 Key

InﬂuxDB 2.0 Line Protocol system:cpu,region:cpu,host:a,metric:user_idle 23.2 1491675816 Value

InﬂuxDB 2.0 Line Protocol system:cpu,region:cpu,host:a,metric:user_idle 23.2 1491675816 Separators

InﬂuxDB 2.0 Line Protocol system:cpu,region:cpu,host:a,metric:user_idle 23.2 1491675816 spaces, /, and
: must be escaped

InﬂuxDB 2.0 Line Protocol system:cpu,region:cpu,host:a,metric:user_idle 23.2 1491675816 ﬂoat64 value

InﬂuxDB 2.0 Line Protocol system:cpu,region:cpu,host:a,metric:user_idle 23 1491675816 ﬂoat64 value

InﬂuxDB 2.0 Line Protocol system:cpu,region:cpu,host:a,metric:user_idle 23 1491675816 time (precision assumed
closest to now)

InﬂuxDB 2.0 Line Protocol system:cpu,region:cpu,host:a,metric:user_idle 23 1491675816000000us time (precision speciﬁed)

InﬂuxDB 2.0 Line Protocol system:cpu,region:cpu,host:a,metric:user_idle 23.2 2017-04-08T14:23:54Z time (RFC3339Nano)

InﬂuxDB 2.0 Line Protocol name:foo 2i 1491675816 int64 value

InﬂuxDB 2.0 Line Protocol name:foo 2u 1491675816 uint64 value

InﬂuxDB 2.0 Line Protocol name:foo [234, 21, 9, 23, 87,
90, 11, 54] 1491675816 bytes value

InﬂuxDB 2.0 Line Protocol name:foo “it’s a string, yo!” 2017-04-18T14:58:00Z
string value

InﬂuxDB 2.0 Line Protocol name:foo f 2017-04-18T14:58:00Z bool value

InﬂuxQL 2.0 functional!

f1(f2(f3(f4(streams)))) Lisp?

Paul Graham, Rich Hickey

D3 d3.select("body") .selectAll("p") .data([4, 8, 15, 16, 23, 42]) .enter().append("p")
.text(function(d) { return "I’m number " + d + "!"; });

Function Chaining!

Series { "id": 24, "meta": { "dataType": "float64", "metricType": "gauge"
}, "tagset": { "host": "A", "region": "B" }, "vector": [ {"value":23.2, "epoch":1491499253}, {"value":78.1, "epoch":1491499263, "tagset":{"host":"B"}} ] }

Matrix [ { "tagset": { "host": "A", }, "vector": [{"value":23.1,
"epoch":1491499253}, {"value":56.2, "epoch":1491499263}] }, { "tagset": { "host": "B" }, "vector": [{"value":23.1, "epoch":1491499253}, {"value":56.2, "epoch":1491499263}] } ]

Example database(name:"testdb") .select(criteria:`"host" = 'A' and "system" = 'cpu'`) .range(startOffset:"-1h")

Named Parameters database(name:"testdb") .select(criteria:`"host" = 'A' and "system" = 'cpu'`)
.range(startOffset:"-1h") Named parameters!

Example Queries database(name:"testdb") .select(criteria:`"host" = 'A' and "system" = 'cpu'`)
.range(startOffset:"-1h") Wrap strings in back ticks to avoid pesky escaping

Database returns Matrix database(name:"testdb") [ { "tagset": { "host": "A",
}, "vector": [ {"value":23.1, “epoch":1491499253}, {"value":56.2, "epoch":1491499263}] }, { "tagset": { "host": "B" }, "vector": [ {"value":23.1, “epoch":1491499253}, {"value":56.2, "epoch":1491499263}] } ]

Select ﬁlters vectors database(name:"testdb") .select(criteria:`"host" = 'A' and "system" =
'cpu'`)

'cpu'`) Tag keys

'cpu'`) Tag values

Complex Criteria database(name:"testdb") .select(criteria:`”t1” = ‘foo’ AND (“t2” = ‘bar’
OR “t3” = ‘asdf’)`)

Criteria Operators • = • != • =~ • !~
• < • > • startsWith • in • notIn

What hosts do we have? database(name:"testdb") .values(key:"host") .sort() .limit(n:20)

How many hosts? database(name:"testdb") .values(key:”host") .count()

// get the cpu load of hosts that have mysql
running var db = database(name:"testdb") db.select( criteria:`"system" = 'cpu' and "metric" = 'load' and "host" in #{ db.select(`"service" = 'mysql'`).values(key:"host") }`) .range(startOffset:"-4h")

running var db = database(name:"testdb") db.select( criteria:`"system" = 'cpu' and "metric" = 'load' and "host" in #{ db.select(`"service" = 'mysql'`).values(key:"host") }`) .range(startOffset:"-4h") Variables

running var db = database(name:"testdb") db.select( criteria:`"system" = 'cpu' and "metric" = 'load' and "host" in #{ db.select(`"service" = 'mysql'`).values(key:"host") }`) .range(startOffset:"-4h") string interpolation

// get the count in 10m periods in the last
24h from an event stream // and ﬁlter that to only include those periods that were 2 sigma above the average var m = database(name:”testdb”).select(criteria:"\"event\" = 'pageview'") .range(startOffset:"-24h") .merge() .window(func:count(),duration:"10m")

24h from an event stream // and ﬁlter that to only include those periods that were 2 sigma above the average var m = database(name:”testdb”).select(criteria:”\”event\” = 'pageview'") .range(startOffset:"-24h") .merge() .window(func:count(),duration:"10m") // this is shorthand for m.stddev.join(op:"*", right:2) var sigma = m.stddev() * 2

24h from an event stream // and ﬁlter that to only include those periods that were 2 sigma above the average var m = database(name:”testdb”).select(criteria:”\”event\” = 'pageview'") .range(startOffset:"-24h") .merge() .window(func:count(),duration:"10m") // this is shorthand for m.stddev.join(op:"*", right:2) var sigma = m.stddev() * 2 // return only the counts 1 sigma above m.ﬁlter(exp:"$ > #{sigma}")

// return the last hour of time series of the
top 10 host cpu utilizations by // their average load over last 10 minutes var topTen = db.select(criteria:”\”metric\” = 'load' and system = 'cpu'") .range(startOffset:"-10m") .mean() .sort(func:ﬁrst()) .slice(end:10) .values(key:"host") db.select(criteria:”\"metric\" = 'load' and system = 'cpu' and host in #{topTen}") .range(startOffset:"-1h")

Functions • interpolate • join • merge • timeShift •
window • rate • ﬁrst, last, min, max, mean, percentile, etc.

Public docs PR in two weeks! please to give feedback
:)

Thank you. Paul Dix paul@inﬂuxdb.com @pauldix

Time Series Data Models and the future of Influ...

Time Series Data Models and the future of InfluxDB's Query Language

More Decks by Paul Dix

Other Decks in Technology

Featured

Transcript