Upgrade to Pro — share decks privately, control downloads, hide ads and more …

Large-scale OLAP with Kobayashi

Large-scale OLAP with Kobayashi

A talk about a large-scale time-series OLAP engine I built on top of Riak called Kobayashi. Watch the talk here https://vimeo.com/42902962

Dietrich Featherston

May 18, 2012
Tweet

More Decks by Dietrich Featherston

Other Decks in Technology

Transcript

  1. Large-scale OLAP with Kobayashi Boundary Tech Talks Fri, May 18,

    2012 Dietrich Featherston, Boundary @d2fn Friday, October 12, 12
  2. dimensions (11) epoch seconds epoch minutes epoch hours meter id

    source ip source port dest ip dest port interface country network measurements (4) egress packets egress octets ingress packets ingress octets Friday, October 12, 12
  3. Total Volume. by Host Port/Protocol Country Network + meter For

    each aggregation period Friday, October 12, 12
  4. { { 100 meters 10 seconds Riak Key Layout <

    80KB Friday, October 12, 12
  5. Find 45 minutes of total traffic seen on meters 1,

    2, 226, & 301 starting 18 hours ago broken down by traffic type Friday, October 12, 12
  6. < 80KB { { 100 meters 10 seconds Atomic Unit

    of Storage Friday, October 12, 12
  7. 0 (0,99) 100 (100,199) 200 (200,299) 300 (300,399) 400 (400,499)

    t0 t1 t2 t3 t4 t5 t6 t7 t8 t9 t10 t11 t12 t13 t14 t15 t18 t19 Time Meter Id Step 1: fetch appropriate blocks (riak) 45 min 1 2 226 301 Friday, October 12, 12
  8. 0 (0,99) 100 (100,199) 200 (200,299) 300 (300,399) 400 (400,499)

    t0 t1 t2 t3 t4 t5 t6 t7 t8 t9 t10 t11 t12 t13 t14 t15 t18 t19 Time Meter Id Step 2: filter 45 min 1 2 226 301 Friday, October 12, 12
  9. topk( , 10) Step 3: aggregate and perform top-k 1

    2 226 301 45 min + { epochMillis: 1337230140000 portProtocol: "4740:6" ingressPackets: 370482 ingressOctets: 3113782199 egressPackets: 343780 egressOctets: 37126033 }, { epochMillis: 1337230140000 portProtocol: "9092:6" ingressPackets: 440915 ingressOctets: 1816615857 egressPackets: 481237 egressOctets: 1312198133 }, ... Friday, October 12, 12
  10. Open source the data cubing and predicate matching code Query

    grammar for kobayashi Friday, October 12, 12