IoT Data Processing and Analytics 101

Michael Hausenblas, Developer & Cloud Advocate | 2015-11-03 | EclipseCon
© 2015 Mesosphere, Inc. All Rights Reserved.

CITIES (slide 9)
Image © 2014, Wired magazine

OVERALL FOCUS (slide 11)
• Devices
• IoT Gateways
• Networks
• Backend Systems
iot.eclipse.org

LET'S TALK ABOUT WORKLOADS* … (slide 13)
• batch
• streaming
• PaaS
• MapReduce
*) kudos to Timothy St. Clair, @timothysc

MESSAGE QUEUES & ROUTERS (slide 14)
• Kafka
• ØMQ, RabbitMQ, Disque (Redis-based), etc.
• fluentd, Logstash, Flume, etc.
• Akka streams
• cloud-only: AWS SQS, Google Cloud Pub/Sub
• see also queues.io

APACHE KAFKA (slide 15)
Message queues & routers | kafka.apache.org
• High-throughput, distributed, persistent publish-subscribe messaging system
• Originates from LinkedIn
• Typically used as buffer/de-coupling layer in online stream processing
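To illustrate the buffer/de-coupling idea, here is a minimal in-memory sketch of a persistent publish-subscribe log. It is a toy model and not the Kafka API: the `ToyLog` class, its `publish` and `poll` methods, the topic name, and the consumer group names are all hypothetical. The point it shows is that messages are retained after being read, so each consumer group progresses through the same log at its own pace.

```python
from collections import defaultdict


class ToyLog:
    """Toy append-only log per topic (hypothetical, not the Kafka API)."""

    def __init__(self):
        self._topics = defaultdict(list)   # topic -> retained messages
        self._offsets = defaultdict(int)   # (group, topic) -> next offset to read

    def publish(self, topic, message):
        """Append a message and return its offset within the topic."""
        log = self._topics[topic]
        log.append(message)
        return len(log) - 1

    def poll(self, group, topic, max_messages=10):
        """Return unread messages for this group and advance its offset."""
        start = self._offsets[(group, topic)]
        batch = self._topics[topic][start:start + max_messages]
        self._offsets[(group, topic)] = start + len(batch)
        return batch


log = ToyLog()
for reading in (21.5, 21.7, 22.1):
    log.publish("sensor.temperature", reading)

# Two independent consumer groups read the same retained messages.
alerts = log.poll("alerting", "sensor.temperature")
archive = log.poll("archiver", "sensor.temperature", max_messages=2)
rest = log.poll("archiver", "sensor.temperature")
```

Because the producer only appends and each group keeps its own offset, a slow consumer (here the hypothetical "archiver") does not hold back a fast one.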

FLUENTD (slide 16)
Message queues & routers | www.fluentd.org

STREAM PROCESSING PLATFORMS (slide 17)
• Storm
• Spark
• Samza
• Flink
• Concord
• cloud-only: AWS Kinesis, Google Cloud Dataflow
• see also my webinar on stream processing

APACHE STORM (slide 18)
Stream processing platforms | storm.apache.org
• Distributed, fault-tolerant stream-processing platform
• Guaranteed message processing (replaying messages on failure)
• Concepts: tuples, streams, spouts, bolts, topologies
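The concepts listed above can be sketched in a few lines of plain Python. This is a single-process illustration and not the Storm API: `sensor_spout`, `threshold_bolt`, `CountBolt`, and `run_topology` are hypothetical names, the readings are made up, and the one simulated failure exists only to show what replaying a message on failure means.

```python
from collections import Counter, deque


def sensor_spout():
    """Spout: source of the stream, emitting (device_id, temperature) tuples."""
    yield from [("dev-1", 21.5), ("dev-2", 35.2), ("dev-1", 36.0), ("dev-2", 20.9)]


def threshold_bolt(tup, limit=30.0):
    """Bolt: pass on only the tuples whose reading exceeds the limit."""
    device_id, temperature = tup
    return [(device_id, temperature)] if temperature > limit else []


class CountBolt:
    """Bolt: terminal step that counts alerts per device."""

    def __init__(self):
        self.counts = Counter()
        self._failed_once = False

    def process(self, tup):
        # Simulate one transient failure to show why replay matters.
        if not self._failed_once:
            self._failed_once = True
            raise RuntimeError("transient failure")
        self.counts[tup[0]] += 1


def run_topology(spout, counter):
    """Topology: spout -> threshold_bolt -> counter, replaying tuples that fail."""
    pending = deque(spout)
    while pending:
        tup = pending.popleft()
        try:
            for out in threshold_bolt(tup):
                counter.process(out)
        except RuntimeError:
            pending.append(tup)  # replay the source tuple later


counter = CountBolt()
run_topology(sensor_spout(), counter)
```

The first alert tuple fails once, is put back on the queue, and is counted on its second attempt, so no alert is lost.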

APACHE SPARK (slide 19)
Stream processing platforms | spark.apache.org
• Libraries: Spark SQL, Spark Streaming, MLlib (machine learning), GraphX (graph processing)
• Engine: Spark core (RDD)
• Cluster managers: Mesos, YARN, Standalone
• Storage: filesystem (local, HDFS, S3) or data store (HBase, Cassandra, Elasticsearch, etc.)

TIME SERIES DATASTORES (slide 20)
• InfluxDB
• OpenTSDB
• KairosDB
• Prometheus
• see also iot-a.info

OPENTSDB (slide 21)
Time series datastores | opentsdb.net
• Distributed time series database on top of HBase
• Store, index, query & plot metrics
• Extremely scalable
• Low-level monitoring
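As a rough illustration of what "store, index, query" means for metrics, the following sketch keeps data points in memory, keyed by metric name and tags, and downsamples a query result into fixed time buckets. It is not the OpenTSDB API: `ToyTSDB`, `put`, `query`, and `downsample_mean` are hypothetical names, and the sample readings are made up.

```python
from collections import defaultdict


class ToyTSDB:
    """Toy in-memory time series store (hypothetical, not the OpenTSDB API)."""

    def __init__(self):
        # (metric, sorted tag pairs) -> list of (timestamp, value)
        self._series = defaultdict(list)

    def put(self, metric, timestamp, value, **tags):
        """Store one data point under its metric name and tags."""
        key = (metric, tuple(sorted(tags.items())))
        self._series[key].append((timestamp, value))

    def query(self, metric, start, end, **tags):
        """Return points in [start, end) from series carrying all given tags."""
        wanted = set(tags.items())
        points = []
        for (name, series_tags), data in self._series.items():
            if name == metric and wanted <= set(series_tags):
                points.extend(p for p in data if start <= p[0] < end)
        return sorted(points)


def downsample_mean(points, bucket_seconds):
    """Average the values per time bucket; keys are bucket start timestamps."""
    buckets = defaultdict(list)
    for timestamp, value in points:
        buckets[timestamp - timestamp % bucket_seconds].append(value)
    return {start: sum(vs) / len(vs) for start, vs in sorted(buckets.items())}


db = ToyTSDB()
db.put("temperature", 0, 20.0, device="dev-1")
db.put("temperature", 30, 22.0, device="dev-1")
db.put("temperature", 60, 24.0, device="dev-1")
db.put("temperature", 10, 30.0, device="dev-2")

points = db.query("temperature", 0, 120, device="dev-1")
per_minute = downsample_mean(points, 60)
```

Filtering by tag and averaging per bucket are the two operations a dashboard typically needs before plotting a metric.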

INFLUXDB (slide 22)
Time series datastores | influxdb.com
• No-dependency time series database written in Go
• SQLish query language (incl. regex, fan out)
• Single-node or Raft-based distributed mode

DCOS IS A DISTRIBUTED OPERATING SYSTEM (slide 25)
• local OS per node (+container enabled)
• scheduling (long-lived, batch)
• networking
• service discovery
• stateful services
• security
• monitoring, logging, debugging

BENEFITS (slide 28)
DCOS
• Run stateless services (nginx, Java app servers, etc.) and Big Data services (Spark, Kafka, Cassandra, etc.) together on one cluster
• Dynamic partitioning of your cluster, depending on your needs (business requirements)
• Increased utilization: ca. 10% → 80%+
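To make the utilization claim concrete, here is a small worked calculation. The workload size of 80 fully busy node-equivalents is a hypothetical figure chosen for illustration, and `nodes_needed` is a hypothetical helper; only the 10% and 80% utilization levels come from the slide.

```python
def nodes_needed(busy_node_equivalents, utilization_percent):
    """Nodes required to host a workload at a given average utilization, rounded up."""
    # Integer ceiling division avoids floating-point rounding surprises.
    return -(-busy_node_equivalents * 100 // utilization_percent)


workload = 80  # hypothetical: work equal to 80 fully busy nodes

siloed = nodes_needed(workload, 10)   # clusters running at ca. 10% utilization
shared = nodes_needed(workload, 80)   # one shared cluster at 80% utilization
```

Under these assumptions the same work fits on 100 nodes instead of 800.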

WHY BOTHER? (slide 2)

AIRLINES (slide 4)

LOGISTICS (slide 5)

HEALTH CARE (slide 6)

TRADERS (slide 7)

FARMERS (slide 8)

YOU (slide 10)

THE TOOLBOX (slide 12)

MEET THE DATACENTER

LOCAL OS VS.

DEMO TIME! (slide 29)

MESOSPHERE IS HIRING,

Q & A