Blizzard: Building a Near Real-Time Data Pipeline

Blizzard Entertainment
March 8th, 2017
Building a Near Real-Time Pipeline for All Things Blizzard
Jordan Irwin / Chris Burkhart, Technical Leads
@jordanirwin / @ctide

Who are we?
• Jordan Irwin, Technical Lead, Data Team
• Chris Burkhart, Technical Lead, Data Team

Is This Easy Mode?
• Step 1: Generate Good Data
• Step 2: Collect and Analyze
• ???
• Step 3: Profit!

Quest List
• Brief history of Big Data at Blizzard
  – Where it began
  – The world could use more hero…ic data
  – Glimpse into our future
• Elastic Stack: GG or OMG?
• Lessons learned
• Tidbits

A long long time ago…
• Protocol Buffers as IDL
• Server data only
• Publish directly to RMQ
• Federation galore
• Map/Reduce all the things
• Standard “data lake” approach

Back Then
[Architecture diagram: Game Server, Game Client*, API, RMQ, Flume, Hadoop, Map/Reduce]
* Limited client support eventually added…

The Good Parts
• From zero to hero: It worked!
• Data-driven decisions now possible
• Positive “data culture” formed
• Protocol Buffers well established
• Foundation for Big Data at Blizzard

The Bad Parts
• Schemas coordinated via emails (if at all)
• Map/Reduce requires specialized expertise
• More effort preparing data vs analyzing it
• RMQ scaling became non-trivial

You must construct additional pylons
Some goals…
• ~20 billion messages/day
• Schema Registry
• Collect data from anywhere
• “Free the data”

Road to Overwatch
[Architecture diagram: Game Server, Game Client, SDK, API, Git repo for Schemas, Kafka, Hadoop, Map/Reduce, Metrics, Logs, Logstash (shown twice), Elasticsearch, Tribe, Kibana]

[Diagram: five clusters and a Tribe node]

Immediate Winz
• Client data meant new ways to debug
  – CCU drops tied to ISPs
  – Network quality reports
  – Measurable customer impact
• Even better than server monitoring!
• Centralized log searching
  – RIP grep
• All in near real-time!

The Good Parts
• Elasticsearch + Kibana accessibility
  – Single “pane of glass”
  – Easy to use
  – Instant data
• “Free the data” worked
• Much higher scalability
• SDKs for multiple languages/platforms
  – C#, C++ (PC/Xbox/PS4)
• Offered a schema storage place
• The business LOVED IT

The Bad Parts
• Schemas not required and avoided
  – Not really a true registry
  – Dynamic mapping nightmares
  – Converted “data lake” into “data swamp”
• Map/Reduce all-the-things still a problem
• Tribe Node instability meant frequent outages
• Metrics solution wasn’t scalable (ingest)
• Logging wasn’t sustainable (configuration)
We needed a bigger boat...
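One common way to contain dynamic mapping problems in Elasticsearch is to set `dynamic` to `strict`, so that documents carrying undeclared fields are rejected instead of silently extending the mapping. The slides do not say whether the team did this; the sketch below only builds such a mapping fragment, and the field names are made up for illustration.

```python
import json
from typing import Any, Dict


def strict_mapping(field_types: Dict[str, str]) -> Dict[str, Any]:
    """Build a mapping fragment that refuses fields not declared up front."""
    return {
        # "strict" makes Elasticsearch reject documents with unknown fields.
        "dynamic": "strict",
        "properties": {name: {"type": es_type} for name, es_type in field_types.items()},
    }


# Hypothetical telemetry fields; not taken from the talk.
mapping = strict_mapping({"timestamp": "date", "latency_ms": "long"})

# Note: Elasticsearch versions before 7.x nest this fragment under a
# mapping type name inside "mappings".
print(json.dumps(mapping, indent=2))
```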

MOAR PYLONS!
Reconsidered goals…
• ~100 billion messages/day
• Schema Registry revisited and required
• Collect data from anywhere
• “Free the data” even more
• Easy to onboard
• Dogfood everything
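For a sense of scale, the two daily goals from the slides can be converted into average per-second rates. This is plain arithmetic on the stated figures and assumes perfectly even traffic, which real game telemetry will not have, so peak rates would be higher.

```python
SECONDS_PER_DAY = 24 * 60 * 60  # 86,400


def average_rate_per_second(messages_per_day: int) -> float:
    """Average messages per second if the daily volume were spread evenly."""
    return messages_per_day / SECONDS_PER_DAY


earlier_goal = average_rate_per_second(20_000_000_000)    # ~20 billion/day
revised_goal = average_rate_per_second(100_000_000_000)   # ~100 billion/day

print(f"{earlier_goal:,.0f} messages/s")  # 231,481 messages/s
print(f"{revised_goal:,.0f} messages/s")  # 1,157,407 messages/s
```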

Today’ish
[Architecture diagram: Game Client, Game Server, SDK, TDK, API, Schema Registry, Kafka, Enrich, Kafka, Hadoop, Elasticsearch, Tribe*, Kibana, Logs, Metrics, …]

The Good Parts
• Required and robust Schema Registry
  – “What You Registered Is What You Get” (WYRWYG)
• Telemetry Development Kit (TDK)
• Improved and expanded SDKs
  – C#, C++ (PC/Mac/Xbox/PS4), Python, Java, NodeJS, Android, Unity, Go*
• Documentation prioritization
• Telem-Telem: Dog food is tasty
• Stable Tribe Nodes (Thanks Elastic!)
• Extendible for more features
• Less map/reduce, moar insight

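The talk does not show how the registry enforces WYRWYG. As a minimal sketch of the idea, assuming a hypothetical in-memory registry (the class, method, schema, and field names are illustrative and not Blizzard's API), a message is accepted only if its fields exactly match what was registered:

```python
from typing import Any, Dict


class SchemaRegistry:
    """Toy in-memory registry: schema name -> {field name: Python type}."""

    def __init__(self) -> None:
        self._schemas: Dict[str, Dict[str, type]] = {}

    def register(self, name: str, fields: Dict[str, type]) -> None:
        self._schemas[name] = dict(fields)

    def validate(self, name: str, message: Dict[str, Any]) -> None:
        """Raise if the message deviates from the registered schema."""
        if name not in self._schemas:
            raise KeyError(f"schema {name!r} is not registered")
        fields = self._schemas[name]
        unknown = set(message) - set(fields)
        if unknown:
            raise ValueError(f"unregistered fields: {sorted(unknown)}")
        missing = set(fields) - set(message)
        if missing:
            raise ValueError(f"missing fields: {sorted(missing)}")
        for field, expected in fields.items():
            if not isinstance(message[field], expected):
                raise TypeError(f"{field!r} should be {expected.__name__}")


registry = SchemaRegistry()
registry.register("match_finished", {"match_id": str, "duration_s": int})

# Accepted: exactly the registered fields with the registered types.
registry.validate("match_finished", {"match_id": "abc", "duration_s": 600})

# Rejected: "map" was never registered, so it cannot sneak downstream.
try:
    registry.validate("match_finished", {"match_id": "abc", "duration_s": 600, "map": "x"})
except ValueError as error:
    print(f"rejected: {error}")
```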
The Bad Parts
• Deprecated metrics support (for now)
• Limited logging support
• Dozens of global Elasticsearch clusters constituting a single system isn’t trivial (but still possible!)
  – Monitoring
  – Logging
  – Auditing
  – JVM GC
  – Updates…

Future
• Upgrade to 5.x Elastic Stack (/shivers)
• Logging 4realz
• Metrics 4sho (w/rollup!)
• Custom transforms
• Subscriptions
• ODBC/JDBC
• Machine Learning
• … and much, much more!

The Good Parts
• Leverages existing foundation
• Low-risk updates allow major features
• Pairs tools with access patterns
• Favors extensibility

The Bad Parts
• NONE
• It’ll be PERFECT!

Think Globally
• Proven architecture
• Vetted by influential companies
• Best parts of popular pipelines
• Blizzard will be a global leader in Big Data, Soon™

GG Elastic
• “Free the data” contributor
• Kibana makes data accessible
• Tribe Nodes centralize data
• Aliases abstract index names
• Fast time to insight
• APIs allow tooling
• Shield controls access
• Communication with Elastic has been great!
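To illustrate the alias point: clients query a stable alias while the index behind it changes. The sketch below builds a request body for Elasticsearch's `_aliases` endpoint, which applies the listed actions atomically. The alias and index names are hypothetical, and nothing here is taken from Blizzard's setup.

```python
import json
from typing import Any, Dict


def alias_swap_body(alias: str, old_index: str, new_index: str) -> Dict[str, Any]:
    """Build an _aliases body that repoints `alias` from one index to another."""
    return {
        "actions": [
            {"remove": {"index": old_index, "alias": alias}},
            {"add": {"index": new_index, "alias": alias}},
        ]
    }


# Hypothetical names: readers keep querying "telemetry-current" throughout.
body = alias_swap_body(
    "telemetry-current", "telemetry-2017.03.07", "telemetry-2017.03.08"
)

# This JSON would be sent as the body of POST /_aliases.
print(json.dumps(body, indent=2))
```

Because dashboards and tools only ever see the alias, indices can be rotated or rebuilt without changing saved searches.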

OMG Elastic
• Shield can get complicated
• Kibana multi-tenancy needs love
• Tribes are great… when they work
• Logs can be spammy
• Auditing gaps (who did what?)
• Bad actors can ruin the fun

We were not prepared
• Take schema management seriously
• Let use-cases drive development
• Expect success
• Get data flowing ASAP

Data Data
• Message Rate
  – Billions/day
• Elasticsearch Storage
  – Hundreds of Terabytes
• HDFS Storage
  – Petabytes
So sorry, no real details :(

Shameless “Plug”
• Using NodeJS with Kafka?
  – We open sourced node-rdkafka
  – https://github.com/Blizzard/node-rdkafka
• We’re Hiring!
  – Know someone?
  – Am someone?
  – Java / Scala / Kafka / Hadoop / Big Data
  – http://careers.blizzard.com

Thanks for listening! Questions?

More Questions? Visit us at the AMA

www.elastic.co
