Cyber Security Log Analytics at Decision Lab

Nathan Necaise and Drew Malone
Oct. 6, 2015

Our Cyber Security Stories
• Why should we care
• What a solution should provide
• How did we get to ELK
  ▪ Hadoop
  ▪ Hadoop with Lambda
  ▪ MEAN Stack
  ▪ KLEAN Stack
• What about other COTS solutions
• Next

So What’s the Big Deal?
• Data Exhaust
• Evolution of disparate data sources
• IRS and OPM breaches
• Shellshock, Heartbleed, and more
• Heterogeneous environment
• Reactionary
• Decisions based on anecdotal evidence

What Does a Solution Need?
• Aggregation
• Correlation
• Alerting
• Compliance verification
• Data Retention
• Forensic/Historic analysis
• Work well with existing and new technologies

How We Started
• What we could now do:
  ▪ Analysis of large volumes of data
  ▪ Excellent batch and historic insight
  ▪ Allowed both quick prototyping and fine-grained control
• What was still not present:
  ▪ Real-time analysis
  ▪ Barrier to entry via need for Java, Pig, etc.
  ▪ Output was not visual and intuitive
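The batch-style analysis described on this slide can be illustrated with a map/reduce-shaped job. The following is a minimal pure-Python sketch of that shape, not Hadoop or Pig code and not the authors' implementation. The log format ("<timestamp> <host> <event>"), the FAILED_LOGIN event name, and all function names are assumptions made for illustration.

```python
from collections import defaultdict
from typing import Dict, Iterable, Iterator, Tuple


def map_failed_logins(lines: Iterable[str]) -> Iterator[Tuple[str, int]]:
    """Map step: emit (host, 1) for every failed-login line."""
    for line in lines:
        fields = line.split()
        # Hypothetical log format: "<timestamp> <host> <event>"
        if len(fields) == 3 and fields[2] == "FAILED_LOGIN":
            yield fields[1], 1


def reduce_counts(pairs: Iterable[Tuple[str, int]]) -> Dict[str, int]:
    """Reduce step: sum the emitted counts per host."""
    totals: Dict[str, int] = defaultdict(int)
    for host, count in pairs:
        totals[host] += count
    return dict(totals)


# Hypothetical sample data, used only to exercise the two steps.
SAMPLE_LOG = [
    "2015-10-06T10:00:00 web01 FAILED_LOGIN",
    "2015-10-06T10:00:05 web01 LOGIN_OK",
    "2015-10-06T10:00:09 db02 FAILED_LOGIN",
    "2015-10-06T10:01:00 web01 FAILED_LOGIN",
]

failed_by_host = reduce_counts(map_failed_logins(SAMPLE_LOG))
```

A job like this gives good historic insight over a full data set, but it only answers the question it was written for and only after the batch completes, which matches the gaps listed on the slide.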

Hadoop Solution

Let’s build on this starting point
• What we gained:
  ▪ Real-time analysis
  ▪ Ability to perform low-latency queries into vast amounts of data
• What was still a challenge:
  ▪ We had to support a custom web application
  ▪ Missing flexibility in queries
  ▪ Lead time required for unforeseen questions
  ▪ More complexity
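In a Lambda architecture, a query is typically answered by combining a precomputed batch view with a real-time view that covers events received since the last batch run. The sketch below shows only that merge step, under the assumption that both views are per-host event counts; the function name and sample data are hypothetical and do not describe the authors' system.

```python
from collections import Counter
from typing import Dict


def merge_views(batch_view: Dict[str, int],
                realtime_view: Dict[str, int]) -> Dict[str, int]:
    """Serving-layer merge: batch totals plus counts seen since the last batch run."""
    merged = Counter(batch_view)
    merged.update(realtime_view)  # Counter.update adds counts, it does not overwrite
    return dict(merged)


# Hypothetical views: the batch layer is complete up to the last run,
# the speed layer holds only events that arrived after it.
batch_view = {"web01": 120, "db02": 7}
realtime_view = {"web01": 3, "mail03": 1}

current_counts = merge_views(batch_view, realtime_view)
```

Every new question needs its own batch view, real-time view, and merge logic, which is one way to read the "lead time required for unforeseen questions" and "more complexity" points above.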

Lambda Architecture

Let’s Check Out This MEAN Stack
• What worked well:
  ▪ Nice and low barrier to entry (JSON + JavaScript)
  ▪ Flattened the tech stack
• What challenges did we find:
  ▪ Higher sustainment and maintenance cost
  ▪ Scaling was more complex and time-consuming
  ▪ Limitations in the aggregation pipeline for complex queries
  ▪ Still spending more time developing than solving problems
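For readers who have not used it, a MongoDB aggregation pipeline is an ordered list of stages, each a small JSON document. The sketch below builds one such pipeline as plain Python data using the standard $match, $group, $sort, and $limit stages. The field names (event, host, ts) and the function name are assumptions, and nothing here is taken from the authors' code.

```python
from typing import Any, Dict, List


def failed_logins_pipeline(since_iso: str, top_n: int = 10) -> List[Dict[str, Any]]:
    """Build a pipeline that counts failed logins per host since a given time."""
    return [
        # Keep only failed logins at or after the cutoff (hypothetical fields).
        {"$match": {"event": "FAILED_LOGIN", "ts": {"$gte": since_iso}}},
        # Count matching documents per host.
        {"$group": {"_id": "$host", "failures": {"$sum": 1}}},
        # Largest counts first, then keep the top N hosts.
        {"$sort": {"failures": -1}},
        {"$limit": top_n},
    ]


pipeline = failed_logins_pipeline("2015-10-01T00:00:00Z", top_n=5)
```

With a driver such as pymongo, a list like this would be passed to `collection.aggregate(pipeline)`. Simple counts like this one are straightforward; the slide's point is that more complex questions became harder to express this way.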

MEAN Stack

K, let’s try to keep all the good and toss the bad…
• We already use Elasticsearch for full-text searching. Why don’t we make it structured search and use it as a big data back-end?
• How did this work:
  ▪ Surprisingly easy transition (MEAN to KLEAN in prod ~5 days)
  ▪ Better query performance
  ▪ More flexibility in queries
  ▪ Crazy simple scalability
  ▪ Simple stack compared to Hadoop
  ▪ Support for both simple and custom ingest
  ▪ Kibana!
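Using Elasticsearch for structured search rather than full-text search means filtering on exact field values and time ranges, then aggregating. The sketch below builds such a request body as a Python dictionary, using the `bool`/`filter` query and `terms` aggregation from the Elasticsearch query DSL as found in recent versions (clusters from the era of this talk may have used the older `filtered` query form). The field names follow no schema from the deck; they and the function name are hypothetical.

```python
import json
from typing import Any, Dict


def failed_logins_by_host_query(days: int = 7, top_n: int = 10) -> Dict[str, Any]:
    """Build a search body: exact-match filters plus a per-host count."""
    return {
        "size": 0,  # return only aggregation results, no individual hits
        "query": {
            "bool": {
                "filter": [
                    # Exact match on a keyword-style field (hypothetical name).
                    {"term": {"event_type": "failed_login"}},
                    # Restrict to the last N days using date math.
                    {"range": {"@timestamp": {"gte": f"now-{days}d"}}},
                ]
            }
        },
        "aggs": {
            "by_host": {"terms": {"field": "host", "size": top_n}}
        },
    }


body = failed_logins_by_host_query(days=30, top_n=5)
body_json = json.dumps(body)  # what would be sent to the _search endpoint
```

Changing the question means changing this dictionary, not writing a new batch job or view, which is one plausible reading of the "more flexibility in queries" point above.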

KLEAN Stack

That’s great, so what…
• Because our time is focused on solving problems and not maintenance and sustainment of the technology, we can do things like:
  ▪ Tell where vulnerable versions of software are present in a highly fragmented enterprise
  ▪ Deliver insight into data based on roles and accesses by integrating with in-house authorization services
  ▪ Retrospectively query for prior evidence of newly found malicious behavior
  ▪ Automatically discover the uncommonly common and anomalous trends
  ▪ Make data-based decisions and not rely on stale data and intuition
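As one illustration of the first capability above, locating vulnerable software versions reduces to comparing collected inventory records against the version in which a flaw was fixed. The sketch below assumes purely numeric dotted versions and a hypothetical record layout (host, package, version). It is not the authors' method, and real version strings often need a more careful parser.

```python
from typing import Dict, Iterable, List, Tuple


def parse_version(version: str) -> Tuple[int, ...]:
    """Turn '2.4.10' into (2, 4, 10) so versions compare numerically."""
    return tuple(int(part) for part in version.split("."))


def vulnerable_hosts(inventory: Iterable[Dict[str, str]],
                     package: str, fixed_in: str) -> List[str]:
    """Return sorted hosts running `package` at a version older than `fixed_in`."""
    fixed = parse_version(fixed_in)
    return sorted({
        record["host"]
        for record in inventory
        if record["package"] == package
        and parse_version(record["version"]) < fixed
    })


# Hypothetical inventory records, as might be indexed from host scans.
INVENTORY = [
    {"host": "web01", "package": "examplelib", "version": "2.4.9"},
    {"host": "web02", "package": "examplelib", "version": "2.4.10"},
    {"host": "db01", "package": "examplelib", "version": "2.3.0"},
    {"host": "db01", "package": "otherlib", "version": "1.0.0"},
]

at_risk = vulnerable_hosts(INVENTORY, "examplelib", "2.4.10")
```

Comparing parsed tuples rather than raw strings matters here: as text, "2.4.9" sorts after "2.4.10", which would hide web01 from the result.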

Did we try Splunk?
• Sister projects used Splunk. We worked closely with large Splunk deployments.
• It works for some scenarios: moderate data, nice out-of-the-box UIs
• Why didn’t we also use it?
  ▪ No way to test drive at scale
  ▪ 500 MB/day doesn’t allow me to determine if it will fit my needs
  ▪ High cost for our scale
  ▪ Our data is not always time series
  ▪ Poor performance compared to Elasticsearch at scale
  ▪ Hunk: emphasis on summarizing data before ingest
  ▪ Splunk licenses not renewed

What have we done since?

What next?
• Continue to resolve pain points
• Automate everything, everywhere
