Slide 1

Slide 1 text

Kai Wähner Technology Evangelist kontakt@kai-waehner.de LinkedIn @KaiWaehner www.kai-waehner.de O’Reilly Software Architecture Conference 2016 (London, UK) Log Analytics for Distributed Microservices

Slide 2

Slide 2 text

© Copyright 2000-2016 TIBCO Software Inc. Can you answer to these questions? • Are you storing all of your logs for enough time to answer the question “What happened?” a week from now? How about a year from now? • Can you issue a single search across all your machine data - regardless of source or type? • Can you set an alert that would trigger from any source in your enterprise? • Do you analyze and correlate all events in your distributed microservice architecture? • What about predictive monitoring?

Slide 3

Slide 3 text

© Copyright 2000-2016 TIBCO Software Inc. Key Takeaways • Log Analytics is needed to monitor distributed microservice architectures • Consolidation of broad range of events is key to enabling business insights • Log Analytics is complementary to other Big Data components

Slide 4

Slide 4 text

© Copyright 2000-2016 TIBCO Software Inc. Agenda • Distributed Microservice Log Events • Introduction to Log Analytics • Market Overview • Relation to other Big Data Components

Slide 5

Slide 5 text

© Copyright 2000-2016 TIBCO Software Inc. Agenda • Distributed Microservice Log Events • Introduction to Log Analytics • Market Overview • Relation to other Big Data Components

Slide 6

Slide 6 text

© Copyright 2000-2016 TIBCO Software Inc. Scenarios for Distributed Log Events Infrastructure • Log Management – Applications – SOA – Microservices – Cloud Platforms – SaaS • Transaction Tracing • Root Cause Analysis • Visual Analytics on Machine Data Competitive Undermining • Filtering / Cost Avoidance Solution IT Operations • Troubleshooting Connectivity • Outage Troubleshooting • Application Monitoring / Tracking • Service Level Confirmation for IT Outsourcing Security • Centralized Log / Event Management Platform • Security • Fraud Detection Compliance • PCI Compliance • Retention Compliance • Service Level Confirmation for IT Outsourcing

Slide 7

Slide 7 text

© Copyright 2000-2016 TIBCO Software Inc. Distributed Microservice Architecture http://blogs.gartner.com/gary-olliffe/2015/01/30/microservices-guts-on-the-outside/ ”That complexity has moved and […] increased [to] the outer architecture.”

Slide 8

Slide 8 text

© Copyright 2000-2016 TIBCO Software Inc. 12 Factor Apps for Cloud Native Microservices Codebase One codebase tracked in revision control, many deploys. Dependencies Explicitly declare and isolate dependencies. Config Store config in the environment. Backing Services Treat backing services as attached resources. Build, Release, Run Strictly separate build and run stages. Processes Execute the app as one or more stateless processes. Port Binding Export services via port binding. Concurrency Scale out via the process model. Disposability Maximize robustness with fast startup and graceful shutdown. Dev / Prod Parity Keep dev, staging, and prod as similar as possible. Logs Treat logs as event streams. Admin Processes Run admin/mgmt tasks as one-off processes. https://12factor.net/

Slide 9

Slide 9 text

© Copyright 2000-2016 TIBCO Software Inc. Some Cloud Platforms (PaaS) with Support for 12 Factor Apps With or without such a cloud platform, you need a way to aggregate and analyze distributed microservice logs. … to treat logs as event streams. ! !

Slide 10

Slide 10 text

© Copyright 2000-2016 TIBCO Software Inc. Agenda • Distributed Microservice Log Events • Introduction to Log Analytics • Market Overview • Relation to other Big Data Components

Slide 11

Slide 11 text

© Copyright 2000-2016 TIBCO Software Inc. Distributed Microservice Architecture http://blogs.gartner.com/gary-olliffe/2015/01/30/microservices-guts-on-the-outside/ Microservices means… - distributed services - distributed infrastructure - different technologies - containers and cloud platforms - distributed log messages - unstructured / semi-structured data Log Analytics

Slide 12

Slide 12 text

© Copyright 2000-2016 TIBCO Software Inc. Operational Intelligence Platform for Log Analytics Log Analytics Platform ü Centralize and Store of Record ü Search, Auto-id, Parsing, Correlation ü Forensics and Alerts ü Reports Engine Logs Application Logs Microservices Monitoring Configuration Messaging Web UI API Analysis Tools Data Discovery Streaming Analytics Live Visualization

Slide 13

Slide 13 text

© Copyright 2000-2016 TIBCO Software Inc. How an Operation Intelligence Platform Works INGEST OPERATIONALIZE ANALYZE Collect Data from Any Source Device Logs Web Logs Application & DB Logs Configuration Files OS Metrics Sensor Data Microservice Events Make Unstructured Data Usable Normalize Enrich Transform Index Aggregate Gain Actionable Insight Search Report Alert Correlate Visualize

Slide 14

Slide 14 text

© Copyright 2000-2016 TIBCO Software Inc. Log Analytics Example • May 2 23:06:14 app-1 login[5130]: pam_unix(login:auth): authentication failure; logname=LOGIN uid=0 euid=0 tty=tty1 ruser= rhost= user=timothy • "<13>Feb 5 08:34:55 10.92.2.188 MSWinEventLog 0 Security 106236353 Fri Feb 05 08:33:15 2010 529 Security SYSTEM User Failure Audit OHAEPHQDC009 Logon/Logoff Logon Failure: Reason: Unknown user name or bad password User Name: timothy Domain: Logon Type: 3 Logon Process: CISCO Authentication Package: MICROSOFT_AUTHENTICATION_PACKAGE_V1_0 Workstation Name: CISCO Caller User Name: portal Caller Domain: CORP Caller Logon ID: (0x0,0x63194519) Caller Process ID: 2972 Transited Services: - Source Network Address: - Source Port: - 1679136992" • Jun 11 10:51:04 10.0.0.244 Jun 11 10: 51:42 1,06/11 10:51:42,0001a100200,TRAFFIC,start,24,06/11 10:51:15,10.0.0.101,10.0.0.246,0.0.0.0,0.0.0.0,timothy,,,dns,vsys1,l2-lan-trust,l2- lan-untrust,ethernet1/12,ethernet1/11,Forward to Timothy,06/11 10:51:42,2074963,1,54604,53,0,0,0x0,udp,allow,80,80,80,1,06/11 10:51:16,0,any,0 Source Type User User Name SRC User Unix timothy Windows timothy Firewall timothy Source Type User Unix timothy Windows timothy Firewall timothy • Unix • Windows • Firewall

Slide 15

Slide 15 text

© Copyright 2000-2016 TIBCO Software Inc. Characteristics of Log Management Solutions Data Sources • Log information (standard protocols like TCP, UDP, File, Syslog, JMS) • All events (logs, messaging, streams, ...) • Extendable plugins (connectors, SDK, API) Features • Collect, parse, correlate, search, report, forward, etc. • Store and index • Query Language (SQL, Custom) à sliding windows, correlations, etc. • Retention • Compliance Templates Frequency • Historical data • Near Real Time Processing (seconds or minutes) Deployment Options • On-premise vs. Cloud (SaaS) • Open Source vs. Commercial • Software vs. Hardware Appliance Pricing • Free (open source) vs. CPU-based vs. Volume-based à Be careful here: IoT... Data grows exponentially

Slide 16

Slide 16 text

© Copyright 2000-2016 TIBCO Software Inc. Agenda • Distributed Microservice Log Events • Introduction to Log Analytics • Market Overview • Relation to other Big Data Components

Slide 17

Slide 17 text

© Copyright 2000-2016 TIBCO Software Inc. Market Analysis Segment CAGR Incumbents Challengers Log Management 15% Splunk, TIBCO LogLogic, etc. Open Source (Graylog, “ELK Stack”) SIEM RSA, ArcSight, LogRhythm Splunk, MSSPs (Managed Security Service Provider) ITOA (1.6B) 100% TIBCO LogLogic, Splunk, SumoLogic, AppDynamics, NewRelic APM (2.9B) 10% AppDynamics, NewRelic ITOM (19B) 4% IBM, CA, BMC, MS, HP AppDynamics, NewRelic, Chef, Puppet, Docker, CloudFoundry Rapidly Emerging and Evolving, Encompasses Many Segments Traditional: Log Management, IT Operations Monitoring (ITOM), Security (SIEM) Current: IT Operations Analytics (ITOA), Application Performance Management (APM) Future: DevOps and Continuous Improvement (2.9B)

Slide 18

Slide 18 text

© Copyright 2000-2016 TIBCO Software Inc. Security information and event management (SIEM) SIEM is a specific part of Log Analytics focusing on Security. • Threat management: Early detection of targeted attacks and data breaches • Compliance: Collect, store, analyze and report on log data for incident response, forensics and regulatory compliance • Aggregates event data produced by security devices, network infrastructures, systems and applications Log Analytics handles all kinds of use cases, not focusing on security. http://www.gartner.com/document/3097022 https://www-01.ibm.com/marketing/iwm/dre/signup?source=swg-WW_Security_Organic&S_PKG=ov37658&cm_mmc=Blog_SI-_-Sec_Int-_-Organic-_-IBM-is-a-leader-again-in-2015-gartner-magic-quadrant-for-SIEM SIEM is out-of-scope for this presentation!

Slide 19

Slide 19 text

© Copyright 2000-2016 TIBCO Software Inc. Alternatives for Log Analytics Time to Market Log Analytics Product Middleware Suite (includes Log Analytics Product) Slow Fast Log Analytics Framework Includes Includes

Slide 20

Slide 20 text

© Copyright 2000-2016 TIBCO Software Inc. Alternatives for Log Management (no complete list) Open Source Closed Source SaaS On Premise

Slide 21

Slide 21 text

© Copyright 2000-2016 TIBCO Software Inc. “Cloud washing is the purposeful and sometimes deceptive attempt by a vendor to rebrand an old product or service by associating the buzzword ‘cloud’ with it [and offering it via a public cloud infrastructure].” On Premise vs. Cloud Washing vs. Cloud Native / SaaS http://searchcloudstorage.techtarget.com/definition/cloud-washing !

Slide 22

Slide 22 text

© Copyright 2000-2016 TIBCO Software Inc. Alternatives for Log Management (no complete list) Open Source Closed Source SaaS On Premise Open Source Framework

Slide 23

Slide 23 text

© Copyright 2000-2016 TIBCO Software Inc. Alternatives for Log Analytics Time to Market Log Analytics Product Middleware Suite (includes Log Analytics Product) Slow Fast Log Analytics Framework Library (Java, .NET, Python) Operators (Collect, Filter, Sort, Aggregate, Alert) Scalability (Horizontal and Vertical, Fail Over) Connectivity (Standards, Technologies, Products) User Interface (Basic Monitoring and Reporting)

Slide 24

Slide 24 text

© Copyright 2000-2016 TIBCO Software Inc. ELK Stack (Logstash, Elasticsearch, Kibana) Characteristics • Data Sources • Features • Frequency • Deployment Options • Pricing Facts Combination of Open Source Frameworks • Quick getting started for developers with ”Hello World” example • More complex Enterprise setup and usage (coding and configuration) • AWS offering available for Elastic and Kibana, not Logstash) Targeted for developers • Mainly focused on helping developers detect and fix errors in their apps • Entirely open source, i.e. free to use • Commercial support available • Combination of different mature frameworks Less enterprise-focused • Very basic user interface • Based on ElasticSearch, Logstash and Kibana • Plenty of connectors + easy to extend (with coding) • Sufficient reporting (i.e. dashboards), but missing visual analytics

Slide 25

Slide 25 text

© Copyright 2000-2016 TIBCO Software Inc. Live Demo ELK Stack (Open Source) in Action…

Slide 26

Slide 26 text

© Copyright 2000-2016 TIBCO Software Inc. graylog Characteristics • Data Sources • Features • Frequency • Deployment Options • Pricing Facts Combination of Open Source Frameworks • Quick getting started for developers with ”Hello World” example • More complex Enterprise setup and usage (coding and configuration) Targeted for developers • Mainly focused on helping developers detect and fix errors in their apps • Entirely open source, i.e. free to use • Commercial support available • Young solution (1.0 GA in 2015) – not as mature as others yet Less enterprise-focused • Very basic user interface • Based on MongoDB, ElasticSearch and Apache Kafka • Marketplace for connectors + easy to extend (with coding) • Missing extensive reporting and analytics

Slide 27

Slide 27 text

© Copyright 2000-2016 TIBCO Software Inc. Alternatives for Log Management (no complete list) Open Source Closed Source SaaS On Premise SaaS Cloud Service

Slide 28

Slide 28 text

© Copyright 2000-2016 TIBCO Software Inc. papertrail Facts Easy setup and very simple to use • Targeted for developers • „Very small“ free version available (100MB/month) • Cheap pricing, e.g. 1GB/month: 5 USD; 1000GB/month: 875 USD Less enterprise-focused • Stripped down and basic log analyzer • Mostly text-based • User interface is very similar to looking at a log on your machine • No advanced integrations, predictive or reporting capabilities SaaS • Upload (masses of) data to the cloud • Worse latency than on-premise solutions • Efforts to anonymize sensitive data Characteristics • Data Sources • Features • Frequency • Deployment Options • Pricing

Slide 29

Slide 29 text

© Copyright 2000-2016 TIBCO Software Inc. Live Demo Papertrail (SaaS) in Action…

Slide 30

Slide 30 text

© Copyright 2000-2016 TIBCO Software Inc. loggly 30 Facts Easy setup and very simple to use • Custom performance and DevOps dashboards Targeted for developers and DevOps • Pricing from 50 USD to some thousand USD • Feature-limited free version available (200MB/day) Less enterprise-focused • Focus especially on logs from application servers • Anything beyond that has to be built • Find and fix operational problems • Primary use cases are for troubleshooting / customer support scenarios SaaS • Upload (masses of) data to the cloud • Worse latency than on-premise solutions • Efforts to anonymize sensitive data Characteristics • Data Sources • Features • Frequency • Deployment Options • Pricing

Slide 31

Slide 31 text

© Copyright 2000-2016 TIBCO Software Inc. Alternatives for Log Analytics Time to Market Log Analytics Product Middleware Suite (includes Log Analytics Product) Slow Fast Log Analytics Framework Library Operators Scalability Connectivity User Interface Visual Configuration (Analysis, Correlation, Alerting) Simulation (Feed Testing, Test Generation) User Interface (Advanced Monitoring, Reporting, Analytics) Maturity (product, 24h support, consulting)

Slide 32

Slide 32 text

© Copyright 2000-2016 TIBCO Software Inc. sumologic Characteristics • Data Sources • Features • Frequency • Deployment Options • Pricing Facts • Easy setup and simple to use • Targeted for developer, security teams, business – Pricing from 90 USD to some thousand USD – Feature-limited free version available (500MB/day) • Most enterprise-focused SaaS product – Founded as „Splunk for the Cloud“ – Most feature-rich SaaS solution – Many features of „enterprise grade solutions“ • SaaS – Upload (masses of) data to the cloud – Worse latency than on-premise solutions – Efforts to anonymize sensitive data

Slide 33

Slide 33 text

© Copyright 2000-2016 TIBCO Software Inc. Alternatives for Log Management (no complete list) Open Source Closed Source SaaS On Premise Enterprise Product

Slide 34

Slide 34 text

© Copyright 2000-2016 TIBCO Software Inc. Splunk Characteristics • Data Sources • Features • Frequency • Deployment Options • Pricing Facts • Complex setup (especially for larger scale) – SaaS Offering for getting started quickly in the public cloud • Simple to use for the end user • Targeted for all use cases (including SIEM) – Not just for log files, but also other events / messaging – „Enterprise Pricing“ - Very High pricing (for medium and high volume) – No access to your data if limit is reached! (contrary to other vendors) • Enterprise Class – Market leader – Most feature-rich solution – Moving into ITOA market – No hardware appliance (just via partner „SBOX“) – Just log analytics, no complete middleware suite

Slide 35

Slide 35 text

© Copyright 2000-2016 TIBCO Software Inc. Alternatives for Log Analytics Time to Market Log Analytics Product Middleware Suite (includes Log Analytics Product) Slow Fast Log Analytics Framework Library Operators Scalability Connectivity User Interface Visual Configuration Simulation Advanced User Interface Maturity Out-of-the-Box Integration and Support (Messaging, ESB, MDM, etc.)

Slide 36

Slide 36 text

© Copyright 2000-2016 TIBCO Software Inc. IBM QRadar Characteristics • Data Sources • Features • Frequency • Deployment Options • Pricing Facts • Complex setup • Simple to use for the end user • Targeted for all use cases (including SIEM) – Not just for log files, but also other events / messaging – „Enterprise Pricing“ - High pricing (for medium and high volume) • Enterprise Class – Part of a complete middleware suite – Very feature-rich solution – Available as SaaS offering – Available as hardware appliance – Moving into ITOA market

Slide 37

Slide 37 text

© Copyright 2000-2016 TIBCO Software Inc. TIBCO LogLogic 37 © Copyright 2000-2015 TIBCO Characteristics • Data Sources • Features • Frequency • Deployment Options • Pricing Facts • Easy setup (small and large scale) • Simple to use for the end user – Powerful user interface – Not as powerful as Splunk or IBM QRadar • Targeted for all use cases – Not just for log files, but also other events / messaging – „Enterprise Pricing“ - Low costs compared to competitors – „Always on“ – even after limit is reached • Enterprise Class – Part of a complete middleware suite – Most advanced analytics (via TIBCO Spotfire add-on) – Available as hardware appliance

Slide 38

Slide 38 text

© Copyright 2000-2016 TIBCO Software Inc. Live Demo TIBCO LogLogic (Enterprise) in Action…

Slide 39

Slide 39 text

© Copyright 2000-2016 TIBCO Software Inc. Message Pattern Generation with TIBCO LogLogic Web UI Discover Unstructured Data à Generate Pattern à Validate à Apply Pattern for Structured Data

Slide 40

Slide 40 text

© Copyright 2000-2016 TIBCO Software Inc. Spoilt for Choice Does it make sense to combine different Log Analytics solutions?

Slide 41

Slide 41 text

© Copyright 2000-2016 TIBCO Software Inc. Example: TIBCO LogLogic à „A Splunk Management Solution“ http://www.tibco.de/assets/blt0da0bc2ea7d5b9b7/solution-brief-tibco-loglogic-splunk-management-solution.pdf

Slide 42

Slide 42 text

© Copyright 2000-2016 TIBCO Software Inc. Conclusion - Market Analysis Log Management • SaaS à Easy to setup and use, but cloud cons (not flexible, public cloud) • Open Source à Free and extendable, but coding / config instead of tooling • Enterprise à Most feature-rich and powerful tooling, but more expensive IT Operations Analytics (ITOA) • Enterprise vendors entering this market these days à Extending existing solutions • Focus on more complex correlations, real time processing, predictive monitoring

Slide 43

Slide 43 text

© Copyright 2000-2016 TIBCO Software Inc. Market Analysis Segment CAG R Incumbents Challengers Log Management 15% Splunk, TIBCO LogLogic, etc. Open Source (Graylog, “ELK Stack”) SIEM RSA, ArcSight, LogRhythm Splunk, MSSPs (Managed Security Service Provider) ITOA (1.6B) 100% TIBCO LogLogic, Splunk, SumoLogic, AppDynamics, NewRelic APM (2.9B) 10% AppDynamics, NewRelic ITOM (19B) 4% IBM, CA, BMC, MS, HP AppDynamics, NewRelic, Chef, Puppet, Docker, CloudFoundry (2.9B) Rapidly Emerging and Evolving, Encompasses Many Segments Traditional: Log Management, IT Operations Monitoring (ITOM), Security (SIEM) Current: IT Operations Analytics (ITOA), Application Performance Management (APM) Future: DevOps & Continuous Improvement

Slide 44

Slide 44 text

© Copyright 2000-2016 TIBCO Software Inc. Log Analytics is a very stable and established market. ITOA enhances Log Analytics to allow more powerful real time correlation.

Slide 45

Slide 45 text

© Copyright 2000-2016 TIBCO Software Inc. Agenda • Distributed Microservice Log Events • Introduction to Log Analytics • Market Overview • Relation to other Big Data Components

Slide 46

Slide 46 text

© Copyright 2000-2016 TIBCO Software Inc. When to use Log Analytics Time of Action Historical Data Near Real Time Real Time Predictive IT Operations Analytics (ITOA) Log Management Data Warehouse Streaming Analytics Data Discovery „Data Lake“ (Various Apache Hadoop Frameworks) Log Analytics Visual Real Time Analytics (There is some overlapping!)

Slide 47

Slide 47 text

© Copyright 2000-2016 TIBCO Software Inc. Streaming Analytics: Act on Critical Business Moments

Slide 48

Slide 48 text

© Copyright 2000-2016 TIBCO Software Inc. Streaming Analytics Voltage Temperature Vibration Device history Temporal analytic: “If vibration spike is followed by temp spike then voltage spike [within 4 hours] then flag high severity alert.”

Slide 49

Slide 49 text

© Copyright 2000-2016 TIBCO Software Inc. Live Demo Apache Flink (Open Source), StreamSets (SaaS) and TIBCO StreamBase / Live Datamart (Enterprise) in Action…

Slide 50

Slide 50 text

© Copyright 2000-2016 TIBCO Software Inc. Log Management / ITOA vs. Hadoop and Log Collectors Why not use just a Data Lake (Apache Hadoop)? You can also store and analyze all data on its cluster! Why not just use Log Collectors and forward data directly without Log Analytics “in the middle”? • In general: Fluentd, Logstash • Apache Hadoop specific: Apache Flume or Apache Kafka DIFFERENTIATORS OF LOG MANAGEMENT / IT OPERATIONS ANALYTICS • Integrated solution for data analysis (tooling, consulting, support) • Built exactly for these use cases (Log Management, ITOA) • Involves data indexing, data processing (querying) and data visualization by means of dashboards and other tools • Tooling for Ease-of-Use and Time-to-Market • Graphical user interface for operational intelligence • There is no “one size fits all” tool to solve all your problems

Slide 51

Slide 51 text

© Copyright 2000-2016 TIBCO Software Inc. Relation to other Big Data Components • Data Warehouse – Historical data – Only structured data – Reporting • Apache Hadoop – Historical and near real time data – All data – Storage and Analytics (e.g. MapReduce, Spark) • NoSQL – Specific Storage (graph, document, key/value, ...) – Search (e.g. ElasticSearch) • Stream Processing – Especially real time data • Predictive Analytics – R, Machine Learning, SAS, etc. – Combined with the others! Log Analytics Forward Forward Parse, Filter, Structure, Forward Parse, Filter, Structure, Forward Parse, Filter, Structure, Forward

Slide 52

Slide 52 text

© Copyright 2000-2016 TIBCO Software Inc. Trend: Machine Learning applied to Log Analytics “… when the log-data patterns cannot be precisely defined in advance, unsupervised and reinforcement learning may be appropriate [to find outliers or anomalies].” http://www.infoworld.com/article/2608064/big-data/big-data-log-analysis-thrives-on-machine-learning.html “… They combined the aggregation of log data, the metadata that is created any time IT systems are used, along with high-level analytics and machine learning tools … … give context to the ’needle in a haystack’ problem …” http://www.forbes.com/sites/benkepes/2015/03/27/using-log-data-and-machine-learning-to-weed-out-the-bad- guys

Slide 53

Slide 53 text

© Copyright 2000-2016 TIBCO Software Inc. Key Takeaways • Log Analytics is needed to monitor distributed microservice architectures • Consolidation of broad range of events is key to enabling business insights • Log Analytics is complementary to other Big Data components

Slide 54

Slide 54 text

Questions? Please contact me! Kai Waehner Technology Evangelist kontakt@kai-waehner.de @KaiWaehner www.kai-waehner.de LinkedIn