Upgrade to Pro — share decks privately, control downloads, hide ads and more …

Connecting the Dots... Tracing of Events within ING with Elasticsearch

Elastic Co
October 29, 2015

Connecting the Dots... Tracing of Events within ING with Elasticsearch

Coming from a world of isolated departments and applications, ING has now embraced the omnichannel approach. Teams are working closely together, but lacking the visibility needed across systems. This is how ING solves the problem of tracing events in a complex software environment.

Stephane Rouault and Christiaan Douma | Elastic{ON} Tour Amsterdam | October 29, 2015

Elastic Co

October 29, 2015
Tweet

More Decks by Elastic Co

Other Decks in Technology

Transcript

  1. Connecting the dots… Christiaan Douma - Dev Engineer, Stephane Rouault

    - Dev Engineer Tracing of events within ING Amsterdam • 29th October 2015
  2. Market leaders Benelux Growth markets Commercial Banking Challengers 5 European

    map Full-service bank Very strong European base Ranked 7th Largest bank of NL 150+ DevOps teams
  3. •  Engineer •  Software Developer •  7 years of experience

    in IT •  @ING since: 01-03-2011 •  Engineer •  Jack of all trades •  11 years of experience in IT •  @ING since: 01-01-2011 6 A little bit about us Stephane Rouault Christiaan Douma
  4. 9 Happy Flow A simple illustration of a transaction Frontend

    Middleware Backend Request ------------ ------------ ------------ ------------ Reply ------------ ------------ ------------ ------------
  5. 10 Error occurs But what if something goes wrong? Frontend

    Middleware Backend Request ------------ ------------ ------------ ------------ ERROR: Some error at … What happened ? Where did it happen ? What has been affected?
  6. 11 What usually happens next .. Frontend Middleware Backend Log

    Log Log DevOps teams searching their logs..
  7. •  Finding the owner of the problem •  Access to

    logs is restricted to system owners •  Difficult to link events across systems •  Time Consuming Process 12 Issues
  8. 16 Unique Correlation Identifier Frontend Middleware Backend Request: UUID ------------

    ------------ ------------ Link events across systems, extra context info in the request
  9. 17 Using Elasticsearch to combine everything Frontend Middleware Backend Log

    Log Log Log Log Log Log Log Log Log Log Log Elasticsearch
  10. Event: UUID Context Duration Status Event: UUID Context Duration Status

    Event: UUID Context Duration Status 18 Event logging in Elasticsearch : Tracing Frontend Middleware Backend Request: UUID ------------ ------------ ------------ Elasticsearch
  11. 24 Free text search Shared service within ING Data lake

    for operational events Scalability No read/write interference Flexible data model High Available How did Elasticsearch help us?
  12. •  Peak number events per second: ~650 events per second

    •  Avg number events per second: ~300 events per second •  Peak number of events per day: ~24 million events •  Avg number of events per day ~11 million events •  Growth of the number of events: ~40% more than 3 months ago.. •  Growth of the number of operations: ~12% more than 3 months ago.. •  Longest chain of events: >30 •  Number of DevOps teams using Tracing : ~24 teams •  Number of architecture domains in Tracing ~5 Some statistics about Tracing 28
  13. •  Peak number events per second ~2500 events per second

    •  Peak number of events per day ~120 million events •  Index per day takes up to 100-130 Gb •  Retaining 30 days of data (1x replicated) ~4,4 TB •  Number of Elasticsearch queries 30 tps •  Avg responsetime (for most queries) 1-2 secondes •  Number of kibana dashboards: ~740 and growing fast. •  Number of shards: 5 Some statistics about Elasticsearch 29
  14. •  Multi-source event correlation; combining tracing with: •  System logs

    •  Alert logs •  System metrics •  Incidents logs •  Deployment logs •  Etc.. •  Elasticsearch hook up to a graph database for realtime graphical insight •  Forecasting of (business) usage •  Business chain alerting with watcher or other tool Roadmap 34