Slide 1

Slide 1 text

Big data analytics: ELK Stack by Craciun Daniela

Slide 2

Slide 2 text

What is the ELK Stack? The ELK stack (now known as the Elastic Stack) consists of Elasticsearch, Logstash, and Kibana and they work exceptionally well together.

Slide 3

Slide 3 text

No content

Slide 4

Slide 4 text

Logstash pipeline data to and from anywhere via a tool known as an ETL pipeline it also allows us to fetch, transform, and store into ElasticSearch

Slide 5

Slide 5 text

Elasticsearch based on Apache Lucene "schema-less" database easy to scale horizontally powerful search capabilities

Slide 6

Slide 6 text

What does "schema-less" mean? You have the ability to pass lightweight JSON at the database and it will update the schema as you go along.

Slide 7

Slide 7 text

ElasticSearch indexes every single eld, so you can search anything (including full-text search) and it will aggregate and group the data. If you want to register a new node to a cluster then it is a matter of installing ElasticSearch on a machine and editing a con g le, which is a pretty simple ask. ElasticSearch then takes care of spreading data around and splitting out requests over multiple servers.

Slide 8

Slide 8 text

Kibana web-based dashboarding and data analysis tool that leverages ElasticSearch's search capabilities so you can visualise data

Slide 9

Slide 9 text

Kibana interface

Slide 10

Slide 10 text

Why ELK stack? It is great at handling large numbers of logs and requires relatively little con guration Essentially the strengths of this technology stack are ideally suited to log-driven big data projects

Slide 11

Slide 11 text

The end!