The most intriguing data scientist in the universe just endorsed this kickstarter project which is a complete, practical, hands-on course which comprises of step-by-step video tutorials, a book, practice exercises and downloadable code samples covering everything you need to know about aggregating, processing, searching and visualizing log data generated at high volume and high velocity using only open source software.
Course Contents
# Installing Java and Configuring Environment Variables
# Introduction to Apache Flume Architecture and Inner Workings
# Introduction to Logstash Architecture and Inner Workings
# Downloading, Installing and Configuring Apache Flume
# Downloading, Installing and Configuring Logstash
# Overview of Different Forms of Capturing Application Log Data
# Parsing Raw Data to Extract Metadata and Useful Information
# Strategies and Techniques for Buffering Log Events before Storage
# Using Elasticsearch and HDFS as Centralized Datastores
# Installing and Configuring ElasticSearch
# Setting up Hadoop and Configuring HDFS
# Installing Kibana as a User Interface to the ElasticSearch Indicies
# Moving Log Data into ElasticSearch and HDFS
# Introduction to Hadoop, HDFS and MapReduce
# Using Hadoop to Process the Log data in HDFS with MapReduce jobs
# Introduction to Lucene Query Syntax
# Quering ElasticSearch to Retrieve Information
# Using D3.js to Visualize ElasticSearch Query Results
# Using D3.js to Visualize Results from MapReduce jobs
# Setting up Dashboards in Kibana to Visualize Log Events in Realtime.
# Brining It All Together with Sample Projects
# Design Patterns, Best Practices, Tips and Strategies for Scaling Log data Aggregation, Processing, Searching and Visualization
http://www.kickstarter.com/projects/1368497725/massive-log-data-aggregation-processing-and-visual