Motivation
Source: IDC’s Digital Universe Study (EMC), June 2011
Data is growing
Slide 5
Slide 5 text
Motivation
Source: IDC’s Digital Universe Study (EMC), June 2011
Data is growing
Slide 6
Slide 6 text
Motivation
Source: IDC’s Digital Universe Study (EMC), June 2011
Data is growing
Slide 7
Slide 7 text
Streaming Data
Big Data & Real Time
Slide 8
Slide 8 text
Big Data
McKinsey Global Institute (MGI) Report on Big Data, 2011.
Big data refers to datasets whose size is beyond
the ability of typical database software tools to
capture, store, manage, and analyze.
Slide 9
Slide 9 text
Big Data
McKinsey Global Institute (MGI) Report on Big Data, 2011.
Big data refers to datasets whose size is beyond
the ability of typical database software tools to
capture, store, manage, and analyze.
Slide 10
Slide 10 text
Methodology
Sampling and distributed systems
Slide 11
Slide 11 text
Methodology
Paolo Boldi
Big Data does not need big machines,
it needs big intelligence
Slide 12
Slide 12 text
Real time analytics
We want to analyze what is happening now.
Slide 13
Slide 13 text
Real time analytics
We want to analyze what is happening now.
Slide 14
Slide 14 text
Time and Memory
Number 8 Wire Mentality
Time and memory are the resource dimensions of
the process.
Slide 15
Slide 15 text
Time and Memory
Time and memory are the resource dimensions of
the process.
Applications
sensor data: industry, cities
telecomm data
social networks: twitter, facebook, yahoo
marketing: sales business
Data may come from: humans, sensors, or
machines.