1
The Myths & Realities Surrounding Hadoop
Rob Anderson
VP Systems Engineering
2
Sales, SCM, CRM, Public, Web Logs, Production Data, Sensor Data, Click Streams, Location, Social Media, Billing
Enterprise Data Hub
Hadoop Changes Analytics
“Simple algorithms and lots of data trump complex models”
Halevy, Norvig, and Pereira, Google, IEEE Intelligent Systems
3
4
5
Data Warehouse
Volume, Variety, Velocity
6
7
Big Data is hard to move… because it’s BIG
8
What was the genius of Hadoop?
• Fueling an industry revolution by providing infinite capability to store and process big data
• Expanding analytics across data types
• Compelling economics: 20 to 100X more cost-effective than alternatives
9
10
Random Writing in MapR
S1  S2  S3  S4  S5
S1, S2, S4
S1, S3
S1, S4, S5
S2, S4, S5
S3
Client writing data
CLDB
Ask for 64M block
Create cont.
Picks master and 2 replica slaves
Write next chunk to S2
S2, S3, S5
attach
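The write path in the diagram (the client asks the CLDB for a block, the CLDB creates a container and picks a master and 2 replica slaves, and the client writes the next chunk) can be sketched as a toy simulation. This is only an illustration under stated assumptions, not MapR's actual interface: `ToyCLDB`, `create_container`, and `write` are hypothetical names, placement is assumed to be random, and one container per chunk is a simplification. The slide does not show how data reaches the replicas, so the sketch simply copies each chunk to all three nodes.

```python
import random

REPLICATION = 3  # master plus 2 replica slaves, as on the slide


class ToyCLDB:
    """Hypothetical stand-in for the CLDB; not MapR's actual interface."""

    def __init__(self, nodes, seed=0):
        self.nodes = list(nodes)
        self.containers = {}             # container id -> [master, slave, slave]
        self._rng = random.Random(seed)  # seeded so runs are repeatable
        self._next_id = 0

    def create_container(self):
        # Assumption: placement is a random choice of distinct nodes.
        chosen = self._rng.sample(self.nodes, REPLICATION)
        cid = self._next_id
        self._next_id += 1
        self.containers[cid] = chosen
        return cid, chosen


def write(cldb, storage, data, chunk_size):
    """Split data into chunks and store each on a master and its replicas."""
    placements = []
    for offset in range(0, len(data), chunk_size):
        chunk = data[offset:offset + chunk_size]
        # Corresponds to "Ask for 64M block" / "Create cont." on the slide.
        cid, (master, *slaves) = cldb.create_container()
        # Simplification: copy the chunk to all three nodes directly.
        for node in (master, *slaves):
            storage.setdefault(node, []).append((cid, chunk))
        placements.append((cid, master, slaves))
    return placements


cldb = ToyCLDB(["S1", "S2", "S3", "S4", "S5"])
storage = {}
# A tiny chunk size stands in for the 64M block on the slide.
placements = write(cldb, storage, b"x" * 10, chunk_size=4)
for cid, master, slaves in placements:
    print(f"container {cid}: master={master}, replicas={slaves}")
```

Each chunk ends up on three distinct nodes, so any single node can be lost without losing data in this toy model.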