Slide 1

Slide 1 text

Big Data & Analytics - Cost and Value
Valentin Zacharias, codecentric.de
Karlsruhe, 22.5.2014

Slide 2

Slide 2 text

200 consultants, enthusiasts, engineers, craftsmen, experts, nerds
•  build systems to create value from big data
•  help businesses scale software
•  realize agile software development – with our customers’ in-house development and in custom software development
We are codecentric.

Slide 3

Slide 3 text

Motivation, Data Deluge, Datification, Volume, Variety, Velocity, Data Driven Decision, n=All, t=now, Cost, Data OS, Hadoop, Enterprise Data Lake, Distributed Search, ElasticSearch, NoSQL et al, Cassandra, mongoDB, Riak, Pivotal, cloudera, Distributed Stream Processing, Storm, In Memory Computing, SAP Hana, Spark, EXASOL, Reactive Programming, Moore’s Law, Cloud Computing, Data Value Chain, Value, Analytics, Descriptive, Visual, Causal, Predictive, Prescriptive, Application Areas, Operational Excellence, Customer Intimacy, Product Leadership, 360° Everything, Business Models, Data Driven Business, Data for Products, Products with Data, Data as Business
My Goal today:
•  You’ve heard the most important Big Data terms (see above) and can fit them into one coherent picture of the Big Data world
•  You know where to start (if you want to)

Slide 4

Slide 4 text

Motivation
The Data Deluge and the Need for (more) Data Driven Decisions

Slide 5

Slide 5 text

The Data Deluge
•  Cheap networked sensors, social web, digital workflows ... and the Datification of everything lead to:
    – Datasets of much larger size (Volume)
    – Datasets with many different formats (Variety)
    – Datasets that change fast (Velocity)

Slide 6

Slide 6 text

The need for (more) Data Driven Decisions
Competition and the expectation of further progress require the use and fast processing of this data

Slide 7

Slide 7 text

Further progress in medicine rests on understanding complex relationships and individualized treatments.

Slide 8

Slide 8 text

Further productivity gains in farming will largely have to come from the optimized use of machines, fertilizer and pesticides

Slide 9

Slide 9 text

With every product available everywhere at the click of a button, customizing products and services for ever smaller customer groups becomes paramount

Slide 10

Slide 10 text

Unpredictable fluctuations in energy production (and demand) make it necessary to optimize the energy network in realtime.

Slide 11

Slide 11 text

Innovations in logistics such as “Same-Day-Delivery” rest on real time planning and optimization.

Slide 12

Slide 12 text

n=All & t=now
It is no longer enough to plan for averages (time, space, customers); it is necessary to optimize for the individual, the precise location and the present moment

Slide 13

Slide 13 text

No content

Slide 14

Slide 14 text

Cost
The Big Data technologies used to lower the cost of building systems that do more complex processing with more data, faster.

Slide 15

Slide 15 text

No content

Slide 16

Slide 16 text

Volume & Variety
Data OS – Hadoop & Enterprise Data Lake
NoSQL et al.
Distributed Search

Slide 17

Slide 17 text

The Hadoop ecosystem provides a ‘data operating system’ with applications for efficient, diverse, very scalable data storage and processing.

Slide 18

Slide 18 text

The use of Hadoop can radically lower costs for many warehousing / data storage scenarios
BigData – what does it really cost? Winter Corporation

Slide 19

Slide 19 text

Enterprise Data Lake / Data Hub: The vision to transform business IT architecture with Hadoop through a central data store feeding all DWHs – simplifying DWH/ETL architecture and radically speeding up the creation of new reports/dashboards. …

Slide 20

Slide 20 text

Distributed (No)SQL databases scale easily to very large datasets and very high load, and do not need predefined schemata (they deal easily with Variety)

Slide 21

Slide 21 text

The Elasticsearch Stack (with Logstash and Kibana) provides an end-to-end solution for the management of (semi-structured) textual data (in particular logs)

Slide 22

Slide 22 text

Velocity
•  With Distributed Stream Processing from hours to seconds
•  With In-Memory computing from minutes to seconds
•  With Reactive Programming from seconds to microseconds

Slide 23

Slide 23 text

Distributed Stream Processing systems enable the cheap creation of systems that create realtime views from fast moving data.
Apache Storm
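The idea of a "realtime view" over fast moving data can be illustrated with a minimal single-process sketch. This is not Apache Storm's API: the class name `SlidingWindowCounter` and its methods are hypothetical, and the sketch assumes events arrive with non-decreasing timestamps. A distributed system would partition the same logic across many machines.

```python
from collections import Counter, deque


class SlidingWindowCounter:
    """Keeps per-key event counts for the last `window_seconds` seconds."""

    def __init__(self, window_seconds):
        self.window_seconds = window_seconds
        self.events = deque()    # (timestamp, key) pairs in arrival order
        self.counts = Counter()  # the continuously updated view

    def add(self, timestamp, key):
        """Ingest one event, then drop events that fell out of the window."""
        self.events.append((timestamp, key))
        self.counts[key] += 1
        self._expire(timestamp)

    def _expire(self, now):
        # Assumes non-decreasing timestamps, so the oldest event is at the left.
        while self.events and self.events[0][0] <= now - self.window_seconds:
            _, old_key = self.events.popleft()
            self.counts[old_key] -= 1
            if self.counts[old_key] == 0:
                del self.counts[old_key]

    def view(self):
        """Return the current counts as a plain dict."""
        return dict(self.counts)
```

Each incoming event updates the view immediately, so a dashboard reading `view()` sees counts that are at most one event behind, instead of waiting for a batch job.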

Slide 24

Slide 24 text

Building on speed improvements from doing computations “In-Memory”, these systems reduce the time for large analysis tasks from minutes to (milli-)seconds

Slide 25

Slide 25 text

Reactive Programming techniques from domains where microseconds count (e.g. High Frequency Trading, Real Time Advertising, intrusion prevention, sensors with high data rates) are becoming mainstream and easy to use.

Slide 26

Slide 26 text

Moore’s Law, network equivalent, Cloud Computing, Data Value Chain
Moore’s Law, Cloud Computing, advances in networking technology and emerging data value chains are trends that further increase the speed at which costs are falling.

Slide 27

Slide 27 text

Value
Big Data Technologies add value by making more patterns (in data) visible and useful – through Analytics

Slide 28

Slide 28 text

No content

Slide 29

Slide 29 text

Analytics Types
•  Descriptive: What is and what has been?
    – Special Cases: Visual Analytics, Causal Analytics
•  Predictive: What will be?
•  Prescriptive: What is my optimal course of action?
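The three types can be contrasted on a toy demand series. The sketch below is only an illustration under simple assumptions: the function names, the linear-trend forecast and the profit model are hypothetical choices, not methods named in the talk.

```python
from statistics import mean


def describe(history):
    """Descriptive: summarize what is and what has been."""
    return {"mean": mean(history), "min": min(history), "max": max(history)}


def predict_next(history):
    """Predictive: extrapolate a least-squares linear trend by one period."""
    n = len(history)
    xs = range(n)
    x_mean, y_mean = mean(xs), mean(history)
    slope = sum((x - x_mean) * (y - y_mean) for x, y in zip(xs, history)) / sum(
        (x - x_mean) ** 2 for x in xs
    )
    intercept = y_mean - slope * x_mean
    return intercept + slope * n


def prescribe(forecast, options, unit_cost, unit_price):
    """Prescriptive: pick the order quantity with the highest expected profit."""

    def profit(quantity):
        # Revenue is capped by forecast demand; every ordered unit costs money.
        return unit_price * min(quantity, forecast) - unit_cost * quantity

    return max(options, key=profit)
```

For example, `predict_next([10, 12, 14, 16])` extends the trend to the next period, and `prescribe` then turns that forecast into a concrete decision among the given order quantities.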

Slide 30

Slide 30 text

Descriptive Analytics: Integrating data and making it accessible to understand past and present, e.g. the success of tried treatments in similar patients.
Flatiron Health

Slide 31

Slide 31 text

Visual Analytics: Aggregating information into forms to which human knowledge and intuition can be applied, e.g. to understand election fraud.

Slide 32

Slide 32 text

Part with defect
Causal Analytics: Using data to understand the root cause of observed phenomena such as defects (also known as Root Cause Analysis)

Slide 33

Slide 33 text

BlueYonder forward Demand
Predictive Analytics: Using patterns in data to predict the future, e.g. future demand

Slide 34

Slide 34 text

Prescriptive Analytics: Find an optimal course of action, e.g. an optimal sequence for re-starting flights after a large disturbance
Taleris

Slide 35

Slide 35 text

Analytics Application Areas
•  Operational Excellence
•  Customer Intimacy
•  Product Leadership

Slide 36

Slide 36 text

Machine maintenance
Operational Excellence: Use of data to increase the efficiency in the creation of products and services, e.g. through proactive maintenance.
Metropolitan Transportation

Slide 37

Slide 37 text

Tailor
Customer Intimacy: Use of data to better tailor products and services to customers, e.g. instantly displaying a prospective customer’s value in order to tailor offers accordingly.
Meena Kadri @ Flickr

Slide 38

Slide 38 text

Product Leadership: Use of data to create products of unmatched quality, e.g. through the systematic collection of DRO data for all cars throughout their lifecycle.
Volvo

Slide 39

Slide 39 text

360° Everything
The most common pattern is the integration of large amounts of diverse information for each product/machine/customer into one coherent real time picture

Slide 40

Slide 40 text

Business Models
•  Data Driven Business
•  Data + Product
    – Data for Products
    – Products with Data
•  Data as Business

Slide 41

Slide 41 text

Data for Products: Data driven services that enable the optimal use of products sold, e.g. machines and services to optimize agricultural yield down to the square foot.
Monsanto integrated farming systems

Slide 42

Slide 42 text

... or the optimal deployment and maintenance of trucks
Navistar

Slide 43

Slide 43 text

Products with Data: Products offering functionality that depends strongly on Analytics, e.g. an alarm clock that wakes you based on sensor analytics
Withings Aura

Slide 44

Slide 44 text

Data as Business: Profit from harnessing the data collected through other services, e.g. data for urban planning from fitness devices
Strava

Slide 45

Slide 45 text

Drilling guys
or collected specifically to be sold …
Drilling Info

Slide 46

Slide 46 text

Next

Slide 47

Slide 47 text

People & Skills
•  Build up capability in your organization to harness Big Data technologies to build data management infrastructures that are cheaper, simpler and more powerful
•  Through
    – Pilot projects with these technologies
    – Guided introductions such as our own “Big Data Hands On” workshop

Slide 48

Slide 48 text

Big Data Hands On Workshop
•  Learning about Big Data by building a Big Data solution guided by our experts
    – Day 1: Data integration with Logstash and SpringXD
    – Day 2: Descriptive Analytics and Exploration with Elastic Search/Kibana and Hive/Hadoop
    – Day 3: Predictive Analytics with Mahout
•  On premise, contents are customizable.

Slide 49

Slide 49 text

Analytics Use Cases
•  Find ways data can improve your decisions, those of your customers, your suppliers or ?
    – By reading
    – Or in workshops (with vendors, independent consultants, strategy consultants)
Cukier: Big Data

Slide 50

Slide 50 text

connect / download slides at www.vzach.de
Motivation, Data Deluge, Datification, Volume, Variety, Velocity, Data Driven Decision, n=All, t=now, Cost, Data OS, Hadoop, Enterprise Data Lake, Distributed Search, ElasticSearch, NoSQL et al, Cassandra, mongoDB, Riak, Pivotal, cloudera, Distributed Stream Processing, Storm, In Memory Computing, SAP Hana, Spark, EXASOL, Reactive Programming, Moore’s Law, Cloud Computing, Data Value Chain, Value, Analytics, Descriptive, Visual, Causal, Predictive, Prescriptive, Application Areas, Operational Excellence, Customer Intimacy, Product Leadership, 360° Everything, Business Models, Data Driven Business, Data for Products, Products with Data, Data as Business