Big Data and Analytics - Cost and Value

Big Data and Analytics - Cost and Value

Introduces all important Big Data Analytics concepts in one coherent picture. Also gives some starting points on where to go next.

B911d14451f50b883b4c4a122226b7f4?s=128

Valentin

May 22, 2014
Tweet

Transcript

  1. Big$Data$&$Analy-cs$ 0$ Cost$and$Value$ Valen-n$Zacharias,$codecentric.de$ Karlsruhe,$22.5.2014$

  2. 200#consultants,#enthusiasts,#engineers,#cra2smen,#experts,#nerds# •  build#systems#to#create#value#from#big#data# •  help#businesses#scale#so2ware# •  realize#agile#so2ware#development#–#with#our#customer‘s#in#house# development#and#in#custom#so2ware#development# We#are#codecentric.#

  3. Mo-va-on,$Data$Deluge,#DaBficaBon,#Volume,#Variety,#Velocity,#Data$Driven$ Decision,$n=All,#t=now,#Cost,$Data$OS,#Hadoop,#Enterprise$Data$Lake,# Distributed$Search,#ElasBcSearch,#NoSQL$et$al,#Cassandra,#mongoDB,#Riak,#Pivotal,# cloudera,#Distributed$Stream$Processing,#Storm,#In$Memory$Compu-ng,#SAP# Hana,#Spark,#EXASOL,#Reac-ve$Programming,#Moore’s#Law,#Cloud#CompuBng,#Data# Value#Chain,#Value,#Analy-cs,#DescripBve,#Visual,#Causal,#PredicBve,#PrescripBve,# Applica-on$Areas,#OperaBonal#Excellence,#Customer#InBmacy,#Product#Leadership,# 360°$Everything,$Business$Models,#Data#Driven#Business,#Data#for#Products,# Products#with#Data,#Data#as#Business# My#Goal#today:##

    •  You’ve#heard#the#most#important#Big#Data#terms#(see#above)#and# can#fit#them#into#one#coherent#picture#of#the#Big#Data#world# •  You#know#where#to#start#(if#you#want#to)#
  4. Mo-va-on$ $ The#Data#Deluge#and## The#Need#for#(more)#Data#Driven#Decision#

  5. The#Data#Deluge# •  Cheap#networked#sensors,#social#web,#digital# workflows#...#and#the#DaBficaBon#of# everything##lead#to:# # – Datasets#of#much#larger#size#(Volume)# – Datasets#with#many#different#formats#(Variety)# – Datasets#that#change#fast#(Velocity)#

  6. The#need#for#(more)## Data#Driven#Decision# CompeBBon#and#the#expectaBon#of#further# progress#require#the#use#and#fast#processing#of# this#data#

  7. Further#progress#in#medicine#rests#on#understanding# complex#relaBonships#and#individualized#treatments.#

  8. Further#producBvity#gains#in#farming#will#largely#have# to#come#from#the#opBmized#use#of#machines,# ferBlizer#and#pesBcides##

  9. With#every#product#available#everywhere#at#the#click# of#a#bu`on,#customizaBon#products#and#services#to# ever#smaller#customer#groups#becomes#paramount#

  10. Unpredictable#fluctuaBon#in#energy#producBon#(and# demand)#make#it#necessary#to#opBmize#the#energy# network#in#realBme.##

  11. InnovaBons#in#logisBcs#such#as#“SamecDaycDelivery”# rest#on#real#Bme#planning#and#opBmizaBon.##

  12. n=All#&#t=now# It#is#no#longer#enough#to#plan#for#averages#(Bme,# space,#customers),#but#necessary#to#opBmize#for# the#individual,#precise#locaBons#and#now#

  13. None
  14. Cost$ $ The#Big#Data#Technologies#used#to#lower#the# cost#to#build#systems#that#do#more#complex# processing#with#more#data#faster.##

  15. None
  16. Volume#&#Variety# Data#OS#–#Hadoop#&#Enterprise#Data#Lake# NoSQL#et#al.# Distributed#Search#

  17. The#Hadoop#ecosystem#provides#a#‘data#operaBng# system’#with#applicaBons#for#efficient,#diverse,#very# scalable#data#storage#and#processing.#

  18. The#use#of#hadoop#can#radically#lower#costs#for#many# wharehousing#/#data#storage#scenarios# BigData$–$what$does$it$really$cost?$ Winter$Corpora-on$

  19. Enterprise$Data$Lake$/$Data$Hub:#The#vision#to#transform# business#it#architecture#with#Hadoop#through#a#central#data# store#feeding#all#DWHs#–#simplifying#DWH/ETL#architecture#and# radically#speeding#up#the#creaBon#of#new#reports/dashboards.## …#

  20. Distributed#(No)SQL#databases#easily#scale#to#very# large#datasets,#very#high#load#and#do#not#need# predefined#schemata#(can#deal#easily#with#Variability)#

  21. The#ElasBcsearch#Stack#(with#Logstash#and#Kibana)# provides#an#end#to#end#soluBon#for#the#management# of#(semicstructured)#textual#data#(in#parBcular#logs)#

  22. Velocity# •  With$Distributed$Stream$Processing#from#hours# to#seconds## •  With#In0Memory$compuBng#from#minutes#to# seconds### •  With#Reac-ve$Programming#from#seconds#to# microseconds##

    #
  23. Distributed#Stream#Processing#systems#enable#the# cheap#creaBon#of#systems#that#create#realBme#views# from#fast#moving#data.## Apache#Storm#

  24. Building#on#speed#improvements#from#doing#computaBons#“In0 Memory”,#these#systems#reduce#the#Bme#for#large#Analysis# tasks#from#minutes#to#(milic)seconds##

  25. Reac-ve$Programming$techniques#from#domains#where# microseconds#count#(e.g.#High#Frequency#Trading,#Real#Time# AdverBsing,#intrusion#prevenBon,#sensors#with#high#data#rates#)# are#becoming#mainstream#and#easy#to#use.##

  26. Moores#Law,#Netzwerk#Äquivalent,#Cloud#CompuBng,#Data#Value#Chain# Moore’s#Law,#Cloud#CompuBng,#advances#in#networking# technology#and#emerging#data#value#chains#are#trends# that#further#increase#the#speed#by#which#costs#are#falling.###

  27. Value$ $ Big#Date#Technologies#add#value#by#making# more#pa`erns#(in#data)#visible#and#useful## c#through#AnalyBcs##

  28. None
  29. AnalyBcs#Types# •  Descrip-ve:#What#is#and#what#has#been?# –  Special#Cases:#Visual#AnalyBcs,#Causal#AnalyBcs# •  Predic-ve:#What#will#be?# •  Prescrip-ve:#What#is#my#opBmal#course#of# acBon?#

  30. Descrip-ve$Analy-cs:#IntegraBng#data#and#making#it# accessible#to#understand#past#and#present,#e.g.#the# success#of#a#tried#treatments#in#similar#paBents.## Fla-ron$Health$

  31. Visual$Analy-cs:#AggregaBng#informaBon#into#forms# that#human#knowledge#and#intuiBon#can#be#applied# to#harness#it,#e.g.#to#understand#elecBon#fraud.##

  32. Part#with#defect# Causal$Analy-cs:#Using#data#to#understand#the#root# cause#of#observed#phenomena#such#as#defects#(also# known#as#Root#Cause#Analysis)##

  33. BlueYonder#forward#Demand# Predic-ve$Analy-cs:$Using#pa`erns#in#data#to#predict# the#future,#e.g.#of#demand#

  34. Prescrip-ve$Analy-cs:#Find#an#opBmal#course#of# acBon,#e.g.#an#opBmal#sequence#for#recstarBng#flights# a2er#a#large#disturbance# Taleris$

  35. AnalyBcs#ApplicaBon#Areas## •  OperaBonal#Excellence# •  Customer#InBmacy# •  Product#Leadership#

  36. Maschinenwartung# Opera-onal$Excellence:$Use#of#data#to#increase#the# efficiency#in#the#creaBon#of#products#and#service,#e.g.# through#proacBve#maintenance.#$# Metropolitan$Transporta-on$

  37. Tailor# Customer$In-macy:$Use#of#data#to#be`er#tailor#products#and# services#to#customers,#e.g.#instantly#display#a#prospecBve# customers#value#in#order#to#tailor#offers#to#that.## Meena$Kadri$@$Flickr$

  38. Eyakukükulioio# Product$Leadership:$Use#of#data#to#create#products#of# unmatched#quality,#e.g.#through#the#systemaBc#collecBon# of#DRO#data#for#all#cars#throughout#their#lifecycle.#$# Volvo$

  39. 360°#Everything# Most#common#pa`ern#is#the#integraBon#of#large$ amounts#of#diverse$informa-on#for#each# product/machine/customer#into#one#coherent# real$-me#picture#

  40. Business#Models# •  Data#Driven#Business# •  Data#+#Product# – Data#for#Products# – Products#with#Data## •  Data#as#Business#

  41. Eyakukükulioio# Data$for$Products:$Data#driven#services#that#enable#the# opBmal#use#of#products#sold,#e.g.#machines#and#services# to#opBmize#agricultural#yield#down#to#the#square#foot.## Monsanto$integrated$farming$systems$

  42. Eyakukükulioio# ..#or#the#opBmal#deployment#and#maintenance#of# trucks### Navistar$

  43. Strava# Product$with$Data:$Product#offering#funcBonality# strongly#dependent#on#AnalyBcs,#e.g.#an#alarm#clock# that#wakes#based#on#sensor#analyBcs$# Wiithings$Aura$

  44. Strava# Data$as$Business:$Profit#from#harnessing#the#data# collecBng#through#other#services,#e.g.#data#for#urban# planning#from#fitness#devices## Strava$

  45. Drilling#guys# or#collected#specifically#to#be#sold#…## Drilling$Info$

  46. #Next$$

  47. People#&#Skills# •  Build#up#capability#in#your#organizaBon#to# harness#Big#Data#technologies#to#build#data# management#infrastructures#that#are#cheaper,# simpler#and#more#powerful# •  Through# – Pilot#projects#with#these#technologies# – Guided#introducBons#such#as#our#own#“Big#Data#

    Hands#On”#workshop###
  48. Big#Data#Hands#On#Workshop# •  Learning#about#BigData#by#building#a#big#Data# soluBon#guided#by#our#experts# – Day#1:#Data#integraBon#with#Logstash#and# SpringXD# – Day#2:#DescripBve#AnalyBcs#and#ExploraBon#with# ElasBc#Search/Kibana#and#Hive/Hadoop# – Day#3:#PredicBve#AnalyBcs#with#Mahout# • 

    On#premise,#contents#are#customizable.##
  49. AnalyBcs#Use#Cases# •  Find#ways#data#can#improve#your#decisions,# that#of#your#customers,#your#suppliers#or#?# – By#reading## # # # – Or#in#workshops#(with#vendors,#independent# consultants,#strategy#consultants)#

    Cuckier# Big# Data#
  50. connect#/#download#slides#at# www.vzach.de# Mo-va-on,$Data$Deluge,#DaBficaBon,#Volume,#Variety,#Velocity,#Data$Driven$ Decision,$n=All,#t=now,#Cost,$Data$OS,#Hadoop,#Enterprise$Data$Lake,# Distributed$Search,#ElasBcSearch,#NoSQL$et$al,#Cassandra,#mongoDB,#Riak,#Pivotal,# cloudera,#Distributed$Stream$Processing,#Storm,#In$Memory$Compu-ng,#SAP# Hana,#Spark,#EXASOL,#Reac-ve$Programming,#Moore’s#Law,#Cloud#CompuBng,#Data# Value#Chain,#Value,#Analy-cs,#DescripBve,#Visual,#Causal,#PredicBve,#PrescripBve,# Applica-on$Areas,#OperaBonal#Excellence,#Customer#InBmacy,#Product#Leadership,# 360°$Everything,$Business$Models,#Data#Driven#Business,#Data#for#Products,#

    Products#with#Data,#Data#as#Business#