Slide 1

Slide 1 text

1 Becoming Insight Driven With Big Data UKOUG Technology Conference & Exhibition 2017

Slide 2

Slide 2 text

2 Stay Ahead of the Competition Differentiate Change Improve Innovate Invest

Slide 3

Slide 3 text

3 Quistor enables Data Drive Decisions Come into Action Make a Desicion

Slide 4

Slide 4 text

4 Introduction Quistor: Your Business Analytics Partner of Choice Customers Worldwide 150+ Analytics & Big Data 12Years In Business Value Propositions 4 Delivery Centers 170 Employees 10 European Offices 35y Average Age Oracle Platinum Partner Managed Services JD Edwards Digital 24 7 Cloud ExaHotel

Slide 5

Slide 5 text

5 Who am I? http://www.daanbakboord.com https://twitter.com/daanbakboord https://nl.linkedin.com/in/daanbakboord Daan Bakboord • Oracle Big Data Anlytics Consultant @ Quistor – Oracle BI EE (OBIEE) – Oracle Analytics Cloud (OAC, BICS) – Oracle Data Visualization – Oracle Big Data – Oracle BI Applications (OBIA) • Information Architecture – TOGAF – Archimate http://blog.daanalytics.nl #obihackers nl.OUG BIWA SIG Lead

Slide 6

Slide 6 text

6 Data Driven Decisions Initiatives IT IT IT Reliable Information / Insights A B Change / Policy / Strategy Data - Organisations need to take actions to move forward from A to B - Initiatives involve a IT component, which are driven by reliable information / insights - It all begins by having access to the right data

Slide 7

Slide 7 text

7 Data Explosion Mobile Social Internet of Things

Slide 8

Slide 8 text

8

Slide 9

Slide 9 text

9 Today’s data challenges • Too many different variety’s of Data Sources • Big Data Technology fragmented and complex • Specialized skills in short supply

Slide 10

Slide 10 text

10 Oracle Information Management Reference Architecture – Data Management Data Fast Data Events Actions 1 2 3 Streams Data Management Reservoir Factory Warehouse Results People Data Services Smart Things Data Lab Data Science Discovery Data Sets Apps Packaged Custom Business Analytics Visualization Reports Execution Innovation

Slide 11

Slide 11 text

11 Traditional Business Intelligence – Oracle Data Warehouse Deployment Choice Data Management Reservoir Factory Warehouse • Ideal Database Hardware • Smart System Software • Full-Stack Integration On-Premises Oracle Exadata Customer Data Center Purchased Customer Managed Oracle Cloud Oracle Exadata Cloud Service Oracle Data Center Subscription Oracle Managed Customer Cloud Oracle Exadata Cloud Service Customer Data Center Subscription Oracle Managed Oracle Cloud Autonomous Database Cloud Service Oracle Data Center Subscription Oracle Managed

Slide 12

Slide 12 text

12 Traditional Business Intelligence – Data Warehousing Data Management Reservoir Factory Warehouse Oracle Database 12.2 – New Features • Better In-Memory capabilities for DWH – Data Scans – Joins – Aggregation • New SQL Features – Approximate Query processing – Faster JSON processing via in-memory – Analytic Views • Common business logic inside the database • New highly-scalable Property Graph analytics

Slide 13

Slide 13 text

13 What’s Hadoop? Hadoop is a Software Framework for Storing, Processing and Analyzing Big Data • Distributed • Scalable • Fault-tolerant • Open Source

Slide 14

Slide 14 text

14 Core Hadoop • Distributed File System HDFS – Stores data • Hadoop MapReduce – Processes data • Hadoop Yarn – Schedules work

Slide 15

Slide 15 text

15 Hadoop Eco System Data Management Reservoir Factory Warehouse

Slide 16

Slide 16 text

16 Big Data Distributions How to Answer to: • Support • Compliance • Performance • Scalability • Security Big Data Distributions Pre-Packaged, tested and validated packaged solutions based on Apache Hadoop – Technical Support – Services – Training

Slide 17

Slide 17 text

17 Big Data Distributions – Cloudera Enterprise Data Hub • Proven, user-friendly technology. – Use Case; Enterprise Data Hub. Let the Hadoop platform serve as a central data repository.

Slide 18

Slide 18 text

18 Big Data Distributions – MapR Converged Data Platform • Stable platform with a generic file-system and fast processing. – Use Case; Integrated platform with a focus on streaming.

Slide 19

Slide 19 text

19 Big Data Distributions – Hortonworks Connected Data Platforms • 100% Open source with minimal investment. – Use Case; Modernizing your traditional EDW.

Slide 20

Slide 20 text

20 Oracle Big Data Appliance Data Management Reservoir Factory Warehouse Hardware and Software engineered together Oracle Big Data Appliance includes: • Oracle Sun x86 servers powered by the Intel® Xeon® processor family • InfiniBand and Ethernet connectivity • Cloudera Enterprise – Data Hub Edition (including CDH, Impala, Spark, Kafka, etc.) • Oracle NoSQL Database Community Edition • Comprehensive security, including authentication, authorization, and auditing capabilities • Oracle Linux • Oracle Java JDK • Oracle R Distribution

Slide 21

Slide 21 text

21 Oracle Big Data Appliance Data Management Reservoir Factory Warehouse Hardware and Software engineered together Oracle Big Data Appliance includes optionally: • Oracle Big Data SQL • Oracle Big Data Connectors: – Oracle SQL Connector for Hadoop – Oracle Loader for Hadoop – Oracle XQuery for Hadoop – Oracle R Advanced Analytics for Hadoop – Oracle Data Integrator • Audit Vault and Database Firewall for Hadoop Auditing • Oracle Data Integrator • Oracle GoldenGate • Oracle NoSQL Database Enterprise Edition • Oracle Big Data Spatial and Graph • Oracle Big Data Discovery

Slide 22

Slide 22 text

22 Information Management Reference Architecture

Slide 23

Slide 23 text

23 Information Management Reference Architecture - Implementation

Slide 24

Slide 24 text

24 Information Management Reference Architecture – Oracle Implementation

Slide 25

Slide 25 text

25 Object Storage Cloud Object Storage Cloud Data in & Data out. • Ingest Data through Kafka (Event Hub – Kafka Cloud) • Process it through a processing tier – Oracle Big Data Cloud Service – Oracle Database Cloud • Land it in the Object Storage Cloud Service – Object Storage (Elastic, Fast and Secure) – Archive Storage (Infrequently accessed Data) – Database Backup – Large Dataset Transfer • Work with it via PaaS or Custom Service Database Cloud Event Hub – Kafka Cloud Big Data Cloud Object Storage Cloud Service

Slide 26

Slide 26 text

26 Oracle Big Data Platform

Slide 27

Slide 27 text

27 Oracle Big Data in the Cloud Services (Compute Edition) Big Data Cloud Compute Oracle Big Data Cloud Service (BDCS) • Long-Lived Clusters • Full Cloudera Eco-System • Engineered Systems backbone • Focus on Performance & Control of environment • Big Data Cloud SQL • Big Data Cloud Machine (Customer On-Premise) Big Data Cloud Oracle Big Data Cloud Service Compute Edition (BDCS-CE) • Short-Lived Clusters – POC • Apache Hadoop & Apache Spark • Focus on Flexibility & Simplicity

Slide 28

Slide 28 text

28 Oracle Event Hub Cloud Service • Apache Kafka delivered as a managed service • Real-time Streaming data into Oracle Object Storage • Integrated with Oracle Data Integration Cloud • Elastic o Dedicated by the nodes o Multi-tenant by the partitions • On Oracle Public Cloud or On-Premise through Cloud@Customer • Open Standards based Event Hub – Kafka Cloud

Slide 29

Slide 29 text

29 Oracle Big Data Platform Use Case – Data Lake

Slide 30

Slide 30 text

30 Oracle Big Data Platform Use Case – Data Science

Slide 31

Slide 31 text

31 Big Data SQL Architecture Oracle Big Data SQL Cloud Service

Slide 32

Slide 32 text

32 ”Change is the law of life. And those who look only to the past or present are certain to miss the future. – John F. Kennedy”

Slide 33

Slide 33 text

33 Oracle Analytics Cloud Service

Slide 34

Slide 34 text

34 Big Data Analytics – Oracle Analytics Cloud "The Forrester Wave™: Enterprise BI Platforms with Majority Cloud Deployments, Q3 2017” Complete spectrum of Enterprise BI needs – From Self Service to Oracle Essbase MOLAP – Common Enterprise Information Model – Oracle Day by Day Mobile • Smart • Governed • Hadoop / Spark • Search • Visual-based

Slide 35

Slide 35 text

35 Oracle Analytics Cloud Service (OAC) – Collaborative, providing efficient methods to interact and share information – Connected, to all the data required to support processes and decisions – Complete, providing all the needed analytical capabilities – Choice, about how to deploy, both now and in the future. ”Ask any Question of any Data, in any Environment, using any Device”

Slide 36

Slide 36 text

36 Oracle Analytics Cloud Editions

Slide 37

Slide 37 text

37 Data Lake • Collect & organize large volumes of diverse data for later use – raw / original / native / as-is • Preparation & transformation based on the use case – Schema on Read • Benefits – Lower costs – Greater flexibility James Dixon, CTO of Pentaho in 2012

Slide 38

Slide 38 text

38 Data Swamp ”Storing all data does not automatically return Value” • Context • Governance – planning (e.g. ingestion still needed?) – rules – processes – health checks • Security • Ownership & Sponsorship • Architecture & Technologies Data Lakes don’t replace the Data Warehouse

Slide 39

Slide 39 text

39 Analytics Cloud Data Lake Edition - Process Discover Prepare Analyze Predict

Slide 40

Slide 40 text

40 • Data Ingestion – Replicate (SaaS & Fusion apps) – Incremental – Continuous (Oracle GoldenGate) • Manage Data – Projects – Connections – Data Sets – Data Flows & Sequences Discover Discover & access diverse Data Sources Data Integration Services

Slide 41

Slide 41 text

41 Prepare • Prepare Data Sets – Excel like interface • Data Flows • Advanced Transforms & Scripts – Time Series Forecast – Sentiment Analysis – Custom Scripts in Python & R • Load data into Essbase Programmatic Integration

Slide 42

Slide 42 text

42 Analyze • Interactive Visual Composition • Automatic Explain of Data Sets Interactive Visualization

Slide 43

Slide 43 text

43 Predict • Machine Learning Data Flows • Various Machine-Learning Algorithms – Numeric Prediction – Multi-Classifier – Binary Classifier – Clustering – Custom Algorithms Machine Learning models Data Scientists Analysts • Machine to Human – Self-Learning Algorithms • Human to Machine – Define models

Slide 44

Slide 44 text

44 Manage & Monitor • Schedule – Data flows – Replication – Ingestion • Jobs overview – Statistics – Success or failure information Scheduling and Monitoring

Slide 45

Slide 45 text

45 Oracle Mobile for Oracle Analytics Cloud

Slide 46

Slide 46 text

46 Big Data Analytics in the Oracle Cloud

Slide 47

Slide 47 text

47 Oracle Data Warehouse Evolution – “Transforming to Big Data” ”Data Warehousing is Dead?” Data Management Reservoir Factory Warehouse Combine the best of both worlds • Extend Oracle DWH with Oracle Big Data • Combining (new) Big Data with Enterprise Data • Relational & Hadoop & NoSQL – On-Premises & Cloud • Transactional & Social and Web & IoT • Analytics & Data Mining & Machine Learning

Slide 48

Slide 48 text

48 Where? Hall 4 Tech17 Community drinks When? Monday 18:45 – 19:45

Slide 49

Slide 49 text

49

Slide 50

Slide 50 text

50 Let’s get SOCIAL