Becoming Insight Driven With Big Data

Becoming Insight Driven With Big Data

Organisations need to innovate to stay ahead of the competition and to survive into the future. Most of the time innovation needs IT. IT needs reliable information to get insights. Reliable information is derived from data.
Oracle has a (Big) Data platform which aims to offer organizations access to any data source of any data type. This data could be accessed by any type of analyses and with any type of language. Oracle offers the same platform both on-premise as well as in the cloud.

This presentation gives an introduction into the Oracle Big Data Cloud offering. Which services are offered and does this fit into the Oracle Information Reference Architecture.

The Oracle Cloud offering keeps evolving and growing. This presentation aims to put things in context. Which Oracle Cloud services would one need to fulfill the needs of an organization?

Ca2e6b328391fecf099a70083c361c36?s=128

Daan Bakboord

December 04, 2017
Tweet

Transcript

  1. 1 Becoming Insight Driven With Big Data UKOUG Technology Conference

    & Exhibition 2017
  2. 2 Stay Ahead of the Competition Differentiate Change Improve Innovate

    Invest
  3. 3 Quistor enables Data Drive Decisions Come into Action Make

    a Desicion
  4. 4 Introduction Quistor: Your Business Analytics Partner of Choice Customers

    Worldwide 150+ Analytics & Big Data 12Years In Business Value Propositions 4 Delivery Centers 170 Employees 10 European Offices 35y Average Age Oracle Platinum Partner Managed Services JD Edwards Digital 24 7 Cloud ExaHotel
  5. 5 Who am I? http://www.daanbakboord.com https://twitter.com/daanbakboord https://nl.linkedin.com/in/daanbakboord Daan Bakboord •

    Oracle Big Data Anlytics Consultant @ Quistor – Oracle BI EE (OBIEE) – Oracle Analytics Cloud (OAC, BICS) – Oracle Data Visualization – Oracle Big Data – Oracle BI Applications (OBIA) • Information Architecture – TOGAF – Archimate http://blog.daanalytics.nl #obihackers nl.OUG BIWA SIG Lead
  6. 6 Data Driven Decisions Initiatives IT IT IT Reliable Information

    / Insights A B Change / Policy / Strategy Data - Organisations need to take actions to move forward from A to B - Initiatives involve a IT component, which are driven by reliable information / insights - It all begins by having access to the right data
  7. 7 Data Explosion Mobile Social Internet of Things

  8. 8

  9. 9 Today’s data challenges • Too many different variety’s of

    Data Sources • Big Data Technology fragmented and complex • Specialized skills in short supply
  10. 10 Oracle Information Management Reference Architecture – Data Management Data

    Fast Data Events Actions 1 2 3 Streams Data Management Reservoir Factory Warehouse Results People Data Services Smart Things Data Lab Data Science Discovery Data Sets Apps Packaged Custom Business Analytics Visualization Reports Execution Innovation
  11. 11 Traditional Business Intelligence – Oracle Data Warehouse Deployment Choice

    Data Management Reservoir Factory Warehouse • Ideal Database Hardware • Smart System Software • Full-Stack Integration On-Premises Oracle Exadata Customer Data Center Purchased Customer Managed Oracle Cloud Oracle Exadata Cloud Service Oracle Data Center Subscription Oracle Managed Customer Cloud Oracle Exadata Cloud Service Customer Data Center Subscription Oracle Managed Oracle Cloud Autonomous Database Cloud Service Oracle Data Center Subscription Oracle Managed
  12. 12 Traditional Business Intelligence – Data Warehousing Data Management Reservoir

    Factory Warehouse Oracle Database 12.2 – New Features • Better In-Memory capabilities for DWH – Data Scans – Joins – Aggregation • New SQL Features – Approximate Query processing – Faster JSON processing via in-memory – Analytic Views • Common business logic inside the database • New highly-scalable Property Graph analytics
  13. 13 What’s Hadoop? Hadoop is a Software Framework for Storing,

    Processing and Analyzing Big Data • Distributed • Scalable • Fault-tolerant • Open Source
  14. 14 Core Hadoop • Distributed File System HDFS – Stores

    data • Hadoop MapReduce – Processes data • Hadoop Yarn – Schedules work
  15. 15 Hadoop Eco System Data Management Reservoir Factory Warehouse

  16. 16 Big Data Distributions How to Answer to: • Support

    • Compliance • Performance • Scalability • Security Big Data Distributions Pre-Packaged, tested and validated packaged solutions based on Apache Hadoop – Technical Support – Services – Training
  17. 17 Big Data Distributions – Cloudera Enterprise Data Hub •

    Proven, user-friendly technology. – Use Case; Enterprise Data Hub. Let the Hadoop platform serve as a central data repository.
  18. 18 Big Data Distributions – MapR Converged Data Platform •

    Stable platform with a generic file-system and fast processing. – Use Case; Integrated platform with a focus on streaming.
  19. 19 Big Data Distributions – Hortonworks Connected Data Platforms •

    100% Open source with minimal investment. – Use Case; Modernizing your traditional EDW.
  20. 20 Oracle Big Data Appliance Data Management Reservoir Factory Warehouse

    Hardware and Software engineered together Oracle Big Data Appliance includes: • Oracle Sun x86 servers powered by the Intel® Xeon® processor family • InfiniBand and Ethernet connectivity • Cloudera Enterprise – Data Hub Edition (including CDH, Impala, Spark, Kafka, etc.) • Oracle NoSQL Database Community Edition • Comprehensive security, including authentication, authorization, and auditing capabilities • Oracle Linux • Oracle Java JDK • Oracle R Distribution
  21. 21 Oracle Big Data Appliance Data Management Reservoir Factory Warehouse

    Hardware and Software engineered together Oracle Big Data Appliance includes optionally: • Oracle Big Data SQL • Oracle Big Data Connectors: – Oracle SQL Connector for Hadoop – Oracle Loader for Hadoop – Oracle XQuery for Hadoop – Oracle R Advanced Analytics for Hadoop – Oracle Data Integrator • Audit Vault and Database Firewall for Hadoop Auditing • Oracle Data Integrator • Oracle GoldenGate • Oracle NoSQL Database Enterprise Edition • Oracle Big Data Spatial and Graph • Oracle Big Data Discovery
  22. 22 Information Management Reference Architecture

  23. 23 Information Management Reference Architecture - Implementation

  24. 24 Information Management Reference Architecture – Oracle Implementation

  25. 25 Object Storage Cloud Object Storage Cloud Data in &

    Data out. • Ingest Data through Kafka (Event Hub – Kafka Cloud) • Process it through a processing tier – Oracle Big Data Cloud Service – Oracle Database Cloud • Land it in the Object Storage Cloud Service – Object Storage (Elastic, Fast and Secure) – Archive Storage (Infrequently accessed Data) – Database Backup – Large Dataset Transfer • Work with it via PaaS or Custom Service Database Cloud Event Hub – Kafka Cloud Big Data Cloud Object Storage Cloud Service
  26. 26 Oracle Big Data Platform

  27. 27 Oracle Big Data in the Cloud Services (Compute Edition)

    Big Data Cloud Compute Oracle Big Data Cloud Service (BDCS) • Long-Lived Clusters • Full Cloudera Eco-System • Engineered Systems backbone • Focus on Performance & Control of environment • Big Data Cloud SQL • Big Data Cloud Machine (Customer On-Premise) Big Data Cloud Oracle Big Data Cloud Service Compute Edition (BDCS-CE) • Short-Lived Clusters – POC • Apache Hadoop & Apache Spark • Focus on Flexibility & Simplicity
  28. 28 Oracle Event Hub Cloud Service • Apache Kafka delivered

    as a managed service • Real-time Streaming data into Oracle Object Storage • Integrated with Oracle Data Integration Cloud • Elastic o Dedicated by the nodes o Multi-tenant by the partitions • On Oracle Public Cloud or On-Premise through Cloud@Customer • Open Standards based Event Hub – Kafka Cloud
  29. 29 Oracle Big Data Platform Use Case – Data Lake

  30. 30 Oracle Big Data Platform Use Case – Data Science

  31. 31 Big Data SQL Architecture Oracle Big Data SQL Cloud

    Service
  32. 32 ”Change is the law of life. And those who

    look only to the past or present are certain to miss the future. – John F. Kennedy”
  33. 33 Oracle Analytics Cloud Service

  34. 34 Big Data Analytics – Oracle Analytics Cloud "The Forrester

    Wave™: Enterprise BI Platforms with Majority Cloud Deployments, Q3 2017” Complete spectrum of Enterprise BI needs – From Self Service to Oracle Essbase MOLAP – Common Enterprise Information Model – Oracle Day by Day Mobile • Smart • Governed • Hadoop / Spark • Search • Visual-based
  35. 35 Oracle Analytics Cloud Service (OAC) – Collaborative, providing efficient

    methods to interact and share information – Connected, to all the data required to support processes and decisions – Complete, providing all the needed analytical capabilities – Choice, about how to deploy, both now and in the future. ”Ask any Question of any Data, in any Environment, using any Device”
  36. 36 Oracle Analytics Cloud Editions

  37. 37 Data Lake • Collect & organize large volumes of

    diverse data for later use – raw / original / native / as-is • Preparation & transformation based on the use case – Schema on Read • Benefits – Lower costs – Greater flexibility James Dixon, CTO of Pentaho in 2012
  38. 38 Data Swamp ”Storing all data does not automatically return

    Value” • Context • Governance – planning (e.g. ingestion still needed?) – rules – processes – health checks • Security • Ownership & Sponsorship • Architecture & Technologies Data Lakes don’t replace the Data Warehouse
  39. 39 Analytics Cloud Data Lake Edition - Process Discover Prepare

    Analyze Predict
  40. 40 • Data Ingestion – Replicate (SaaS & Fusion apps)

    – Incremental – Continuous (Oracle GoldenGate) • Manage Data – Projects – Connections – Data Sets – Data Flows & Sequences Discover Discover & access diverse Data Sources Data Integration Services
  41. 41 Prepare • Prepare Data Sets – Excel like interface

    • Data Flows • Advanced Transforms & Scripts – Time Series Forecast – Sentiment Analysis – Custom Scripts in Python & R • Load data into Essbase Programmatic Integration
  42. 42 Analyze • Interactive Visual Composition • Automatic Explain of

    Data Sets Interactive Visualization
  43. 43 Predict • Machine Learning Data Flows • Various Machine-Learning

    Algorithms – Numeric Prediction – Multi-Classifier – Binary Classifier – Clustering – Custom Algorithms Machine Learning models Data Scientists Analysts • Machine to Human – Self-Learning Algorithms • Human to Machine – Define models
  44. 44 Manage & Monitor • Schedule – Data flows –

    Replication – Ingestion • Jobs overview – Statistics – Success or failure information Scheduling and Monitoring
  45. 45 Oracle Mobile for Oracle Analytics Cloud

  46. 46 Big Data Analytics in the Oracle Cloud

  47. 47 Oracle Data Warehouse Evolution – “Transforming to Big Data”

    ”Data Warehousing is Dead?” Data Management Reservoir Factory Warehouse Combine the best of both worlds • Extend Oracle DWH with Oracle Big Data • Combining (new) Big Data with Enterprise Data • Relational & Hadoop & NoSQL – On-Premises & Cloud • Transactional & Social and Web & IoT • Analytics & Data Mining & Machine Learning
  48. 48 Where? Hall 4 Tech17 Community drinks When? Monday 18:45

    – 19:45
  49. 49

  50. 50 Let’s get SOCIAL