What is Google BigQuery?

For CROSS 2016 presentation

Kazunori Sato

February 04, 2016

Transcript

  1. What is Google BigQuery?

  2. +Kazunori Sato
    @kazunori_279
    Kaz Sato
    Staff Developer Advocate,
    Tech Lead for Data & Analytics,
    Cloud Platform, Google Inc.

  3. = The Datacenter as a Computer

  4. Jupiter network
    40GbE ports
    10GbE x 100K = 1 Pbps
    Clos topology
    Software Defined Network
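
The 1 Pbps figure on this slide follows directly from the port count and port speed. A minimal Python check of that arithmetic is sketched below; the function name is hypothetical and only illustrates the multiplication, not how Jupiter itself is measured.

```python
def aggregate_bandwidth_bps(port_speed_gbps: float, port_count: int) -> float:
    """Return total bandwidth in bits per second for identical ports."""
    return port_speed_gbps * 1e9 * port_count


# 10GbE x 100K ports, as stated on the slide.
total_bps = aggregate_bandwidth_bps(10, 100_000)
print(total_bps / 1e15, "Pbps")
```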

  5. Borg
    Launches 2B containers / week
    Manages 10K machines / Cell
    Datacenter-scale proactive job scheduling
    (CPU, memory, disk I/O, TCP ports)
    Paxos-based metadata store

  6. Google BigQuery

  7. [Figure with values: 1 B, 1 B, 100 B, 900 M]

  8. At Google, MapReduce is classic.
    We use BigQuery.

  9. SELECT your_data FROM billions_of_rows
    WHERE full_disk_scan_required = true;
    Scanning 1 TB in 1 sec
    with 5,000 - 10,000 disk spindles
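
To see why thousands of spindles are needed, it helps to divide the scan rate across the disks. The sketch below assumes a decimal terabyte (10^12 bytes), data spread evenly over all disks, and fully parallel reads; these are simplifying assumptions for illustration, and the function name is hypothetical.

```python
def per_disk_throughput_mb_s(total_bytes: float, seconds: float, disks: int) -> float:
    """Throughput each disk must sustain, in MB/s, for an evenly split scan."""
    return total_bytes / (seconds * disks) / 1e6


ONE_TB = 1e12  # decimal terabyte, an assumption

for disks in (5_000, 10_000):
    rate = per_disk_throughput_mb_s(ONE_TB, 1, disks)
    print(f"{disks} disks: {rate:.0f} MB/s per disk")
```

Under these assumptions each disk has to deliver roughly 100 to 200 MB/s, which is in the range of sequential reads for a single hard drive.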

  10. How to use BigQuery?
    BigQuery: Analytic Service in the Cloud
    Import: Google Analytics, ETL tools, Connectors, Google Cloud
    Analyze / Export: BI tools and Visualization, Google Cloud, Spreadsheets, R, Hadoop

  11. Benefits of BigQuery
    Blazingly Fast
      Capable of scanning 10B rows in ~10 sec
    Low Cost
      Storage: $0.020 per GB per month
      Queries: $5 per TB
    Fully Managed
      Use thousands of servers with zero ops
    SQL
      Simple and intuitive SQL with JS UDF
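
The two prices on this slide are enough for a rough monthly estimate. The sketch below uses only the slide's figures ($0.020 per GB per month for storage, $5 per TB for queries); it ignores free tiers and any other charges, the prices may have changed since the talk, and the function name and workload numbers are hypothetical.

```python
STORAGE_USD_PER_GB_MONTH = 0.020  # from the slide
QUERY_USD_PER_TB = 5.0            # from the slide


def estimate_monthly_cost(stored_gb: float, scanned_tb: float) -> float:
    """Rough monthly cost in USD: storage plus data scanned by queries."""
    return stored_gb * STORAGE_USD_PER_GB_MONTH + scanned_tb * QUERY_USD_PER_TB


# Hypothetical workload: 1,000 GB stored, 10 TB scanned per month.
cost = estimate_monthly_cost(stored_gb=1_000, scanned_tb=10)
print(f"${cost:.2f}")  # prints $70.00
```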

    View full-size slide