MapD Workshop: Visualizing Billions Of Data Points With GPUs

FOSSCON | August 25, 2018
© MapD 2018

Aaron Williams
VP of Global Community
@_arw_ | [email protected] | /in/aaronwilliams/ | /williamsaaron
slides: https://speakerdeck.com/mapd

Do This Now
If you want to participate in the tutorials, sign up for a free trial account on MapD Cloud: http://mapd.cloud

Core Density Makes a Huge Difference
GPU Processing: 40,000 cores
CPU Processing: 20 cores
(*fictitious example)
       Latency           Throughput
CPU    1 ns per task     (1 task/ns) x (20 cores) = 20 tasks/ns
GPU    10 ns per task    (0.1 tasks/ns) x (40,000 cores) = 4,000 tasks/ns
Latency: Time to do a task.
Throughput: Number of tasks per unit time.
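The arithmetic on this slide can be checked with a few lines of Python. This is only a sketch of the slide's fictitious example; the function name `throughput_tasks_per_ns` is made up for illustration, and it assumes every core works on independent tasks with no scheduling overhead.

```python
# Throughput = (tasks per ns on one core) x (number of cores).
# The figures are the slide's fictitious example, not measurements.

def throughput_tasks_per_ns(latency_ns: float, cores: int) -> float:
    """Aggregate throughput, assuming all cores run independent tasks."""
    tasks_per_ns_per_core = 1 / latency_ns
    return tasks_per_ns_per_core * cores

cpu = throughput_tasks_per_ns(latency_ns=1, cores=20)
gpu = throughput_tasks_per_ns(latency_ns=10, cores=40_000)

print(f"CPU: {cpu:.0f} tasks/ns")
print(f"GPU: {gpu:.0f} tasks/ns")
print(f"GPU/CPU throughput ratio: {gpu / cpu:.0f}x")
```

Each GPU task is ten times slower in this example, yet the core count more than makes up for it in aggregate throughput.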

MapD is the analytics platform created for GPUs
/mapd/mapd-core

Advanced memory management
Three-tier caching to GPU RAM for speed and to SSDs for persistent storage
Tier                         Capacity         Bandwidth           Data    Speedup over cold data
GPU RAM (L1)                 24GB to 256GB    1000-6000 GB/sec    Hot     1500x to 5000x
CPU RAM (L2)                 32GB to 3TB      70-120 GB/sec       Warm    35x to 120x
SSD or NVRAM STORAGE (L3)    250GB to 20TB    1-2 GB/sec          Cold    (baseline)
Diagram labels: COMPUTE LAYER, STORAGE LAYER, Data Lake/Data Warehouse/System Of Record
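To get a feel for what these bandwidth ranges mean, the sketch below estimates how long one full pass over a dataset would take at each tier. The 100 GB dataset size and the helper `scan_seconds` are assumptions made for illustration. The estimate is bandwidth-only: it ignores latency, compression, and query overhead, and it is not how the slide's speedup figures were derived.

```python
# Bandwidth ranges (GB/sec) as listed on the slide: (low, high) per tier.
TIERS_GB_PER_SEC = {
    "GPU RAM (L1)": (1000, 6000),
    "CPU RAM (L2)": (70, 120),
    "SSD or NVRAM STORAGE (L3)": (1, 2),
}

DATASET_GB = 100  # hypothetical dataset size, not from the slides

def scan_seconds(dataset_gb: float, bandwidth_gb_per_sec: float) -> float:
    """Time for one sequential pass over the data at a given bandwidth."""
    return dataset_gb / bandwidth_gb_per_sec

for tier, (low, high) in TIERS_GB_PER_SEC.items():
    fastest = scan_seconds(DATASET_GB, high)
    slowest = scan_seconds(DATASET_GB, low)
    print(f"{tier}: {fastest:.3f} s to {slowest:.3f} s per full scan")
```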

Ibis Interface
Scaling the familiar pandas DataFrame API into the billions of records at interactive speed
https://www.mapd.com/blog/scaling-pandas-to-the-billions-with-ibis-and-mapd/

LIDAR in 3D with deck.gl
Check out our custom app to visualize large, complex data sets like LIDAR
https://www.mapd.com/blog/3d-lidar-with-mapd-and-ubers-deck-gl/

Last Chance ...
If you want to participate in the tutorials, sign up for a free trial account on MapD Cloud: http://mapd.cloud

Step 1: MapD Immerse Basics
1. View your data in the Data Manager
2. Import data
   a. Local CSV
   b. S3 Bucket
3. View your dashboards
4. Create a new dashboard
   a. SAVE!

Geospatial Objects
Type            Description
POINT           A point described by two coordinates.
LINESTRING      A sequence of 2 or more points and the lines that connect them.
POLYGON         A set of one or more rings (closed line strings), with the first representing the shape (external ring) and the rest representing holes in that shape (internal rings).
MULTIPOLYGON    A set of one or more polygons.
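One way to make the four types concrete is to write them out as Well-Known Text (WKT), the standard text encoding for these geometries. The slide does not mention WKT, so treat this as an illustrative sketch; the helper names (`point`, `linestring`, `polygon`, `multipolygon`) and the sample coordinates are made up for the example.

```python
def _coords(points):
    """Format (x, y) pairs as 'x y, x y, ...'."""
    return ", ".join(f"{x} {y}" for x, y in points)

def _ring(points):
    """A ring is a closed line string: repeat the first point at the end."""
    pts = list(points)
    if pts[0] != pts[-1]:
        pts.append(pts[0])
    return f"({_coords(pts)})"

def point(x, y):
    return f"POINT ({x} {y})"

def linestring(points):
    if len(points) < 2:
        raise ValueError("a LINESTRING needs 2 or more points")
    return f"LINESTRING ({_coords(points)})"

def _polygon_body(exterior, holes=()):
    # First ring is the shape (external ring); the rest are holes.
    return "(" + ", ".join(_ring(r) for r in [exterior, *holes]) + ")"

def polygon(exterior, holes=()):
    return "POLYGON " + _polygon_body(exterior, holes)

def multipolygon(polygons):
    """polygons: iterable of (exterior, holes) pairs."""
    bodies = ", ".join(_polygon_body(ext, holes) for ext, holes in polygons)
    return f"MULTIPOLYGON ({bodies})"

square = [(0, 0), (4, 0), (4, 4), (0, 4)]  # hypothetical coordinates
hole = [(1, 1), (2, 1), (2, 2), (1, 2)]

print(point(1, 2))
print(linestring([(0, 0), (1, 1)]))
print(polygon(square, holes=[hole]))
print(multipolygon([(square, [hole]), ([(10, 10), (12, 10), (12, 12)], [])]))
```

Note how each ring repeats its first point at the end, which is what "closed line string" means in the POLYGON description.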

Step 2: Loading MapD Shapefiles into Immerse
1. Polygons: SF City and County Subdivision Parcels
   MULTIPOLYGONS in GeoJSON
   https://s3.amazonaws.com/mapd-data/geodata/citylots.json
2. Points: SF City-owned Critical Facilities
   POINTS in ESRI Shapefile
   https://s3.amazonaws.com/mapd-data/geodata/sffacs_current.zip

Thank you! Questions?

DEMO TIME!

MapD Core: Query Compilation with LLVM

Get That Open Source
/mapd/mapd-core
community.mapd.com

Tutorial: Geospatial Data in MapD Cloud
