
Global Big Data Conference: Speed Meets Scale For Predictive Analytics

OmniSci
April 03, 2018

Use of the humble GPU has spiked over the past couple of years as machine learning and data analytics workloads have been optimized to take advantage of the GPU's parallelism and memory bandwidth. Even though these operations (the steps of the machine learning pipeline) could all run on the same GPUs, they were typically isolated, and much slower than they needed to be, because data was serialized and deserialized between steps over PCIe. That inefficiency was recently addressed by the formation of the GPU Open Analytics Initiative (GOAI, http://gpuopenanalytics.com/), an industry initiative founded by MapD, H2O.ai, and Anaconda. The group created the GPU data frame (GDF), based on Apache Arrow, for passing data between processes while keeping it all on the GPU. In this talk we will explain how the GDF works, show how it is enabling a diverse set of GPU workloads, and demonstrate how to use Python in a Jupyter notebook to take advantage of it. Using a very large dataset, we'll demonstrate how to run a full machine learning pipeline with minimal data exchange overhead between MapD's SQL engine and H2O's generalized linear model (GLM) library.
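The core idea of the GDF, producing and consuming one shared columnar buffer instead of serializing copies between pipeline steps, can be illustrated on the CPU with nothing but the standard library. This is only a conceptual analogue (Python's buffer protocol standing in for Arrow-format GPU memory), not GOAI code:

```python
from array import array

# A toy columnar "data frame": one contiguous buffer per column,
# mimicking the Arrow-style layout the GDF is built on.
ages = array("d", [34.0, 21.0, 58.0])

# A downstream pipeline step receives a zero-copy view of the column:
# no serialization, no copy of the underlying bytes.
view = memoryview(ages)
assert view[2] == 58.0

# A write through the view is visible to the producer, showing that the
# two steps share one buffer rather than exchanging serialized copies.
view[2] = 60.0
assert ages[2] == 60.0
```

With the GDF the same principle applies to device memory: the SQL engine and the ML library read one Arrow buffer on the GPU instead of round-tripping data over PCIe.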

Transcript

  1. © MapD 2018 | Speed Meets Scale For Predictive Analytics: Running Billions Of Data Points Through A Full Machine Learning Pipeline | Aaron Williams | April 3, 2018
  2. © MapD 2018 | Aaron Williams, VP of Global Community | @_arw_ | [email protected] | /in/aaronwilliams/ | /williamsaaron | slides: https://speakerdeck.com/mapd
  3. © MapD 2018 | "Every business will become a software business, build applications, use advanced analytics and provide SaaS services." - Smart CEO Guy
  4. © MapD 2018 | The Evolution of Data as a Weapon: Collect It, Make It Actionable, Make It Predictive
  5. © MapD 2018 | Advanced memory management: three-tier caching, to GPU RAM for speed and to SSDs for persistent storage.
     - GPU RAM (L1), compute layer: 24GB to 256GB at 1000-6000 GB/sec. Hot data: speedup of 1500x to 5000x over cold data.
     - CPU RAM (L2): 32GB to 3TB at 70-120 GB/sec. Warm data: speedup of 35x to 120x over cold data.
     - SSD or NVRAM storage (L3), storage layer: 250GB to 20TB at 1-2 GB/sec. Cold data, backed by the data lake/data warehouse/system of record.
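The quoted speedups track the tier bandwidths fairly closely: the warm-data range is exactly the ratio of the CPU RAM and SSD bandwidth extremes. A quick sanity check on the slide's figures (the hot-data range of 1500x to 5000x does not reduce to raw bandwidth alone, so presumably other factors contribute there):

```python
def ratio_range(fast, slow):
    """Min/max speedup implied by two tiers' (low, high) bandwidths in GB/sec."""
    return fast[0] // slow[1], fast[1] // slow[0]

ssd = (1, 2)           # L3: SSD or NVRAM storage
cpu_ram = (70, 120)    # L2: CPU RAM
gpu_ram = (1000, 6000) # L1: GPU RAM

# Warm vs. cold: 70/2 = 35x up to 120/1 = 120x, matching the slide's 35x-120x.
assert ratio_range(cpu_ram, ssd) == (35, 120)

# Hot vs. cold: raw bandwidth alone gives 500x-6000x; the slide quotes 1500x-5000x.
assert ratio_range(gpu_ram, ssd) == (500, 6000)
```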
  6. © MapD 2018 | The GPU Open Analytics Initiative (GOAI): creating common data frameworks to accelerate data science on GPUs. Repos: /mapd/pymapd, /gpuopenanalytics/pygdf
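The two repositories named on this slide are the GOAI building blocks for Python. A minimal sketch of how they fit together (not runnable as shown: it assumes a reachable MapD server with an NVIDIA GPU, and the credentials, table, and column names are placeholders, not from the talk):

```python
# Sketch only: requires a GPU-backed MapD server; all names below are placeholders.
import pymapd

con = pymapd.connect(user="mapd", password="changeme",
                     host="localhost", dbname="mapd")

# select_ipc_gpu returns the result set as a pygdf GPU data frame,
# delivered over Arrow IPC so the data stays in GPU memory.
gdf = con.select_ipc_gpu("SELECT feature_1, feature_2, label FROM flights")

# The same device memory can now feed a GOAI-aware ML library
# (e.g. an H2O GLM) without a round trip through host RAM.
```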
  7. © MapD 2018 | Machine Learning Pipeline. Personas in the analytics lifecycle (illustrative): Business Analyst, Data Scientist, Data Engineer, IT Systems Admin. Pipeline stages (Data Scientist / Business Analyst, running on GPUs): Data Preparation, Data Discovery & Feature Engineering, Model & Validate, Predict, Operationalize, Monitoring & Refinement, Evaluate & Decide.
  8. © MapD 2018 | ML Examples: We've published a few notebooks showing how to connect to a MapD database and use an ML algorithm to make predictions. We've also shared a real-world example of churn, which we implemented with VW. Repos: /gpuopenanalytics/demo-docker, /mapd/mapd-ml-demo
  9. © MapD 2018 | Next Steps:
     - mapd.com/demos: play with our demos
     - mapd.com/platform/download-community/: get our free Community Edition and start playing
     - mapd.com/cloud: get your own instance in 60 seconds
     - community.mapd.com: ask questions and share your experiences
  10. © MapD 2018 | Aaron Williams, VP of Global Community | @_arw_ | [email protected] | /in/aaronwilliams/ | /williamsaaron | slides: https://speakerdeck.com/mapd | Thank you! Any questions?