Upgrade to Pro — share decks privately, control downloads, hide ads and more …

CF Summit - Data Science on Cloud Foundry

CF Summit - Data Science on Cloud Foundry

Talk by Ian Huston and Alexander Kagoshima at CFSummit 2015
Video: https://www.youtube.com/watch?v=n95hCVvuPKQ

Data Scientists frequently need to create applications that enable interactive data exploration, deliver predictive analytics APIs or simply publish results. Cloud Foundry provides an ideal platform for data scientists by making it easy to quickly deploy data driven apps backed by a variety of data stores. In this talk, Ian Huston will outline how to use Cloud Foundry for data science, describe how CF has been used in customer projects, explain why data services are essential, and discuss how community buildpacks enable data scientists to use their familiar R and Python Data packages with CF.

Ian Huston

May 11, 2015
Tweet

More Decks by Ian Huston

Other Decks in Technology

Transcript

  1. Who are we? •  Data Scientists at Pivotal Labs • 

    Using Cloud Foundry since 2013 •  Working with enterprises to get value out of their data
  2. Data Scientist (n.): Person who is better at statistics than

    any software engineer and better at software engineering than any statistician. - Josh Wills
  3. Data Services Bind and scale system services –  Databases, NoSQL,

    message queues etc. $  cf  create-­‐service  rediscloud  PLAN_NAME   INSTANCE_NAME   $  cf  bind-­‐service  APP_NAME  INSTANCE_NAME     Add User Provided Services –  Standalone Hadoop or Apache Spark cluster, Big Data System $  cf  cups  SERVICE_INSTANCE  -­‐p  "host,   port,  username,  password"       Data Service App App App App App App
  4. Siloed Data Siloed Systems Distributed Big Data Platform HOW TO

    DEPLOY MODELS? Data Extract ? (Model development happens here!) (Business needs model predictions here!)
  5. R E S T A P I Send data as

    JSON Data Ingest Model Create Model Redis Kicking off periodic retraining Save training data Save model object Send JSON data without label Receive prediction from trained model instance Deployed at: http://dsoncf.cfapps.io Code: https://github.com/pivotalsoftware/ds-cfpylearning PREDICTION API ARCHITECTURE $  cf  create-­‐service   rediscloud   PLAN_NAME   INSTANCE_NAME  
  6. R E S T A P I Send data as

    JSON Data Ingest Model Create Model Redis Kicking off periodic retraining Save training data Save model object Send JSON data without label Receive prediction from trained model instance Deployed at: http://dsoncf.cfapps.io Code: https://github.com/pivotalsoftware/ds-cfpylearning Visualization PREDICTION API ARCHITECTURE
  7. Data Services Bind and scale system services –  Databases, NoSQL,

    message queues etc. $  cf  create-­‐service  rediscloud  PLAN_NAME  INSTANCE_NAME   $  cf  bind-­‐service  APP_NAME  INSTANCE_NAME     Add User Provided Services –  Standalone Hadoop or Apache Spark cluster, Big Data System $  cf  cups  SERVICE_INSTANCE  -­‐p  "host,  port,  username,   password"