Upgrade to Pro — share decks privately, control downloads, hide ads and more …

Anaconda Fusion

Anaconda Fusion

Leveraging Jupyter to build an Excel-Python bridge, JupyterCON 2017

6cc5be6a122c6e768981003fd2e24789?s=128

Christine Doig

August 24, 2017
Tweet

Transcript

  1. © 2016 Continuum Analytics - Confidential & Proprietary © 2016

    Continuum Analytics - Confidential & Proprietary Leveraging Jupyter to build an Excel-Python bridge JupyterCON 2017 Christine Doig, Senior Product Manager & Data Scientist Fabio Pliger, Tech Lead August 24th, 2017
  2. 2 Overview & Examples

  3. © 2016 Continuum Analytics - Confidential & Proprietary 3 How

    to increase by 100x the impact of data science in your organization? Data Scientist Business Analysts Jupyter Users Est. 3-6 million Excel Users Est. 750 Million
  4. © 2016 Continuum Analytics - Confidential & Proprietary 4 Business

    analysts vs Data scientists Works with: Excel, Tableau, SQL Python, Hadoop, Spark Data: spreadsheets, tables dataframes, arrays Delivers: Reports, dashboards, spreadsheets Notebooks, code, interactive visualizations
  5. © 2016 Continuum Analytics - Confidential & Proprietary 5 Business

    analysts are being left out of the data science revolution 5 Big Data & ETL Interactive Data Visualizations Machine Learning Statistics and Advanced Analytics
  6. © 2016 Continuum Analytics - Confidential & Proprietary 6 Anaconda

    Fusion is a bridge between Excel & Python 6 Big Data & ETL Interactive Data Visualizations Machine Learning Statistics and Advanced Analytics
  7. © 2016 Continuum Analytics - Confidential & Proprietary 7 Analysts

    and Data Scientists can keep using their preferred tools 7
  8. © 2016 Continuum Analytics - Confidential & Proprietary 8 Self-service

    Big Data analytics Head node Compute nodes Jupyter notebook Interactive Data Visualizations Machine Learning Predictions Extract, transform and query data
  9. © 2016 Continuum Analytics - Confidential & Proprietary 9 “No

    Code” Data Science Example 1 2 Select Anaconda Fusion Notebook and click “Upload” Select function you wish to run Click “Run” Data is loaded into spreadsheet 3 4
  10. © 2016 Continuum Analytics - Confidential & Proprietary 10 Just

    change one line of code in your notebook
  11. © 2016 Continuum Analytics - Confidential & Proprietary 11 •

    Extract data - pull data directly into Excel to perform analysis • Machine Learning – use trained models created by Data Scientists and plug them into your spreadsheet data • Interactive Visualizations – create custom advanced interactive graphs, charts and plots from Excel data • Big Data – analyze, transform, model and query data stored in Hadoop and Spark Figure: Anaconda Fusion on Mac Anaconda Fusion use cases
  12. © 2016 Continuum Analytics - Confidential & Proprietary 12 12

    • Run Queries • Run Predictive Models • Running Big Data Text Analytics Examples
  13. © 2016 Continuum Analytics - Confidential & Proprietary 13

  14. © 2016 Continuum Analytics - Confidential & Proprietary 14

  15. © 2016 Continuum Analytics - Confidential & Proprietary 15

  16. 16 Features & architecture

  17. © 2016 Continuum Analytics - Confidential & Proprietary 17 Features

    – Formula bar
  18. © 2016 Continuum Analytics - Confidential & Proprietary 18 Features

    – Write back to Excel
  19. © 2016 Continuum Analytics - Confidential & Proprietary 19 Features

    – Interactive visualizations
  20. © 2016 Continuum Analytics - Confidential & Proprietary 20 Features

    – Save custom advanced visualizations
  21. © 2016 Continuum Analytics - Confidential & Proprietary 21 •

    OSS – base of most successful modern software • Maturity – long history • Diversity • 100s of projects • 1000s of contributors • Vision • Jupyterlab • Community & Support • Popularity Jupyter as a Platform
  22. © 2016 Continuum Analytics - Confidential & Proprietary 22 •

    The jupyter ecosystem • https://github.com/jupyter • https://github.com/jupyterlab • https://github.com/phosphorjs • Great community/support • Very pluggable* • Perfect for our use case • I.e.: why can’t excel have ML? • i.e.: why can’t excel do things that numpy/pandas do? • i.e.: we need better graphics (ala bokeh ;) ) for a dashboard of our metrics in excel Jupyter As a Tech Choice
  23. © 2016 Continuum Analytics - Confidential & Proprietary 23 Use

    Case Excel fusion Fusion Server Notebook kernels
  24. © 2016 Continuum Analytics - Confidential & Proprietary 24 Use

    case Excel fusion Anaconda Platform Notebooks, Apps, … kernels API Apps
  25. © 2016 Continuum Analytics - Confidential & Proprietary 25 A

    marketplace for Anaconda Fusion notebooks https://anaconda.org
  26. © 2016 Continuum Analytics - Confidential & Proprietary Christine Doig

    cdoig@continuum.io @ch_doig Fabio Pliger fpliger@continuum.io @b_smoke THANK YOU!
  27. © 2016 Continuum Analytics - Confidential & Proprietary QUESTIONS?