Slide 1

Slide 1 text

@WillingCarol Start Small and Scale Carol Willing PyData LA December 5, 2019 https://speakerdeck.com/willingc Big Data and Jupyter's Ecosystem

Slide 2

Slide 2 text

@WillingCarol Hi! I'm Carol. • Python Steering Council • Jupyter Steering Council • Core Developer, Python, Jupyter, nteract • PSF Fellow and Former Director • Frank Willison Award 2019 • Open Source Directions Podcast Co- host 2

Slide 3

Slide 3 text

@WillingCarol 3 Core maintainer Papermill, Scrapbook, Bookstore, Commuter Steering Council, Core Developer JupyterHub, BinderHub, mybinder.org I love creating tools which educate and empower people.

Slide 4

Slide 4 text

@WillingCarol What is Data Science 4 Start Here

Slide 5

Slide 5 text

@WillingCarol 5 Using data responsibly to solve real world issues and improve human lives

Slide 6

Slide 6 text

@WillingCarol Predictions at Scale 6 A real world tale

Slide 7

Slide 7 text

@WillingCarol 7 San Diego, CA

Slide 8

Slide 8 text

@WillingCarol 8 Tokyo

Slide 9

Slide 9 text

@WillingCarol 9 Sunday Oct 6 Source: ECMWF

Slide 10

Slide 10 text

Copyright: 2019 European Union, contains modified Copernicus Sentinel data 2019, processed by EUMETSAT Super Typhoon Hagibis View of Super Typhoon Hagibis south-west of Japan, as captured by the Copernicus Sentinel-3 satellite on 08 October at 00:16 UTC.

Slide 11

Slide 11 text

Title Typhoon Hagibis Released 10/10/2019 4:45 pm Copyright contains modified Copernicus Sentinel data (2019), processed by ESA, CC BY-SA 3.0 IGO

Slide 12

Slide 12 text

Source:Twitter

Slide 13

Slide 13 text

@WillingCarol 13

Slide 14

Slide 14 text

@WillingCarol 14

Slide 15

Slide 15 text

A sign is partially submerged as the Tama River floods during Typhoon Hagibis. Source:Getty Images Source:Japan Times

Slide 16

Slide 16 text

@WillingCarol Preparation Evacuation Safety 16

Slide 17

Slide 17 text

@WillingCarol With scale comes responsibility 17 Reality Check

Slide 18

Slide 18 text

@WillingCarol 18 "Diabetes awareness: character design" by Carlos Hernandez is licensed under CC BY-NC-ND 4.0

Slide 19

Slide 19 text

@WillingCarol –Kevin Sayer, DexCom CEO This whole integration of health care data is really going to be the next frontier. 19 https://www.cnbc.com/2019/11/13/big-data-is-the-next-frontier-for-medicine-says-dexcom-ceo.html https://www.businesswire.com/news/home/20191106005764/en/Dexcom-Reports-Quarter-2019-Financial-Results

Slide 20

Slide 20 text

@WillingCarol Outage Midnight Friday: mysterious outage Dexcom did not announce there was an outage until about 8 a.m. Pacific time Saturday, which is 11 a.m. on the East Coast, when it posted a brief notice on its Facebook page. Monday morning: Dexcom Follow partly restored 20 https://www.nytimes.com/2019/12/02/well/live/Dexcom-G6-diabetes-monitor-outage.html Source: https://www.dexcom.com/

Slide 21

Slide 21 text

@WillingCarol Getting from Start to Scale 21 The Challenge

Slide 22

Slide 22 text

@WillingCarol 22 jupyter.org

Slide 23

Slide 23 text

@WillingCarol 23 2014 Now, 5 years later... 23

Slide 24

Slide 24 text


 Millions of Notebooks https://github.com/trending/jupyter-notebook Over 5 million on GitHub

Slide 25

Slide 25 text

@WillingCarol 25 ‣ Growth ‣ ACM Award ‣ Industry adoption ‣ Creative uses ‣ Open Source Book https://www.youtube.com/watch?v=qbtDVdEr8SY

Slide 26

Slide 26 text

jupyter.org

Slide 27

Slide 27 text

@WillingCarol 27 Start Small Deploy and Sustain Explore Paths to Scale The Roadmap

Slide 28

Slide 28 text

@WillingCarol Start Small 28 Step 1

Slide 29

Slide 29 text

@WillingCarol 29 Source: xkcd

Slide 30

Slide 30 text

@WillingCarol 30 small

Slide 31

Slide 31 text

@WillingCarol 31 Binder 2.0 blog post elifesciences: Share your interactive research environment Nature article about Binder 31 mybinder.org Try it. No install needed.

Slide 32

Slide 32 text

@WillingCarol 32 Scale in Production

Slide 33

Slide 33 text

@WillingCarol Choose your Tools 33

Slide 34

Slide 34 text

@WillingCarol JupyterLab 34

Slide 35

Slide 35 text

35 jupyter.org demo

Slide 36

Slide 36 text

36 jupyter.org demo

Slide 37

Slide 37 text

@WillingCarol 37 ReactJS front end nteract nteract.io

Slide 38

Slide 38 text

@WillingCarol VS Code 38 PyCharm

Slide 39

Slide 39 text

@WillingCarol Avoid reinventing the wheel 39

Slide 40

Slide 40 text

@WillingCarol ecosystem 40

Slide 41

Slide 41 text

@WillingCarol Install Promising Libraries 41 Use Anaconda Use pip Use miniconda, conda, and conda-forge

Slide 42

Slide 42 text

@WillingCarol Start 42 Try it in the browser Install Libraries Choose your tools Avoid reinventing the wheel Step 1

Slide 43

Slide 43 text

@WillingCarol Explore Paths to Scale 43 Step 2

Slide 44

Slide 44 text

@WillingCarol Turn no way into it's possible 44

Slide 45

Slide 45 text

@WillingCarol Community 45 Conferences Meetups PyLadies Carpentries Photo: Python Sul

Slide 46

Slide 46 text

@WillingCarol 46 Ten Simple Rules for Reproducible Research in Jupyter Notebooks Adam Rule et al. https://github.com/jupyter-guide/ ten-rules-jupyter https://github.com/jupyter-guide/ jupyter-guide

Slide 47

Slide 47 text

@WillingCarol 47 build, try, change, repeat

Slide 48

Slide 48 text

@WillingCarol 48 ipyvolume https://towardsdatascience.com/multivolume- rendering-in-jupyter-with-ipyvolume-cross- language-3d-visualization-64389047634a

Slide 49

Slide 49 text

@WillingCarol 49

Slide 50

Slide 50 text

@WillingCarol 50 github.com/napari/napari napari https://ilovesymposia.com/2019/10/24/introducing-napari- a-fast-n-dimensional-image-viewer-in-python/

Slide 51

Slide 51 text

A pictorial representation of the different tools constituting BinderHub. This image was created by Scriberia for The Turing Way community and is used under a CC-BY licence. Zenodo record. https://blog.jupyter.org/diving-into- leadership-to-build-push-button-code- df2a075c9914

Slide 52

Slide 52 text

@WillingCarol What's new 52 Talk Python to Me Tracking Jupyter Newsletter https://tinyletter.com/TrackingJupyter/archive Open Source Directions GitHub Trending Follow projects on Social Media

Slide 53

Slide 53 text

@WillingCarol Explore 53 Use the ecosystem to learn Best practices Infrastructure/Analysis What's new Step 2

Slide 54

Slide 54 text

@WillingCarol Deploy and Sustain 54 Step 3

Slide 55

Slide 55 text

@WillingCarol 55 "Digital World" by NBroekzitter86 is licensed under CC BY 2.0

Slide 56

Slide 56 text

@WillingCarol Notebooks to web 56 https://blog.jupyter.org/and- voil%C3%A0-f6a2c08a4a93

Slide 57

Slide 57 text

@WillingCarol 57 nteract Papermill Scrapbook Bookstore Commuter Production data at scale 57 https://medium.com/netflix-techblog/notebook-innovation-591ee3221233 Bookstore

Slide 58

Slide 58 text

@WillingCarol 58 Papermill - parameterize / run Scrapbook - recording / reading Bookstore - store notebooks Commuter - share notebooks Production data at scale 58

Slide 59

Slide 59 text

@WillingCarol 59 Enterprise data workflows 59

Slide 60

Slide 60 text

zero-to-jupyterhub.readthedocs.io

Slide 61

Slide 61 text

@WillingCarol 61 61 Deploy your own BinderHub

Slide 62

Slide 62 text

62 Juliette Taka

Slide 63

Slide 63 text

@WillingCarol 63 From a phone in the park! 63

Slide 64

Slide 64 text

@WillingCarol Pangeo 64 https://pangeo.io

Slide 65

Slide 65 text

@WillingCarol 65

Slide 66

Slide 66 text

@WillingCarol Deploy and Sustain 66 Workflows Document Monitor Involvement Step 3

Slide 67

Slide 67 text

@WillingCarol 67 From Small to Scale Keys for Success

Slide 68

Slide 68 text

@WillingCarol Choose to Start 68

Slide 69

Slide 69 text

@WillingCarol Why > how 69

Slide 70

Slide 70 text

@WillingCarol Automate the Boring Stuff 70

Slide 71

Slide 71 text

@WillingCarol Plan Execute Change 71 https://jupyterhub-team-compass.readthedocs.io https://github.com/jupyterhub/team-compass

Slide 72

Slide 72 text

@WillingCarol Consider complexity and observability 72

Slide 73

Slide 73 text

@WillingCarol People = Responsibility 73

Slide 74

Slide 74 text

@WillingCarol 74 Call to Action

Slide 75

Slide 75 text

@WillingCarol ecosystem 75

Slide 76

Slide 76 text

@WillingCarol 76 Using data responsibly to solve real world issues and improve human lives

Slide 77

Slide 77 text

@WillingCarol 77 Justine Dupont surfs the greatest wave of her life in Nazaré, Portugal © Rafael G. Riancho / Red Bull Content Pool

Slide 78

Slide 78 text

@WillingCarol 78 Thank you https://speakerdeck.com/willingc @WillingCarol

Slide 79

Slide 79 text

@WillingCarol 79 Questions https://speakerdeck.com/willingc

Slide 80

Slide 80 text

@WillingCarol 80 Thank you PyData LA Project Jupyter Team Core Python Team PSF NumFOCUS

Slide 81

Slide 81 text

@WillingCarol Attributions 81 Attributions on slides. Photos [7-8] Carol Willing and Linnea Willing [14] The Carpentries, Tracy Teal, Bérénice Batut [14] Godzilla By Toho Company Ltd. (東宝株式会社, Tōhō Kabushiki-kaisha) © 1954 - movie poster made by Toho Company Ltd. (東宝株式会社, Tōhō Kabushiki-kaisha), Public Domain, https://commons.wikimedia.org/w/index.php?curid=3648684