Jupyter in Production

whois Patrick Harrison

whois Patrick Harrison Data Theoretic

whois Patrick Harrison Data Theoretic Previously: Led AI Engineering at
a major fi nancial data company

Source: https://ipython.org/ipython-doc/rel-0.12/whatsnew/version0.12.html

Jupyter Notebooks just turned ten years old Source: https://ipython.org/ipython-doc/rel-0.12/whatsnew/version0.12.html

Jupyter Notebooks just turned ten years old The original IPython
Notebook was fi rst released on December 19, 2011 Source: https://ipython.org/ipython-doc/rel-0.12/whatsnew/version0.12.html

Source: https://github.com/parente/nbestimate/blob/master/estimate.ipynb Public Jupyter Notebooks on GitHub

Source: https://github.com/parente/nbestimate/blob/master/estimate.ipynb Public Jupyter Notebooks on GitHub ≈0

Source: https://github.com/parente/nbestimate/blob/master/estimate.ipynb Public Jupyter Notebooks on GitHub ≈0 ≈10,000,000

8,000+ new public Jupyter Notebooks posted on GitHub every day
in 2022, on average Source: https://github.com/parente/nbestimate/blob/master/ipynb_counts.csv

Jupyter Notebooks have been used to do some amazing things

Source: https://blog.jupyter.org/congratulations-to-the-ligo-and-virgo-collaborations-from-project-jupyter-5923247be019 On behalf of the entire Project Jupyter team,
we’d like to say congratulations to Rainer Weiss, Barry C. Barish, Kip S. Thorne and the rest of the LIGO and VIRGO teams for the Nobel Prize in Physics 2017. Since 2015, the LIGO and VIRGO Collaborations have observed multiple instances of gravitational waves due to colliding black holes (and more recently neutron stars). These observations represent decades of work and confirm what Einstein had theorized a hundred years ago. ... To communicate to the broader community, the LIGO/VIRGO Collaboration has created tutorials with Jupyter Notebooks that describe how to use LIGO/ VIRGO data and reproduce analyses related to their academic publications.

Source: https://blog.jupyter.org/jupyter-receives-the-acm-software-system-award-d433b0dfe3a2 It is our pleasure to announce that Project
Jupyter has been awarded the 2017 ACM Software System Award, a significant honor for the project. We are humbled to join an illustrious list of projects that contains major highlights of computing history, including Unix, TeX, S (R’s predecessor), the Web, Mosaic, Java, INGRES (modern databases) and more.

Jupyter Notebooks have some compelling strengths

Interactive, exploratory programming with immediate feedback #1

Build a computational narrative bringing together code, results, explanatory prose,
plots, images, widgets, and more in a single, human-friendly document #2

Lower barriers to entry

...many more people and roles can access, use, and collaborate
on programming and data analysis in their work Lower barriers to entry

Increased productivity

Increased productivity ...for programmers of all skill levels

"We’ve found that we’re 2x-3x more productive using [notebook-based development]
than using traditional programming tools... Source: https://www.fast.ai/2019/12/02/nbdev/

than using traditional programming tools... ...this is a big surprise, since I have coded nearly every day for over 30 years, and in that time have tried dozens of tools, libraries, and systems for building programs." Source: https://www.fast.ai/2019/12/02/nbdev/

than using traditional programming tools... ...this is a big surprise, since I have coded nearly every day for over 30 years, and in that time have tried dozens of tools, libraries, and systems for building programs." Source: https://www.fast.ai/2019/12/02/nbdev/ — Jeremy Howard, fast.ai

Jupyter Notebooks have become an essential part of the data
scientist's toolkit

But, a story you've probably heard before...

The magic words...

"Let's put this in production" The magic words...

"You can't use Jupyter Notebooks in production"

Why not?

"It's not supported."

This is a pain to version control.

This is a pain to version control. This is monolithic.
How will we collaborate effectively?

How will we collaborate effectively? How can we share and reuse this code?

How will we collaborate effectively? How can we share and reuse this code? How do we apply our code quality standards?

How will we collaborate effectively? How can we share and reuse this code? How do we apply our code quality standards? How do we test this code?

How will we collaborate effectively? How can we share and reuse this code? How do we apply our code quality standards? How do we test this code? Will this work with our continuous integration system?

How will we collaborate effectively? How can we share and reuse this code? How do we apply our code quality standards? How do we test this code? Will this work with our continuous integration system? How do we schedule and trigger automatic execution?

How will we collaborate effectively? How can we share and reuse this code? How do we apply our code quality standards? How do we test this code? Will this work with our continuous integration system? How do we schedule and trigger automatic execution? Out-of-order cell execution!

How will we collaborate effectively? How can we share and reuse this code? How do we apply our code quality standards? How do we test this code? Will this work with our continuous integration system? How do we schedule and trigger automatic execution? Out-of-order cell execution! ...

OK, how should we get this work into production?

OK, how should we get this work into production? “It
looks like there's a lot going on in your notebook…"

Your notebook has reusable code... How should we get this
work into production?

Your notebook has reusable code... ... you're going to need
to reimplement this code as proper software libraries, How should we get this work into production?

to reimplement this code as proper software libraries, ... subject to our company-wide software engineering standards, How should we get this work into production?

to reimplement this code as proper software libraries, ... subject to our company-wide software engineering standards, ... with reimplemented tests using our company's preferred testing framework, How should we get this work into production?

to reimplement this code as proper software libraries, ... subject to our company-wide software engineering standards, ... with reimplemented tests using our company's preferred testing framework, ... using our preferred enterprise continuous integration system, How should we get this work into production?

to reimplement this code as proper software libraries, ... subject to our company-wide software engineering standards, ... with reimplemented tests using our company's preferred testing framework, ... using our preferred enterprise continuous integration system, ... and deploy to our preferred enterprise artifact repository. How should we get this work into production?

Your notebook is accessing and transforming data... How should we
get this work into production?

Your notebook is accessing and transforming data... ... you're going
to need to reimplement this logic as data pipelines in our preferred enterprise data pipeline framework, How should we get this work into production?

to need to reimplement this logic as data pipelines in our preferred enterprise data pipeline framework, ... which has its own engineering practices and conventions, How should we get this work into production?

to need to reimplement this logic as data pipelines in our preferred enterprise data pipeline framework, ... which has its own engineering practices and conventions, ... and may not even use the same programming language. How should we get this work into production?

Your notebook generates predictions... How should we get this work
into production?

Your notebook generates predictions... ... you're going to need to
reimplement the model as a web service, How should we get this work into production?

reimplement the model as a web service, ... wrap it in a Docker container, How should we get this work into production?

reimplement the model as a web service, ... wrap it in a Docker container, ... store it in our preferred enterprise container registry, How should we get this work into production?

reimplement the model as a web service, ... wrap it in a Docker container, ... store it in our preferred enterprise container registry, ... and deploy it to our preferred enterprise container orchestration platform. How should we get this work into production?

Your notebook presents results to end users... How should we
get this work into production?

Your notebook presents results to end users... ... you're going
to need to reimplement these reports in our preferred enterprise business intelligence platform, How should we get this work into production?

to need to reimplement these reports in our preferred enterprise business intelligence platform, ... which has its own engineering practices and conventions, How should we get this work into production?

to need to reimplement these reports in our preferred enterprise business intelligence platform, ... which has its own engineering practices and conventions, ... and may not even use the same programming language. How should we get this work into production?

So you're telling me that if we're going to get
our work in production, either:

our work in production, either: 1. Our data science teams have to be stacked with unicorns,

our work in production, either: 1. Our data science teams have to be stacked with unicorns, or

our work in production, either: 1. Our data science teams have to be stacked with unicorns, or 2. We have to loop in a bunch of other teams and create dependencies between them

My teams went through this process so many times we
had a name for it

de • notebook • i fi cation

de • notebook • i fi cation The long, painful
process of exploding a Jupyter Notebook that de fi nitely works into a constellation of disparate production artifacts that maybe don't

⚠ WARNING: De-notebook-i fi cation has been shown to have
side effects including increased complexity, elongated timelines, unhappy stakeholders, frustrated data scientists, increased risk of project cancelation, and loss of data science team credibility.

Additional problem:

Additional problem: If Jupyter is only for demos and prototypes...

Additional problem: If Jupyter is only for demos and prototypes...
Why bother writing good code in notebooks?

"Maybe you shouldn't use Jupyter in the fi rst place"

"Maybe you shouldn't use Jupyter in the fi rst place"
There has to be a better answer

enter the Jupyter in Production ecosystem

But fi rst... what does in production mean, anyway?

For this talk, we'll focus on: What does in production
mean, anyway?

For this talk, we'll focus on: •Developing and distributing software
libraries What does in production mean, anyway?

libraries •Building and running data pipelines What does in production mean, anyway?

libraries •Building and running data pipelines •Creating interactive reports and dashboards What does in production mean, anyway?

For each of these tools, I'll try to answer...

... what is it? For each of these tools, I'll
try to answer...

... what is it? ... what do I have to
do to use it? For each of these tools, I'll try to answer...

... what is it? ... what do I have to
do to use it? ... what's in it for me? For each of these tools, I'll try to answer...

Developing and distributing software libraries

nbdev •Initial Release: 2019 •GitHub Stars: 3.2k 🌟 •GitHub: https://github.com/fastai/nbdev/

What is it? nbdev

A collection of tools that let you use Jupyter Notebooks
as the source code for Python software libraries nbdev

What do I have to do to use it? nbdev

Setup • pip install nbdev or conda install nbdev -c
fastai nbdev

fastai • Initialize your git repository as an nbdev project: nbdev_new   (Or, copy the of fi cial nbdev template repo on GitHub) nbdev

fastai • Initialize your git repository as an nbdev project: nbdev_new   (Or, copy the of fi cial nbdev template repo on GitHub) • Install the nbdev git hooks: nbdev_install_git_hooks nbdev

fastai • Initialize your git repository as an nbdev project: nbdev_new   (Or, copy the of fi cial nbdev template repo on GitHub) • Install the nbdev git hooks: nbdev_install_git_hooks • Enter some basic project information in settings.ini nbdev

Basic Usage • Start with exploratory programming in Jupyter Notebooks,
as usual nbdev

as usual • As you go, notice when it would make sense to reuse or share bits of the code you write nbdev

as usual • As you go, notice when it would make sense to reuse or share bits of the code you write • Reshape this code into functions and classes in a notebook nbdev

as usual • As you go, notice when it would make sense to reuse or share bits of the code you write • Reshape this code into functions and classes in a notebook • Add the #export fl ag (code comment) at the start of your main code cells nbdev

as usual • As you go, notice when it would make sense to reuse or share bits of the code you write • Reshape this code into functions and classes in a notebook • Add the #export fl ag (code comment) at the start of your main code cells • Next to your main code cells, add rich explanatory text, images, code usage examples, sample output, and assert statements nbdev

Source: https://nbdev.fast.ai/example.html

What's in it for me? nbdev

Quite a bit, actually. nbdev

Automatically export the code from your Jupyter Notebooks into a
fully-functional Python package: nbdev nbdev_build_lib

Automatically publish new releases of your package to PyPI and
conda: nbdev make release

Automatically generate a rich documentation website for your package from
your Jupyter Notebooks: nbdev nbdev_build_docs

Avoid common version control con fl icts and resolving them
when they occur: nbdev nbdev_clean_nbs & nbdev_fix_merge

Source: https://nbdev.fast.ai/merge.html

Automatically run tests on your notebooks: nbdev nbdev_test_nbs

nbdev $ nbdev_test_nbs testing: card.ipynb testing: deck.ipynb All tests are
passing! Source: https://nbdev.fast.ai/tutorial.html

Continuous integration out-of-the-box with git hooks and GitHub Actions nbdev

Conceptual shift nbdev ⚠

With nbdev, your source code, tests, and documentation all live
together in one place nbdev

Source: https://nbdev.fast.ai/example.html Code

Source: https://nbdev.fast.ai/example.html Code Docs Docs

Source: https://nbdev.fast.ai/example.html Code Tests Docs Docs

"The magic of nbdev is that it doesn’t actually change
programming that much; you add a #export or #hide tag to your notebook cells once in a while, and you run nbdev_build_lib and nbdev_build_docs when you fi nish up your code.   Source: https://www.overstory.com/blog/how-nbdev-helps-us-structure-our-data-science-work fl ow-in-jupyter-notebooks nbdev

"The magic of nbdev is that it doesn’t actually change
programming that much; you add a #export or #hide tag to your notebook cells once in a while, and you run nbdev_build_lib and nbdev_build_docs when you fi nish up your code.   That’s it! There’s nothing new to learn, nothing to unlearn. It’s just notebooks." Source: https://www.overstory.com/blog/how-nbdev-helps-us-structure-our-data-science-work fl ow-in-jupyter-notebooks nbdev

“[nbdev] incentives us to write clear code, use proper Git
version control and document and test our codebase continuously... [while] preserving the bene fi ts of having interactive Jupyter notebooks in which it is easy to experiment." Source: https://www.overstory.com/blog/how-nbdev-helps-us-structure-our-data-science-work fl ow-in-jupyter-notebooks nbdev

“[nbdev] incentives us to write clear code, use proper Git
version control and document and test our codebase continuously... [while] preserving the bene fi ts of having interactive Jupyter notebooks in which it is easy to experiment." Source: https://www.overstory.com/blog/how-nbdev-helps-us-structure-our-data-science-work fl ow-in-jupyter-notebooks nbdev — Overstory

Bonus Picks

Visually compare notebook versions

nbdime and ReviewNB Visually compare notebook versions

Source: https://nbdime.readthedocs.io

Source: https://www.reviewnb.com/

Run your favorite code quality tools on notebooks

nbQA Run your favorite code quality tools on notebooks

$ nbqa black my_notebook.ipynb reformatted my_notebook.ipynb All done! ✨ 🍰
✨ 1 files reformatted. Source: https://nbqa.readthedocs.io/en/latest/examples.html nbQA

Building and running data pipelines

Source: https://docs.ploomber.io/en/latest/use-cases/ml.html

“We’re currently in the process of migrating all 10,000 of
the scheduled jobs running on the Net fl ix Data Platform to use notebook-based execution…   Source: https://net fl ixtechblog.com/scheduling-notebooks-348e6c14cfd6

the scheduled jobs running on the Net fl ix Data Platform to use notebook-based execution…   When we’re done, more than 150,000 [pipeline executions] will be running through notebooks on our platform every single day.” Source: https://net fl ixtechblog.com/scheduling-notebooks-348e6c14cfd6

the scheduled jobs running on the Net fl ix Data Platform to use notebook-based execution…   When we’re done, more than 150,000 [pipeline executions] will be running through notebooks on our platform every single day.” Source: https://net fl ixtechblog.com/scheduling-notebooks-348e6c14cfd6 — Net fl ix (2018)

ploomber •Initial Release: 2020 •GitHub Stars: 2.3k 🌟 •GitHub: https://github.com/ploomber/ploomber

What is it? ploomber

A framework to build and execute data pipelines made out
of Jupyter Notebooks ploomber

What do I have to do to use it? ploomber

Setup • pip install ploomber or   conda install ploomber
-c conda-forge ploomber

-c conda-forge • Initialize your git repository as a ploomber project: ploomber

-c conda-forge • Initialize your git repository as a ploomber project: • ploomber scaffold --empty ploomber

-c conda-forge • Initialize your git repository as a ploomber project: • ploomber scaffold --empty • Add information about your pipeline to pipeline.yaml as you go ploomber

as usual ploomber

as usual • As you go, notice when chunks of your code would make sense as modular "tasks" in a data transformation work fl ow ploomber

as usual • As you go, notice when chunks of your code would make sense as modular "tasks" in a data transformation work fl ow • Move the code for each task into its own dedicated notebook ploomber

as usual • As you go, notice when chunks of your code would make sense as modular "tasks" in a data transformation work fl ow • Move the code for each task into its own dedicated notebook • Next to your code cells, add rich explanatory text, images, example expected output, and data quality checks ploomber

Basic Usage • Record information about your task notebooks in
pipeline.yaml ploomber

pipeline.yaml • Add a few variables to your task notebooks to de fi ne upstream dependencies ploomber

pipeline.yaml • Add a few variables to your task notebooks to de fi ne upstream dependencies • Run your pipeline with ploomber build ploomber

Source: https://docs.ploomber.io/en/latest/get-started/basic-concepts.html

Source: https://docs.ploomber.io/en/latest/get-started/basic-concepts.html .ipynb

Source: https://docs.ploomber.io/en/latest/get-started/basic-concepts.html .ipynb .ipynb .ipynb

pipeline.yaml ploomber Source: https://docs.ploomber.io/en/latest/get-started/ fi rst-pipeline.html

pipeline.yaml tasks: ploomber Source: https://docs.ploomber.io/en/latest/get-started/ fi rst-pipeline.html

pipeline.yaml tasks: # source is the code you want to
execute   - source: raw.ipynb ploomber Source: https://docs.ploomber.io/en/latest/get-started/ fi rst-pipeline.html

execute   - source: raw.ipynb # products are task's outputs   product: ploomber Source: https://docs.ploomber.io/en/latest/get-started/ fi rst-pipeline.html

execute   - source: raw.ipynb # products are task's outputs   product: # tasks generate executed notebooks as outputs   nb: output/raw.ipynb ploomber Source: https://docs.ploomber.io/en/latest/get-started/ fi rst-pipeline.html

execute   - source: raw.ipynb # products are task's outputs   product: # tasks generate executed notebooks as outputs   nb: output/raw.ipynb # you can define as many outputs as you want   data: output/raw_data.csv   ploomber Source: https://docs.ploomber.io/en/latest/get-started/ fi rst-pipeline.html

execute   - source: raw.ipynb # products are task's outputs   product: # tasks generate executed notebooks as outputs   nb: output/raw.ipynb # you can define as many outputs as you want   data: output/raw_data.csv   - source: clean.ipynb ploomber Source: https://docs.ploomber.io/en/latest/get-started/ fi rst-pipeline.html

execute   - source: raw.ipynb # products are task's outputs   product: # tasks generate executed notebooks as outputs   nb: output/raw.ipynb # you can define as many outputs as you want   data: output/raw_data.csv   - source: clean.ipynb product: ploomber Source: https://docs.ploomber.io/en/latest/get-started/ fi rst-pipeline.html

execute   - source: raw.ipynb # products are task's outputs   product: # tasks generate executed notebooks as outputs   nb: output/raw.ipynb # you can define as many outputs as you want   data: output/raw_data.csv   - source: clean.ipynb product: nb: output/clean.ipynb ploomber Source: https://docs.ploomber.io/en/latest/get-started/ fi rst-pipeline.html

execute   - source: raw.ipynb # products are task's outputs   product: # tasks generate executed notebooks as outputs   nb: output/raw.ipynb # you can define as many outputs as you want   data: output/raw_data.csv   - source: clean.ipynb product: nb: output/clean.ipynb data: output/clean_data.parquet   ploomber Source: https://docs.ploomber.io/en/latest/get-started/ fi rst-pipeline.html

execute   - source: raw.ipynb # products are task's outputs   product: # tasks generate executed notebooks as outputs   nb: output/raw.ipynb # you can define as many outputs as you want   data: output/raw_data.csv   - source: clean.ipynb product: nb: output/clean.ipynb data: output/clean_data.parquet   - source: plot.ipynb ploomber Source: https://docs.ploomber.io/en/latest/get-started/ fi rst-pipeline.html

execute   - source: raw.ipynb # products are task's outputs   product: # tasks generate executed notebooks as outputs   nb: output/raw.ipynb # you can define as many outputs as you want   data: output/raw_data.csv   - source: clean.ipynb product: nb: output/clean.ipynb data: output/clean_data.parquet   - source: plot.ipynb product: output/plot.ipynb ploomber Source: https://docs.ploomber.io/en/latest/get-started/ fi rst-pipeline.html

$ ploomber build ploomber Source: https://docs.ploomber.io/en/latest/get-started/ fi rst-pipeline.html

?it/s]   Executing: 0%| | 0/6 [00:00<?, ?cell/s]   Executing: 17%|█▋ | 1/6 [00:04<00:21, 4.25s/cell]   Executing: 33%|███▎ | 2/6 [00:04<00:07, 1.82s/cell]   Executing: 100%|██████████| 6/6 [00:05<00:00, 1.11cell/s] Building task 'clean': 20%|██ | 1/5 [00:05<00:21, 5.47s/it]   Executing: 0%| | 0/7 [00:00<?, ?cell/s]   Executing: 14%|█▍ | 1/7 [00:01<00:10, 1.76s/cell]   Executing: 43%|████▎ | 3/7 [00:23<00:34, 8.63s/cell]   Executing: 71%|███████▏ | 5/7 [00:25<00:09, 4.69s/cell]   Executing: 86%|████████▌ | 6/7 [00:28<00:04, 4.14s/cell]   Executing: 100%|██████████| 7/7 [00:29<00:00, 4.24s/cell] ploomber Source: https://docs.ploomber.io/en/latest/get-started/ fi rst-pipeline.html

?it/s]   Executing: 0%| | 0/6 [00:00<?, ?cell/s]   Executing: 17%|█▋ | 1/6 [00:04<00:21, 4.25s/cell]   Executing: 33%|███▎ | 2/6 [00:04<00:07, 1.82s/cell]   Executing: 100%|██████████| 6/6 [00:05<00:00, 1.11cell/s] Building task 'clean': 20%|██ | 1/5 [00:05<00:21, 5.47s/it]   Executing: 0%| | 0/7 [00:00<?, ?cell/s]   Executing: 14%|█▍ | 1/7 [00:01<00:10, 1.76s/cell]   Executing: 43%|████▎ | 3/7 [00:23<00:34, 8.63s/cell]   Executing: 71%|███████▏ | 5/7 [00:25<00:09, 4.69s/cell]   Executing: 86%|████████▌ | 6/7 [00:28<00:04, 4.14s/cell]   Executing: 100%|██████████| 7/7 [00:29<00:00, 4.24s/cell] Building task ‘plot': 40%|████ | 2/5 [00:35<00:59, 19.75s/it]   Executing: 0%| | 0/9 [00:00<?, ?cell/s]   Executing: 11%|█ | 1/9 [00:02<00:22, 2.80s/cell]   Executing: 33%|███▎ | 3/9 [00:02<00:04, 1.28cell/s]   Executing: 56%|█████▌ | 5/9 [00:03<00:01, 2.42cell/s]   Executing: 100%|██████████| 9/9 [00:03<00:00, 2.26cell/s] ploomber Source: https://docs.ploomber.io/en/latest/get-started/ fi rst-pipeline.html

What's in it for me? ploomber

A human-friendly computational narrative of every pipeline execution ploomber

“[W]e’ve gained a key improvement over a non-notebook execution pattern:
our input and outputs are complete documents, wholly executable and shareable in the same interface.” Source: https://net fl ixtechblog.com/scheduling-notebooks-348e6c14cfd6 — Net fl ix (2018)

Interactive pipeline inspection and debugging in Jupyter Notebooks ploomber

“Say something went wrong… How might we debug and fi
x the issue? The fi rst place we’d want to look is the notebook output. It will have a stack trace, and ultimately any output information related to an error…   Source: https://net fl ixtechblog.com/scheduling-notebooks-348e6c14cfd6 — Net fl ix (2018)

“Say something went wrong… How might we debug and fi
x the issue? The fi rst place we’d want to look is the notebook output. It will have a stack trace, and ultimately any output information related to an error…   [W]e simply take the output notebook with our exact failed runtime parameterizations and load it into a notebook server… With a few iterations… we can quickly fi nd a fi x for the failure. Source: https://net fl ixtechblog.com/scheduling-notebooks-348e6c14cfd6 — Net fl ix (2018)

Incremental builds ploomber

Source: https://docs.ploomber.io/en/latest/use-cases/ml.html

Test each stage of your data pipeline ploomber

Modular pipelines → collaborative development ploomber

Source: https://docs.ploomber.io/en/latest/use-cases/ml.html 👩💻

Source: https://docs.ploomber.io/en/latest/use-cases/ml.html 👨💻 👩💻

Source: https://docs.ploomber.io/en/latest/use-cases/ml.html 👩💻 👨💻 🧑💻

Automated deployment   to Air fl ow, AWS Batch, or
Kubernetes ploomber

Bonus Pick

Store Jupyter Notebooks as plain text   for easier version
control

jupytext Store Jupyter Notebooks as plain text   for easier
version control

Creating interactive reports and dashboards

voilà •Initial Release: 2018 •GitHub Stars: 4.1k 🌟 •GitHub: https://github.com/voila-dashboards/voila

What is it? voilà

A tool for serving Jupyter Notebooks as clean, stand-alone web
applications voilà

What do I have to do to use it? voilà

Not much! voilà

Setup • pip install voila or conda install voila -c
conda-forge voilà

conda-forge • To serve a single notebook: voila my_notbook.ipynb voilà

conda-forge • To serve a single notebook: voila my_notbook.ipynb • To serve a whole directory of notebooks: voila voilà

conda-forge • To serve a single notebook: voila my_notbook.ipynb • To serve a whole directory of notebooks: voila • Optionally specify a custom template: voilà

conda-forge • To serve a single notebook: voila my_notbook.ipynb • To serve a whole directory of notebooks: voila • Optionally specify a custom template: • voila my_notebook.ipynb --template=gridstack voilà

What's in it for me? voilà

Execute and serve Jupyter Notebooks for end users voilà

Source: https://github.com/sysuin/covid-19-world-dashboard

Interactive plots and widgets still work voilà

Source: https://github.com/dhaitz/machine-learning-interactive-visualization

Customize the look and feel of your dashboard with templates
voilà

voilà Source: https://github.com/voila-dashboards/voila-vuetify

Long-running notebooks voilà ⚠

So, where does this leave us?

A smoother path to production for work that starts in
Jupyter Notebooks

• Software Libraries → nbdev projects • Data Transformation Work
fl ows → ploomber pipelines • Reports and Dashboards → voilà dashboards

Data science teams can own a project end-to-end in a
tool and environment they're already comfortable with

Jupyter Notebooks become production artifacts

We can retain the interactivity and computational narrative strengths of
Jupyter Notebooks, even in production settings

Where to go from here?

Jupyter in Production Data Theoretic

Jupyter in Production - Rev 3

Jupyter in Production - Rev 3

More Decks by Patrick Harrison

Other Decks in Programming

Featured

Transcript