%%async_run: an IPython notebook extension for asynchronous cell execution

[%]%async_run an IPython notebook* magic for asynchronous (code) cell execution
Valerio Maggio (@leriomaggio) Fondazione Bruno Kessler (FBK) - Trento, IT

Jupyter Notebook

Reproducible Research Open Science and Open Code

Motivations
• Sometimes heavy computations need to run in IPython notebooks
  • computationally intensive code cells
• Moreover, the computation may need to run on a remote server machine
  • reminder: Jupyter Notebook Server
• In the general case, this could work… but since…

–Murphy’s Law, 1952 “Anything that can possibly go wrong, does.”

Main Goal (for this one-weekend hack project)
Try to define a strategy to cope with this kind of situation, keeping the following requirements in mind:
• Allow execution on a remote machine (also)
• Avoid busy waiting on the client machine
• Keep the notebook as interactive as possible
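The "no busy waiting" requirement can be illustrated with the standard-library concurrent.futures module, which this talk names later on. This is only a minimal sketch under stated assumptions, not the extension's actual code: heavy_computation and on_done are hypothetical names, and a thread pool stands in for whatever remote or multi-process worker would really run the cell.

```python
import time
from concurrent.futures import ThreadPoolExecutor


def heavy_computation(n):
    """Hypothetical stand-in for a computationally intensive cell."""
    time.sleep(0.1)  # simulate slow work
    return sum(i * i for i in range(n))


results = []


def on_done(future):
    # Invoked by the executor once the task has finished,
    # so the caller never has to poll for the result.
    results.append(future.result())


executor = ThreadPoolExecutor(max_workers=1)
future = executor.submit(heavy_computation, 1000)  # returns immediately
future.add_done_callback(on_done)

# The caller is free to do other work here while the task runs.

# Demo only: wait for the worker so the script can exit cleanly.
executor.shutdown(wait=True)
```

The key point is that submit() hands back a Future right away, and the callback delivers the result when it is ready. Swapping in a process pool would bring in the pickling constraints listed under the limitations.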

What I learned during my adventures in the world of Jupyter, Multiprocessing and Asynchronous I/O

IPython Magics (since IPython 3.x)
• IPython has a system of commands we call magics
  • they effectively provide a mini command language that is orthogonal to the syntax of Python
  • easily extensible by the user with new commands
• Magics are meant to be typed interactively
  • i.e. they follow command-line conventions
  • e.g. whitespace for separating arguments, dashes for options
• Magics come in two kinds:
  • Line magics: prefixed by one % character
  • Cell magics: marked by two percent characters (%%)

Activate the matplotlib inline backend to have charts displayed inline with notebook cells

[%]%timeit Line Magic Cell Magic

Custom Magics: how to
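As a rough illustration of what a custom magic can look like, the sketch below defines a function that works both as a line magic and as a cell magic. The word_count name and its behaviour are invented for this example and are not part of the talk. Registration is guarded so the snippet also runs outside IPython, where it simply defines the function.

```python
def word_count(line, cell=None):
    """Hypothetical magic: count words in the line, or in the cell body.

    A line magic receives only `line` (the text after the magic name).
    A cell magic also receives `cell` (the rest of the cell as one string).
    """
    text = line if cell is None else cell
    return len(text.split())


try:
    from IPython import get_ipython
    shell = get_ipython()  # None when not running inside IPython
except ImportError:
    shell = None  # IPython is not installed

if shell is not None:
    # Make the function available as both %word_count and %%word_count.
    shell.register_magic_function(
        word_count, magic_kind="line_cell", magic_name="word_count"
    )
```

Inside a notebook, `%word_count some text here` would then count the words on the line, while a cell starting with `%%word_count` would count the words in the cell body.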

Notebook Data Format

First Idea (very early stage)
Run the heavy computation (in some way), use the write API to add a new cell to the notebook, and that's it.
Drawbacks:
• No interactivity
• No way to auto-refresh the content
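To make the "add a new cell" idea concrete: a notebook file is a JSON document, so a cell can be appended to its list of cells. The sketch below builds the structure by hand with the standard library so that it stays self-contained. The helper names new_code_cell and append_cell are defined here for illustration only. Real code would more likely rely on the nbformat package (for instance nbformat.v4.new_code_cell and nbformat.write) rather than raw dictionaries.

```python
import json


def new_code_cell(source):
    """Build a minimal nbformat-4-style code cell as a plain dict."""
    return {
        "cell_type": "code",
        "execution_count": None,
        "metadata": {},
        "outputs": [],
        "source": source,
    }


def append_cell(notebook, source):
    """Append a code cell to an in-memory notebook and return it."""
    notebook["cells"].append(new_code_cell(source))
    return notebook


# A tiny in-memory notebook with one cell (contents are made up).
notebook = {
    "cells": [new_code_cell("result = heavy_computation()")],
    "metadata": {},
    "nbformat": 4,
    "nbformat_minor": 0,
}

append_cell(notebook, "print(result)")
serialized = json.dumps(notebook, indent=1)  # what would be written to disk
```

This also shows why the drawbacks hold: the new cell only exists in the file, so the open notebook page does not see it until it is reloaded.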

Try to see if there’s any existing solution to this!
Take away: avoid reinventing the wheel!

%run to the rescue (?) ipython.org/ipython-doc/3/interactive/magics.html#magic-run

Test in the notebook

A bit more complicated

Test it!
• Blocking call
• No interactivity

runipy to the rescue (?) https://github.com/paulgb/runipy

A closer look

Notebook Runner

runipy features
• (+) Notebook APIs
• (+) Kernel Protocol Messaging
• (+) Support for multiple document formats
  • nbformat.versions
• (-) No interactivity
• (-) No support for online/non-blocking execution
• (~) No support for multi-processing

Idea: try to borrow some code from runipy and re-implement it as an IPython Magic (w/ steroids)

• But if you …: it hangs on protocol communication and has no link with the current shell

What I learned during my adventures in the world of Jupyter, Multiprocessing and Asynchronous I/O

IPython is based on Tornado!

Reference

AsyncRun Magic

AsyncRun Server

AsyncRun Handler

Javascript (Browser) Side

WSConnector

AsyncRun Handler

Cell Execution

After Execution

WSConnector

Limitations and concurrent.futures (work)
• Pickle serialisation dependency
  • major flaw of the Python multiprocessing module
  • try to use dill & multiprocess*
• Improve the infrastructure to handle errors
  • not really handled yet
• Allow the async execution of multiple cells at a time
  • revise multiprocessing architecture and dependencies
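The pickle dependency can be seen in a few lines. Objects sent to another process are serialised with pickle, and pickle stores functions by name, so anything that cannot be looked up by name (a lambda, for example) cannot be shipped. The sketch below only probes pickle itself. The can_pickle helper is a name invented for this example.

```python
import math
import pickle


def can_pickle(obj):
    """Return True if `obj` survives pickle.dumps, False otherwise."""
    try:
        pickle.dumps(obj)
        return True
    except (pickle.PicklingError, AttributeError, TypeError):
        return False


# A function importable by name is pickled as a reference to that name.
named_ok = can_pickle(math.sqrt)

# A lambda has no importable name, so pickling it fails.
lambda_ok = can_pickle(lambda x: x * x)

print("math.sqrt picklable:", named_ok)
print("lambda picklable:", lambda_ok)
```

This is the gap that serialisers such as dill, mentioned in the list above, aim to close.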

Demo Time even considering the aforementioned Murphy’s Laws :P https://github.com/leriomaggio/async-ipython-magic

Thanks a lot for your kind attention
+ValerioMaggio it.linkedin.com/in/valeriomaggio
@leriomaggio https://github.com/leriomaggio/async-ipython-magic
