Upgrade to Pro — share decks privately, control downloads, hide ads and more …

Open Infrastructure in the Cloud with JupyterHub

Open Infrastructure in the Cloud with JupyterHub

In this talk I cover the principles, design, goals, and general structure of JupyterHub, a tool for providing interactive computing environments to others on shared infrastructure. I also cover several cool deployments of JupyterHub in the open community - using JupyterHub for education (data8.org), for large-scale geoscience in the cloud (pangeo.io) and for open community collaboration (openhumans.org).

Chris Holdgraf

May 30, 2019
Tweet

More Decks by Chris Holdgraf

Other Decks in Science

Transcript

  1. @choldgraf A bit about me now... Research and Open Source

    Education and Open Source Jupyter @ Berkeley
  2. a community of people and an ecosystem of open tools

    and standards for interactive computing
  3. You Your awesome report The Jupyter Notebook is a stack

    of modular, open tools server .ipynb package ecosystem Notebook document specification Jupyter server protocol Interactive Kernels Notebook interfaces
  4. Build infrastructure tools that are workflow- and platform-agnostic. Give people

    control over resources, freedom to deploy what and where they wish. (in the ☁)
  5. JupyterHub distributions The Littlest JupyterHub tljh.jupyter.org JupyterHub on Kubernetes z2jh.jupyter.org

    A pre-configured JupyterHub setup with sensible defaults and lots of documentation, fit for many use-cases ☁
  6. Scalable in both users and in resources Uses Docker for

    environment management Agnostic to the provider and hardware configuration Zero to JupyterHub for Kubernetes z2jh.jupyter.org
  7. The littlest JupyterHub Deploy JupyterHub on a single virtual machine

    Faster, lightweight setup and administration More easily created and destroyed tljh.jupyter.org
  8. • A JupyterHub + BinderHub managing an open stack of

    tools for geospatial analysis • Utilize and improve pre-existing tools, rather than build new ones. Push improvements upstream. • Add value with customization and configuration • Provide access to high-performance hardware The Pangeo pattern pangeo.io
  9. Chris Is Trying A Live Demo Hopefully he doesn’t embarrass

    himself too badly. exploratory.openhumans.org
  10. • JupyterHub connects users with interactive environments on shared infrastructure

    using open tools • JupyterHub distributions are opinionated deployments of a JupyterHub for a specific scale or purpose. ◦ The Littlest JupyterHub - Deploy on a single VM ◦ Zero to JupyterHub for Kubernetes - Deploy in the ☁ • JupyterHub has been used for ◦ Large-scale education (data8.org) ◦ High-performance analysis in the ☁(pangeo.io) ◦ Collaborative community analytics (openhumans.org) In summary
  11. Get involved with Jupyter @choldgraf jupyterhub-team-compass.readthedocs.io discourse.jupyter.org • All of

    these projects are open source, run by open communities • Jupyter is a place where *anybody* can participate • If you’d like to get involved: