LINE DevDay 2020
November 26, 2020

Jutopia - closing the distance between AI participants

Changqin He
LINE China, LINE Service Development, Server-side Engineer
https://linedevday.linecorp.com/2020/ja/sessions/9652
https://linedevday.linecorp.com/2020/en/sessions/9652

Transcript

  1. 40% of digital transformation initiatives will use AI/ML services. AI/ML spending will grow to $110 billion in 2024. Source: IDC https://www.idc.com/getdoc.jsp?containerId=prUS46794720
  2. Machine Learning is Hard! Drinking from a fire hose
    Supervised/unsupervised/reinforcement learning …; Data sourcing, cleanup, tagging & classification; Linear/logistic regression, Random forest, Decision tree, …; Linear algebra, Statistics, Probability; TensorFlow, PyTorch, MXNet, Caffe2, Keras…; Python, Julia, R, …; Training and evaluating models; Distributed training; IntelliJ, VSCode, PyCharm, Jupyter notebook; Hyperparameter Tuning; GPU or CPU; MLOps
    https://happykaty.com/2018/05/15/drinking-from-a-fire-hose/
  3. What is Jutopia? Jupyter To Pipeline Architecture
    Notebooks: Jutopia builds the ML pipeline around notebooks to make all stages of ML smoother. You can customize your notebooks and compute resources to suit your data science needs.
    Multi-framework: Jutopia supports PyTorch, TF, XGBoost, Keras and other environments, and lets you build a custom ML environment.
    Model Serving: Jutopia is integrated with BentoML, an open-source framework for ML model serving that lets you package once and deploy anywhere, supporting Docker, K8s, Kubeflow, and more.
  4. What is Jutopia? Jupyter To Pipeline Architecture
    Pipelines: Jutopia uses Airflow for deploying and managing ML workflows. A visual pipeline editor lets you design Airflow DAGs without knowing Python or learning Airflow primitives.
    Infrastructures: Jutopia provides best-of-breed open-source systems as a service for ML, which will be deployed to diverse infrastructures such as Docker, Kubernetes, etc.
    Open Source: Jutopia’s goal is not to rebuild other services. Instead, we will extend open source projects and open-source our own best modules.
  5. Architecture & Workflow
    [Diagram labels: Data ETL, Development, Model Training, Persistent Storage, Record Experiment, Sharing, Review, Scheduling, Parametric Notebooks, Model Deployment, Model Serving, Pipeline]
  6. Jutopia Component-Notebooks: Why Jupyter Notebooks?
    Aggregate Report, Build Model, Create Dataset
    Data exploration, Data preparation, Data validation, Productionalization
    Run code, Explore data, Present results
    https://netflixtechblog.com/notebook-innovation-591ee3221233
  7. Jutopia Component-JupyterNBViewer: Share Notebooks from Local or S3
    Commuter: No Web Context, Different styles
    Jupyter_nb_viewer: Easy to integrate, Line Style
    https://github.com/line/devday_2020_jupyter_nb_viewer
    https://github.com/nteract/commuter
    https://linedevday.linecorp.co/2020/en/sessions/5342
  8. Why BentoML for Serving
    The easiest way to build Machine Learning APIs
    Core Team Member
    Features: Any ML frameworks, Deploy anywhere, High-Performance, Central hub, Modular and flexible
    https://github.com/bentoml/BentoML
  9. Who is using Jutopia?
    Organization Statistics: LINE Corp, LINE Plus Corp, LINE China, LINE Taiwan, LINE Thailand, NAVER
  10. Future works
    Model Deployment: API Gateway (A/B test, Multi-armed bandit)
    Distributed training: Kubeflow (MPI-job), Horovod
    Hyperparameter tuning: Ray, Katib
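
Slide 10 names multi-armed bandit routing at the API gateway as future work, without describing how it would be built. As a rough illustration of the idea only, the sketch below uses an epsilon-greedy bandit (one common strategy, chosen here as an assumption) to split simulated traffic between two model variants. The class `EpsilonGreedyRouter`, the function `simulate`, the variant names `model_a` and `model_b`, and the success rates are all hypothetical and are not part of Jutopia.

```python
import random


class EpsilonGreedyRouter:
    """Hypothetical router that picks one model variant per request."""

    def __init__(self, variants, epsilon=0.1, seed=None):
        self.variants = list(variants)
        if not self.variants:
            raise ValueError("at least one variant is required")
        self.epsilon = epsilon
        self.rng = random.Random(seed)
        self.pulls = {v: 0 for v in self.variants}
        self.mean_reward = {v: 0.0 for v in self.variants}

    def choose(self):
        # Explore a random variant with probability epsilon,
        # otherwise exploit the variant with the best observed mean reward.
        if self.rng.random() < self.epsilon:
            return self.rng.choice(self.variants)
        return max(self.variants, key=lambda v: self.mean_reward[v])

    def record(self, variant, reward):
        # Incremental update of the running mean reward for this variant.
        self.pulls[variant] += 1
        n = self.pulls[variant]
        self.mean_reward[variant] += (reward - self.mean_reward[variant]) / n


def simulate(true_rates, rounds=5000, epsilon=0.1, seed=0):
    """Route `rounds` simulated requests; `true_rates` are made-up success rates."""
    router = EpsilonGreedyRouter(true_rates.keys(), epsilon=epsilon, seed=seed)
    env = random.Random(seed + 1)  # separate stream for the simulated feedback
    for _ in range(rounds):
        variant = router.choose()
        reward = 1.0 if env.random() < true_rates[variant] else 0.0
        router.record(variant, reward)
    return router


if __name__ == "__main__":
    router = simulate({"model_a": 0.05, "model_b": 0.20})
    print(router.pulls)
    print(router.mean_reward)
```

Unlike a fixed 50/50 A/B test, the router shifts most traffic toward the variant that is performing better while still sending a small share (`epsilon`) to the others, so a variant that improves later can still be noticed.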
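
Slide 5 lists "Parametric Notebooks" and "Scheduling" as workflow stages but does not say which tool performs the parameterization. The sketch below shows the general idea using only the Python standard library: a notebook (in the nbformat v4 JSON layout) has a cell tagged `parameters` holding defaults, and each scheduled run gets a copy with an extra cell that overrides them. This tagging convention is the one popularized by papermill and is an assumption here, not a confirmed detail of Jutopia; `inject_parameters`, `code_cell`, and `TEMPLATE` are hypothetical names.

```python
import copy
import json


def code_cell(source, tags=()):
    """Build a minimal nbformat v4 code cell as a plain dict."""
    return {
        "cell_type": "code",
        "execution_count": None,
        "metadata": {"tags": list(tags)},
        "outputs": [],
        "source": source,
    }


def inject_parameters(notebook, params):
    """Return a copy of `notebook` with an override cell for `params`.

    The new cell is placed right after the first cell tagged "parameters",
    or at the top if no such cell exists. Values are written with repr(),
    so this only works for simple literals (numbers, strings, lists, ...).
    """
    nb = copy.deepcopy(notebook)  # leave the template untouched
    source = "\n".join(f"{name} = {value!r}" for name, value in params.items())
    injected = code_cell(source, tags=["injected-parameters"])

    position = 0
    for index, cell in enumerate(nb["cells"]):
        if "parameters" in cell.get("metadata", {}).get("tags", []):
            position = index + 1
            break
    nb["cells"].insert(position, injected)
    return nb


# Hypothetical template notebook: defaults first, then the actual work.
TEMPLATE = {
    "nbformat": 4,
    "nbformat_minor": 4,
    "metadata": {},
    "cells": [
        code_cell("learning_rate = 0.1\nepochs = 1", tags=["parameters"]),
        code_cell("print(learning_rate, epochs)"),
    ],
}

if __name__ == "__main__":
    run = inject_parameters(TEMPLATE, {"learning_rate": 0.01, "epochs": 20})
    print(json.dumps(run["cells"][1], indent=2))
```

A scheduler could call `inject_parameters` once per run with different values and hand each resulting notebook to an executor, which keeps the template itself unchanged and reviewable.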