Slide 43
Slide 43 text
43
Tensorflow Extended Pipeline
Orchestration
• Production: you will use an orchestrator such as Apache Airflow, Kubeflow Pipelines,
or Apache Beam to orchestrate a pre-defined pipeline graph of TFX components.
• Interactive notebook(local): the notebook itself is the orchestrator, running each TFX
component as you execute the notebook cells.
Metadata
• Production: MLMD stores metadata properties in a database such as MySQL or
SQLite, and stores the metadata payloads in a persistent store such as on your
filesystem.
• Interactive notebook(local): both properties and payloads are stored in an ephemeral
SQLite database in the /tmp directory on the Jupyter notebook or Colab server.