Reimagining Kubernetes Pods: Nested Containers with CRI-O

Sohan Kunkerkar, Red Hat Reimagining Kubernetes Pods: Nested Containers with
CRI-O

About the Speaker Sohan Kunkerkar Senior Software Engineer - Red
Hat • CRI-O maintainer • Member of SIG-Node • Love playing the flute • Enjoy trekking and outdoor activities

Agenda • Introduction • Real-world demands and aspirations for Pods
• Exploring Nested Containers • Examples • Limitations in the Current Pod Model • Upstream Kubernetes Features • Demo • Future Directions • Q&A session

Introduction • What is a Pod? ◦ Smallest deployable unit
in Kubernetes. ◦ Fundamental building block for deploying applications. • Pod as a Logical Host: ◦ Multiple containers that share the same network namespace and storage volumes. • Primary Use cases: ◦ Microservices needing tight communication. ◦ Sharing volumes for inter-container file access. • Security and Isolation in Pods: ◦ Pods are isolated from other Pods by Kubernetes’ network and storage policies, but containers within the same Pod share certain resources.

Real-world Demands and Aspirations for Pods • Running Containerized Toolchains
◦ Enable tools like Podman/Docker or Buildah to run seamlessly inside Pods. • Virtual Machine-like Isolation ◦ Secured and isolated environments for untrusted workloads. • Flexible Development Environments ◦ Support nested development setups within Pods. • Enhanced Security without Privileges ◦ Run workloads securely using rootless or restricted containers.

Nested Containers

Examples

Limitations with the Current Pod Model • Shared Resources ◦
Containers share namespaces like /proc, limiting certain workloads ◦ No per-container user namespaces for fine-grained isolation. • Security Limitations ◦ Privileged containers expose Pod environment, increasing risks. ◦ Rootless containers are constrained by Pod-level resource limits. • Need for Flexibility ◦ Running tools or build systems inside Pods. ◦ Enhanced isolation for multi-tenant workloads. Pod Pod Security Standards Pod Security Admission Pod Security Context

Upstream Kubernetes Features • User Namespaces Support • UserNamespacesPodSecurityStandards •
Add ProcMount Option

User Namespaces Support • https://github.com/kubernetes/enhancements/issues/127 (Beta in v1.31) • Maps
container root to a non-root user on the host. • Offers stronger isolation, especially in multi-tenant clusters. • How k8s uses it? ◦ Maps container user IDs to different host IDs to reduce privilege escalation risks. ◦ Enables running containers as root inside the container while being non-root on the host. • Why it’s needed? ◦ Isolating security identifiers for enhanced security. ◦ Running privileged processes in pods as unprivileged on the host. ◦ Mitigating security vulnerabilities if containers break out.

UserNamespacesPodSecurityStandards • Alpha in Kubernetes v1.29 • To enhance and
regulate how UserNamespaces are utilized in multi-tenant environments. • Key Points: ◦ Relaxing Pod Security Standards. ◦ Ensures only compliant workloads can leverage user namespaces. ◦ Requires enabling UserNamespacesPodSecurityStandards feature gate and cluster-wide node compatibility.

Add ProcMount Option • https://github.com/kubernetes/enhancements/issues/4265 (Beta in v1.31) • Default
/proc mount exposes sensitive host process details to containers like PIDs and Host kernel configuration. • Provides control over how the /proc filesystem is mounted inside containers. • Why it’s needed? ◦ Security Improvement ◦ Controlled Flexibility

User Namespaced Pod Requirement Version Kubernetes 1.30+ CRI-O 1.30+ container-selinux
2.234.1 Linux Kernel 5.12+

What’s CRI-O? Supports OCI based container images, runtimes, and registries
Implementation of the Kubernetes Container Runtime Interface - compliant with the Open Container Initiative Balance stability and features Focus on security Purpose-built for Kubernetes

Additional Demo https://asciinema.org/a/WsklvCodQmUzL92suQVPNSYZG

Future Directions • Move Kubernetes Features to GA. • Enhanced
Security and Isolation. • Advanced Use Cases and Real-World Feedback.

Thank you!

References • https://www.redhat.com/en/blog/podman-inside-container • https://www.redhat.com/en/blog/podman-inside-kubernetes • https://docs.openshift.com/container-platform/4.17/nodes/pods/nodes-pods-use r-namespaces.html • https://github.com/kubernetes/enhancements/blob/master/keps/sig-node/127-u
ser-namespaces/README.md#motivation • https://github.com/cgruver/ocp-4-17-nested-container-tech-preview

Reimagining Kubernetes Pods: Nested Containers ...

Reimagining Kubernetes Pods: Nested Containers with CRI-O

Sohan Kunkerkar

More Decks by Sohan Kunkerkar

Featured

Transcript

Sohan Kunkerkar, Red Hat Reimagining Kubernetes Pods: Nested Containers with

About the Speaker Sohan Kunkerkar Senior Software Engineer - Red

Agenda • Introduction • Real-world demands and aspirations for Pods

Introduction • What is a Pod? ◦ Smallest deployable unit

Real-world Demands and Aspirations for Pods • Running Containerized Toolchains

Nested Containers

Examples

Limitations with the Current Pod Model • Shared Resources ◦

Upstream Kubernetes Features • User Namespaces Support • UserNamespacesPodSecurityStandards •

User Namespaces Support • https://github.com/kubernetes/enhancements/issues/127 (Beta in v1.31) • Maps

UserNamespacesPodSecurityStandards • Alpha in Kubernetes v1.29 • To enhance and

Add ProcMount Option • https://github.com/kubernetes/enhancements/issues/4265 (Beta in v1.31) • Default

User Namespaced Pod Requirement Version Kubernetes 1.30+ CRI-O 1.30+ container-selinux

What’s CRI-O? Supports OCI based container images, runtimes, and registries

Demo

Demo

Additional Demo https://asciinema.org/a/WsklvCodQmUzL92suQVPNSYZG

Future Directions • Move Kubernetes Features to GA. • Enhanced

Thank you!