GPUs are the new black box. As AI workloads take over production, SREs are left blind: your observability stack stops at the CPU boundary. When an inference job degrades at 3am, you have no idea where to start looking.
This talk covers how eBPF is being pushed beyond the CPU, from tracing inference pipelines with zero code changes to injecting eBPF into GPU kernels, and where research in this area is heading.