Skip to content

Disaggregated-Serving

Disaggregated Inference on Kubernetes: Routing, Scheduling, and Scaling Beyond One GPU

August 29, 2025