Adaptive Network-Aware Scheduling Extension for Mixture-of-Experts Model Deployments in Kubernetes with RoCE and GPUDirect RDMA by Aleksandr Filich, Enes Bajrovic and Siegfried Benkner
You can find more information on AINA2026 on the conference website↗.
See you in Wellington (NZ) in April!