Scaling AI Workloads with Kubernetes: Sharing GPU Resources Across Multiple Containers - Jack Ong

Similar Tracks
Scaling Kubernetes Clusters for Generative Models: Managing GPU Resources for AI App... Jack Min Ong
The Linux Foundation
Divide and Conquer: Master GPU Partitioning and Visualize Savings with OpenCost - Kaysie Yu
CNCF [Cloud Native Computing Foundation]
Bringing Service Security to a New Level: An Introduction to SaaSBOMs - Ivana Atanasova & Rose Judge
The Linux Foundation
Efficient Large-Scale Language Model Training on GPU Clusters Using Megatron-LM | Jared Casper
@Scale
Mastering GPU Management in Kubernetes Using the Operator Pattern- Shiva Krishna Merla & Kevin Klues
CNCF [Cloud Native Computing Foundation]
Enabling Fault Tolerance for GPU Accelerated AI Workloads in Kubernetes - A. Singh & A. Paithankar
CNCF [Cloud Native Computing Foundation]