Scaling Kubernetes for AI Workloads November 22, 2023 Lessons learned from managing 10k+ node clusters for large language model training. #kubernetes#ai#infrastructure