65K node Kubernetes AI Platform – A Reality

Generative AI models keep growing: current models reach hundreds of billions of parameters, and the most advanced approach 2 trillion. Training models of this size on modern accelerators requires clusters of more than 10,000 nodes. Google Kubernetes Engine (GKE), which currently supports the world’s largest managed Kubernetes clusters at 15,000 nodes, can already handle these demanding training workloads. Anticipating even larger models, we are introducing support for 65,000-node clusters. Combined with advances in accelerator computing power, this expansion will enable training models with 10 trillion parameters or more.

Date: November 13, 2024