distributed training 1 What Infrastructure does it take to train a 405B Llama3-like model? Jul 28, 2024