train_mixtral_8x7b_distributed.sh 2.5 KB