enable cpu distribution training using mpirun (#17570)
* enable cpu distribution training using mpirun

  * command like
  *     mpirun -n 2 python3 run_qa.py --no_cuda --xpu_backend ccl xxxx
  * MASTER_ADDR and MASTER_PORT should be set as env
  * export MASTER_ADDR=127.0.0.1
  * export MASTER_PORT=29500

Signed-off-by: Wang, Yi A <yi.a.wang@intel.com>

* fix according to the review comment

Signed-off-by: Wang, Yi A <yi.a.wang@intel.com>

* use accelerate logic for cpu distribution training to set "RANK","LOCAL_RANK","WORLD_SIZE" environment

Signed-off-by: Wang, Yi A <yi.a.wang@intel.com>
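
The last bullet says that "RANK", "LOCAL_RANK" and "WORLD_SIZE" are set for CPU runs started with mpirun. The sketch below illustrates one way that could work. It is not the code from this commit. The helper names `first_int_from_env` and `export_distributed_env` are hypothetical, and the launcher variable names are assumptions based on what Open MPI, Intel MPI and MVAPICH2 usually export.

```python
import os

# Candidate variables per value. The generic name comes first, followed by
# names assumed to be exported by Intel MPI (PMI), Open MPI and MVAPICH2.
_RANK_VARS = ("RANK", "PMI_RANK", "OMPI_COMM_WORLD_RANK", "MV2_COMM_WORLD_RANK")
_SIZE_VARS = ("WORLD_SIZE", "PMI_SIZE", "OMPI_COMM_WORLD_SIZE", "MV2_COMM_WORLD_SIZE")
_LOCAL_RANK_VARS = (
    "LOCAL_RANK",
    "MPI_LOCALRANKID",
    "OMPI_COMM_WORLD_LOCAL_RANK",
    "MV2_COMM_WORLD_LOCAL_RANK",
)


def first_int_from_env(names, default, env):
    """Return the first non-negative integer stored under any of `names`."""
    for name in names:
        value = int(env.get(name, -1))
        if value >= 0:
            return value
    return default


def export_distributed_env(env=None):
    """Copy launcher-provided values into RANK, WORLD_SIZE and LOCAL_RANK.

    Hypothetical helper: falls back to a single-process layout (0, 1, 0)
    when no launcher variable is present.
    """
    env = os.environ if env is None else env
    rank = first_int_from_env(_RANK_VARS, 0, env)
    world_size = first_int_from_env(_SIZE_VARS, 1, env)
    local_rank = first_int_from_env(_LOCAL_RANK_VARS, 0, env)
    env["RANK"] = str(rank)
    env["WORLD_SIZE"] = str(world_size)
    env["LOCAL_RANK"] = str(local_rank)
    return rank, world_size, local_rank
```

In this sketch, each process would call `export_distributed_env()` before initializing the process group. MASTER_ADDR and MASTER_PORT are not derived here and still have to be exported by hand, as the first bullet says.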