• fxmarty's avatar
    CI: update to ROCm 6.0.2 and test MI300 (#30266) · 37bba2a3
    fxmarty authored
    
    
    * update to ROCm 6.0.2 and test MI300
    
    * add callers for mi300
    
    * update dockerfile
    
    * fix trainer tests
    
    * remove apex
    
    * style
    
    * Update tests/trainer/test_trainer_seq2seq.py
    
    * Update tests/trainer/test_trainer_seq2seq.py
    
    * Update tests/trainer/test_trainer_seq2seq.py
    
    * Update tests/trainer/test_trainer_seq2seq.py
    
    * update to torch 2.3
    
    * add workflow dispatch target
    
    * we may need branches: mi300-ci after all
    
    * nit
    
    * fix docker build
    
    * nit
    
    * add check runner
    
    * remove docker-gpu
    
    * fix issues
    
    * fix
    
    ---------
    Co-authored-by: default avatarYih-Dar <2521628+ydshieh@users.noreply.github.com>
    Co-authored-by: default avatarydshieh <ydshieh@users.noreply.github.com>
    37bba2a3
perf_infer_gpu_one.md 24.7 KB