"src/turbomind/utils/memory_utils.cu" did not exist on "4d42a781254e85176bd91f943a28b2d0360e7768"
Merge branch 'lmcafee/byte-buffer' into 'main'
Perform distributed optimizer's all-gather in param dtype (instead of grad dtype) See merge request ADLR/megatron-lm!448
Showing
Please register or sign in to comment