"docs/source/en/_toctree.yml" did not exist on "8ff777d3c141191725e61a85399333e936c1bf22"
Merge branch 'ckpt_transpose' into 'main'
Rework handling of older checkpoint's attention weight/bias ordering. See merge request ADLR/megatron-lm!219
Showing
Please register or sign in to comment