"src/targets/vscode:/vscode.git/clone" did not exist on "a9e5d73c4b3209b81787579049e63dad549fdb8c"
Merge branch 'ckpt_transpose' into 'main'
Rework handling of older checkpoint's attention weight/bias ordering. See merge request ADLR/megatron-lm!219
Showing
Please register or sign in to comment