[zero] add load_state_dict for sharded model (#894)
* add load_state_dict for sharded model
* fix bug
* fix bug
* fix ckpt dtype and device
* support load state dict in zero init ctx
* fix bugs