[fix] [MEVO]: make mevo work with eval and optim_state checkpointing (#851)
* [fix]: fix eval for shared weight FSDP
* fixing optim state saving
* add changelog
* reformat with newer local isort
* update test
* avoid computing reference state unless we are testing training
* added optim_state test
* make mypy happy
* move tests; CUDA memory related tests may need to come first in the lists
Co-authored-by: Min Xu <min.xu.public@gmail.com>