[test] AdaScale & SDP/FSDP (#468)
- cover them in terms of code path only
- numerically, AdaScale is different on SDP/FSDP than on DDP, mainly due to the partial view of the gradients
- this doesn't mean it is definitely not useful, but it is yet to be validated
- not going to spend too much time on this until we have a real use case
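
A toy sketch of the "partial view of the gradients" point, assuming a flat gradient split evenly across ranks. This is not fairscale code; `sq_norm`, `shard`, and the sample values are hypothetical and only illustrate why a norm computed on one shard differs from the norm of the full gradient.

```python
# Toy illustration only: hypothetical helpers, not part of fairscale.

def sq_norm(values):
    """Squared L2 norm of a flat list of gradient values."""
    return sum(v * v for v in values)


def shard(values, world_size):
    """Split a flat gradient evenly across ranks (assumes an even split)."""
    size = len(values) // world_size
    return [values[r * size:(r + 1) * size] for r in range(world_size)]


# Under DDP every rank holds the full gradient.
full_grad = [1.0, 2.0, 3.0, 4.0]
full_sq = sq_norm(full_grad)

# Under a sharded setup each rank holds only its own slice.
shards = shard(full_grad, world_size=2)
per_rank_sq = [sq_norm(s) for s in shards]

print("full squared norm:", full_sq)
print("per-rank squared norms:", per_rank_sq)
```

In this toy case the per-rank values only recover the full squared norm once they are summed across ranks, so any statistic built from a single rank's shard is computed on different numbers than it would be under DDP.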