Reza Yazdani authored
* add adamW to CPU-ADAM implementation
* supporting cpu-adam optimizer for zero-offload on deepspeed side
* bump DSE to match cpu-adam updates

Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
f5aa2547