[JAX] Fix incorrect sharding when only FSDP is enabled and memory misalignment in LN_BWD. (#379)
* [JAX] Fix incorrect sharding when only FSDP is enabled.

Signed-off-by: Ming Huang <mingh@nvidia.com>

* [JAX] Add WAR for memory misalignment issues in LN BWD.

Signed-off-by: Ming Huang <mingh@nvidia.com>

* [JAX] Reuse sm_arch to avoid duplicate code.

Signed-off-by: Ming Huang <mingh@nvidia.com>

* [JAX] Support multiple-size allocation in WorkspaceManager.

Signed-off-by: Ming Huang <mingh@nvidia.com>

* [JAX] Use templates and variadic arguments to improve the multiple-size allocator.

Signed-off-by: Ming Huang <mingh@nvidia.com>

---------

Signed-off-by: Ming Huang <mingh@nvidia.com>