"vscode:/vscode.git/clone" did not exist on "24cde76a152fbffde30fa2be0d08dcbad490530e"
[JAX] Context Parallel Attention with All-Gather (#1106)
Implementation of context parallel fused attention using all-gather.
Signed-off-by:
Michael Goldfarb <mgoldfarb@nvidia.com>
Showing
Please register or sign in to comment