-
Michael Goldfarb authored
Implementation of context parallel fused attention using all-gather. Signed-off-by:Michael Goldfarb <mgoldfarb@nvidia.com>
9101a78f
Implementation of context parallel fused attention using all-gather.
Signed-off-by:
Michael Goldfarb <mgoldfarb@nvidia.com>