Add post-processing API for FP8 primary weights to support CUDA Graph (#2266)
* Add post-processing API for FP8 primary weights to support CUDA Graph Signed-off-by:kunlunl <kunlunl@nvidia.com> * Add post-processing support for plain pytorch tensors Signed-off-by:
kunlunl <kunlunl@nvidia.com> * Update type hint Signed-off-by:
Tim Moon <4406448+timmoon10@users.noreply.github.com> --------- Signed-off-by:
kunlunl <kunlunl@nvidia.com> Signed-off-by:
Tim Moon <4406448+timmoon10@users.noreply.github.com> Co-authored-by:
Tim Moon <4406448+timmoon10@users.noreply.github.com>
Showing
Please register or sign in to comment