feat: Use ForwardPassCallback api from TRTLLM to register end of forward pass...
feat: Use ForwardPassCallback api from TRTLLM to register end of forward pass callback to enable cuda graphs (#3297)
Signed-off-by:
Kyle McGill <kmcgill@nvidia.com>
Showing
Please register or sign in to comment