"tests/kernels/attention/test_cascade_flash_attn.py" did not exist on "bf33700ecd6db472c4aeb489c5d42aa47a735198"
Benjamin Chislett authored
[Perf] Refactor cudagraph_support to enable full CUDA graphs for spec decoding with FlashInfer (#28479)

Signed-off-by: Benjamin Chislett <bchislett@nvidia.com>
30441957