Fix the condition error when checking fp8 attn in `get_attention_backend` (#1965)
Update utils.py

Fix the condition error of the FP8 attention in `get_attention_backend`

Signed-off-by: yuzhongw-nvidia <yuzhongw@nvidia.com>
Co-authored-by: Xiaowei Ren <103958965+xrennvidia@users.noreply.github.com>