[Docs] [V1] [Hybrid] Update docs to remove FlashInfer constraint for hybrid models (#23665)

Signed-off-by: Thomas Parnell <tpa@zurich.ibm.com>

[Docs] [V1] [Hybrid] Update docs to remove FlashInfer constraint for hybrid models (#23665)
Signed-off-by: Thomas Parnell <tpa@zurich.ibm.com>
227e231b · Thomas Parnell · GitHub · 730d0ac8 · 227e231b
Unverified Commit 227e231b authored Aug 26, 2025 by Thomas Parnell Committed by GitHub Aug 26, 2025
Show whitespace changes
Inline Side-by-side

Showing with 2 additions and 3 deletions

docs/usage/v1_guide.md docs/usage/v1_guide.md +2 -3

No files found.
--- a/docs/usage/v1_guide.md
+++ b/docs/usage/v1_guide.md
@@ -111,11 +111,10 @@ Models that use Mamba-2 and Mamba-1 layers (e.g., `Mamba2ForCausalLM`, `MambaFor

 Models that combine Mamba-2 and Mamba-1 layers with standard attention layers are also supported (e.g., `BambaForCausalLM`,
 `Zamba2ForCausalLM`, `NemotronHForCausalLM`, `FalconH1ForCausalLM` and `GraniteMoeHybridForCausalLM`, `JambaForCausalLM`). Please note that
-these models currently require disabling prefix caching and using the FlashInfer attention backend in V1.
+these models currently require disabling prefix caching in V1.

 Hybrid models with mechanisms different to Mamba are also supported (e.g, `MiniMaxText01ForCausalLM`, `MiniMaxM1ForCausalLM`).
-Please note that these models currently require disabling prefix caching, enforcing eager mode, and using the FlashInfer
-attention backend in V1.
+Please note that these models currently require disabling prefix caching and enforcing eager mode in V1.

 #### Encoder-Decoder Models