You need to sign in or sign up before continuing.
Unverified Commit 8ba7e5d4 authored by wang jiahao's avatar wang jiahao Committed by GitHub
Browse files

Merge pull request #1227 from kvcache-ai/change-yaml

change inject yaml
parents 2a224b25 48dfbc8f
...@@ -44,7 +44,7 @@ ...@@ -44,7 +44,7 @@
- match: - match:
name: "^model\\.layers\\..*\\.self_attn$" name: "^model\\.layers\\..*\\.self_attn$"
replace: replace:
class: ktransformers.operators.attention.flashinfer_attn # optimized MLA implementation class: ktransformers.operators.balance_serve_attention.flashinfer_attn # optimized MLA implementation
kwargs: kwargs:
generate_device: "cuda" generate_device: "cuda"
prefill_device: "cuda" prefill_device: "cuda"
......
Markdown is supported
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment