[Bugfix][Qwen][DCA] Fix bug in dual-chunk-flash-attn backend for Qwen 1M models (#21364)
Signed-off-by: Tao He <linzhu.ht@alibaba-inc.com>