- 14 Apr, 2026 3 commits
-
-
laibao authored
-
laibao authored
-
laibao authored
fix: - 修复 Step3p5 MTP 在加载 checkpoint 时对可选标量参数的识别逻辑,将 q/k/v zero_point 纳入 optional 参数集合,避免参数校验与加载不一致。 revert: - 回退 EAGLE 中针对 MTP shared_head.head 强制复用 target lm_head 的逻辑,避免与当前 Step3p5 MTP 权重结构产生冲突。 目的: - 降低 Step3p5 MTP 在权重加载阶段的兼容性问题,减少由于 lm_head 共享路径不一致导致的异常行为,方便后续排查和协作。
-
- 10 Apr, 2026 4 commits
- 08 Apr, 2026 2 commits
- 03 Apr, 2026 2 commits
- 02 Apr, 2026 2 commits
- 01 Apr, 2026 3 commits
- 28 Mar, 2026 1 commit
-
-
wanglong3 authored
-
- 27 Mar, 2026 3 commits
-
-
flyingdown authored
-
laibao authored
-
flyingdown authored
-
- 26 Mar, 2026 6 commits
-
-
laibao authored
-
laibao authored
feat(v1 attention): 为 ROCm FlashAttention 接入 unified kv layout,并打通 mm_prefix、qq_bias 与 use_alibi_sqrt 透传 在 ROCm FlashAttention 后端增加 unified KV layout 选择逻辑 接入 unified varlen kernel 调用路径 在 FlashAttention metadata 中补充 mm_prefix_range 与 qq_bias 透传
-
wanghl6 authored
-
wanghl6 authored
-
wanghl6 authored
-
wanglong3 authored
-
- 24 Mar, 2026 6 commits
- 23 Mar, 2026 1 commit
-
-
guanyu1 authored
-
- 21 Mar, 2026 6 commits
- 20 Mar, 2026 1 commit
-
-
laibao authored
-