1. 24 Oct, 2025 2 commits
  2. 23 Oct, 2025 2 commits
    • zhuwenwen's avatar
      update test_moe.py · 86d92eb9
      zhuwenwen authored
      set USE_FUSED_RMS_QUANT=1 and USE_FUSED_SILU_MUL_QUANT=1
      86d92eb9
    • zhuwenwen's avatar
      support prefix cache on kme · 6f1db287
      zhuwenwen authored
      fix the error in test_moe caused by moe align not supporting 511
      multi-modal switching to torch implementation on z100l&k100
      6f1db287
  3. 20 Oct, 2025 2 commits
  4. 17 Oct, 2025 2 commits
  5. 16 Oct, 2025 3 commits
  6. 15 Oct, 2025 6 commits
  7. 13 Oct, 2025 11 commits
  8. 11 Oct, 2025 6 commits
  9. 10 Oct, 2025 6 commits