1. 10 Feb, 2026 1 commit
    • zhuwenwen's avatar
      [qwen3-235b] MoE(TN&NN) configs for nmz TP=8 · 7624bd05
      zhuwenwen authored
      [qwen3-480b] MoE(TN) configs for nmz TP=8
      [opt] 优化deepep相关代码
      [fix] 修复deepseek moe模型的awq量化推理bug和精度问题, 修复awq模型的VLLM_USE_LIGHTOP_MOE_SUM_MUL_ADD设置位置, update_state,优化性能,去除冗余操作
      pcie 解决custom cudagraph模式需要拷贝的问题,需要配合dtk进行使用
      [feat] Switch default w8a8 gemm impl to blaslt. Support w8a8-fp8 GEMM backend.MoE 路由抓取:新增 router_capture 工具链与 envs 统一配置
      [envs] set VLLM_CUSTOM_CACHE=1、VLLM_USE_FUSED_RMS_ROPE=1、VLLM_USE_FUSED_FILL_RMS_CAT=1、VLLM_USE_FLASH_ATTN_FP8=1、VLLM_USE_FLASH_MLA_FP8=1、update VLLM_USE_TOPK_RENORM
      7624bd05
  2. 03 Jun, 2025 1 commit
  3. 15 May, 2025 1 commit
  4. 13 May, 2025 1 commit
  5. 07 Apr, 2025 1 commit
  6. 06 Apr, 2025 1 commit
  7. 02 Feb, 2025 1 commit
    • Russell Bryant's avatar
      [Misc] Add SPDX-License-Identifier headers to python source files (#12628) · e489ad7a
      Russell Bryant authored
      - **Add SPDX license headers to python source files**
      - **Check for SPDX headers using pre-commit**
      
      commit 9d7ef44c3cfb72ca4c32e1c677d99259d10d4745
      Author: Russell Bryant <rbryant@redhat.com>
      Date:   Fri Jan 31 14:18:24 2025 -0500
      
          Add SPDX license headers to python source files
          
      This commit adds SPDX license headers to python source files as
      recommended to
      the project by the Linux Foundation. These headers provide a concise way
      that is
      both human and machine readable for communicating license information
      for each
      source file. It helps avoid any ambiguity about the license of the code
      and can
          also be easily used by tools to help manage license compliance.
          
      The Linux Foundation runs license scans against the codebase to help
      ensure
          we are in compliance with the licenses of the code we use, including
      dependencies. Having these headers in place helps that tool do its job.
          
          More information can be found on ...
      e489ad7a
  8. 27 Dec, 2024 1 commit
  9. 24 Dec, 2024 1 commit
  10. 27 Nov, 2024 1 commit