- 10 Feb, 2026 1 commit
zhuwenwen authored
[qwen3-480b] MoE(TN) configs for nmz TP=8
[opt] Optimize deepep-related code
[fix] Fix the awq quantized inference bug and accuracy issues for deepseek moe models; fix where VLLM_USE_LIGHTOP_MOE_SUM_MUL_ADD is set for awq models; update_state, optimize performance, remove redundant operations; pcie: resolve the issue that custom cudagraph mode requires a copy (must be used together with dtk)
[feat] Switch default w8a8 gemm impl to blaslt. Support w8a8-fp8 GEMM backend. MoE router capture: add the router_capture toolchain and unified envs configuration
[envs] set VLLM_CUSTOM_CACHE=1, VLLM_USE_FUSED_RMS_ROPE=1, VLLM_USE_FUSED_FILL_RMS_CAT=1, VLLM_USE_FLASH_ATTN_FP8=1, VLLM_USE_FLASH_MLA_FP8=1; update VLLM_USE_TOPK_RENORM
-
- 03 Jun, 2025 1 commit
Simon Mo authored
Signed-off-by: simon-mo <simon.mo@hey.com>
-
- 15 May, 2025 1 commit
Harry Mellor authored
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
- 13 May, 2025 1 commit
zhuwenwen authored
Support telechat2 and glm4 nn layout; remove log of request_id
-
- 07 Apr, 2025 1 commit
Lu Fang authored
Signed-off-by: Aston Zhang <22279212+astonzhang@users.noreply.github.com>
Signed-off-by: Chris Thi <chris.c.thi@gmail.com>
Signed-off-by: drisspg <drisspguessous@gmail.com>
Signed-off-by: Jon Swenson <jmswen@gmail.com>
Signed-off-by: Keyun Tong <tongkeyun@gmail.com>
Signed-off-by: Lu Fang <fanglu@meta.com>
Signed-off-by: Xiaodong Wang <xdwang@meta.com>
Signed-off-by: Yang Chen <yangche@fb.com>
Signed-off-by: Ye (Charlotte) Qi <yeq@meta.com>
Signed-off-by: Yong Hoon Shin <yhshin@meta.com>
Signed-off-by: Zijing Liu <liuzijing2014@gmail.com>
Signed-off-by: Lu Fang <lufang@fb.com>
Signed-off-by: Lu Fang <fanglu@fb.com>
Signed-off-by: Lucia Fang <fanglu@fb.com>
Signed-off-by: Roger Wang <ywang@roblox.com>
Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
Co-authored-by: Lu Fang <fanglu@fb.com>
Co-authored-by: Roger Wang <ywang@roblox.com>
Co-authored-by: DarkLight1337 <tlleungac@connect.ust.hk>
-
- 06 Apr, 2025 1 commit
Lucia Fang authored
Signed-off-by: Lu Fang <fanglu@fb.com>
-
- 02 Feb, 2025 1 commit
Russell Bryant authored
- **Add SPDX license headers to python source files**
- **Check for SPDX headers using pre-commit**

commit 9d7ef44c3cfb72ca4c32e1c677d99259d10d4745
Author: Russell Bryant <rbryant@redhat.com>
Date: Fri Jan 31 14:18:24 2025 -0500

Add SPDX license headers to python source files

This commit adds SPDX license headers to python source files as recommended to the project by the Linux Foundation. These headers provide a concise way that is both human and machine readable for communicating license information for each source file. It helps avoid any ambiguity about the license of the code and can also be easily used by tools to help manage license compliance.

The Linux Foundation runs license scans against the codebase to help ensure we are in compliance with the licenses of the code we use, including dependencies. Having these headers in place helps that tool do its job.

More information can be found on ...
-
- 27 Dec, 2024 1 commit
Jee Jee Li authored
Signed-off-by: Jee Jee Li <pandaleefree@gmail.com>
-
- 24 Dec, 2024 1 commit
Jee Jee Li authored
Signed-off-by: Jee Jee Li <pandaleefree@gmail.com>
-
- 27 Nov, 2024 1 commit
shunxing12345 authored
Signed-off-by: Isotr0py <2037008807@qq.com>
Co-authored-by: xiangw2 <xiangw2@chinatelecom.cn>
Co-authored-by: Isotr0py <2037008807@qq.com>
-