- 09 Feb, 2026 1 commit
-
-
zhuwenwen authored
-
- 06 Feb, 2026 4 commits
- 05 Feb, 2026 1 commit
-
-
zhuwenwen authored
-
- 04 Feb, 2026 3 commits
- 28 Jan, 2026 1 commit
-
-
Roger Wang authored
Signed-off-by:
wanglinian <wanglinian@stu.pku.edu.cn> Signed-off-by:
wangln19 <96399074+wangln19@users.noreply.github.com> Signed-off-by:
Isotr0py <mozf@mail2.sysu.edu.cn> Signed-off-by:
youkaichao <youkaichao@gmail.com> Signed-off-by:
Roger Wang <hey@rogerw.io> Co-authored-by:
wanglinian <wanglinian@stu.pku.edu.cn> Co-authored-by:
wangln19 <96399074+wangln19@users.noreply.github.com> Co-authored-by:
Zaida Zhou <58739961+zhouzaida@users.noreply.github.com> Co-authored-by:
Isotr0py <mozf@mail2.sysu.edu.cn> Co-authored-by:
Nick Hill <nickhill123@gmail.com> Co-authored-by:
youkaichao <youkaichao@gmail.com> Co-authored-by:
gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com> (cherry picked from commit b539f988)
-
- 27 Jan, 2026 1 commit
-
-
Roger Wang authored
Signed-off-by:
wanglinian <wanglinian@stu.pku.edu.cn> Signed-off-by:
wangln19 <96399074+wangln19@users.noreply.github.com> Signed-off-by:
Isotr0py <mozf@mail2.sysu.edu.cn> Signed-off-by:
youkaichao <youkaichao@gmail.com> Signed-off-by:
Roger Wang <hey@rogerw.io> Co-authored-by:
wanglinian <wanglinian@stu.pku.edu.cn> Co-authored-by:
wangln19 <96399074+wangln19@users.noreply.github.com> Co-authored-by:
Zaida Zhou <58739961+zhouzaida@users.noreply.github.com> Co-authored-by:
Isotr0py <mozf@mail2.sysu.edu.cn> Co-authored-by:
Nick Hill <nickhill123@gmail.com> Co-authored-by:
youkaichao <youkaichao@gmail.com> Co-authored-by:
gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
-
- 26 Jan, 2026 2 commits
-
-
dolpm authored
Signed-off-by:dolpm <34420038+dolpm@users.noreply.github.com>
-
Alex Brooks authored
Signed-off-by:Alex-Brooks <Alex.Brooks@ibm.com>
-
- 25 Jan, 2026 1 commit
-
-
Jee Jee Li authored
Signed-off-by:Jee Jee Li <pandaleefree@gmail.com>
-
- 23 Jan, 2026 4 commits
-
-
dolpm authored
Signed-off-by:dolpm <34420038+dolpm@users.noreply.github.com>
-
Xin Yang authored
Signed-off-by:Xin Yang <xyangx@amazon.com>
-
Isotr0py authored
Signed-off-by:
Isotr0py <mozf@mail2.sysu.edu.cn> (cherry picked from commit 8ebf271b)
-
Nick Hill authored
Signed-off-by:Nick Hill <nickhill123@gmail.com>
-
- 22 Jan, 2026 3 commits
-
-
Isotr0py authored
Signed-off-by:Isotr0py <mozf@mail2.sysu.edu.cn>
-
Alex Sun authored
Signed-off-by:Alex Sun <alex.s@amd.com>
-
Wentao Ye authored
Signed-off-by:yewentao256 <zhyanwentao@126.com>
-
- 20 Jan, 2026 2 commits
-
-
dolpm authored
Signed-off-by:dolpm <34420038+dolpm@users.noreply.github.com>
-
Walter Beller-Morales authored
Signed-off-by:walterbm <walter.beller.morales@gmail.com>
-
- 18 Jan, 2026 1 commit
-
-
Karan Bansal authored
Signed-off-by:Karan Bansal <karanb192@gmail.com>
-
- 16 Jan, 2026 3 commits
-
-
zhuwenwen authored
区分pcie和hglink custom allreduce的使用 vllm:export VLLM_CUSTOM_CACHE=1 dtk:export HIP_KERNEL_EVENT_SYSTENFENCE=1 set VLLM_USE_FUSED_RMS_ROPE=1 add SUPPORT_MOE_MARLIN_W16A16 to use moe marlin on bw support fa kvcache fp8 (todo: add VLLM_USE_QUERY_QUANT to not use q quant) update moe_align_block_size
-
zhuwenwen authored
fix _forward_encoder_attention remove medusa set VLLM_PCIE_USE_CUSTOM_ALLREDUCE=1
-
TomerBN-Nvidia authored
Signed-off-by:
Tomer Natan <tbarnatan@computelab-frontend-8.nvidia.com> Signed-off-by:
mgoin <mgoin64@gmail.com> Co-authored-by:
Tomer Natan <tbarnatan@computelab-frontend-8.nvidia.com> Co-authored-by:
mgoin <mgoin64@gmail.com> Co-authored-by:
Tomer Natan <tbarnatan@ipp1-1429.ipp1a1.colossus.nvidia.com>
-
- 15 Jan, 2026 2 commits
-
-
Aleksandr Malyshev authored
Signed-off-by:
Aleksandr Malyshev <maleksan@amd.com> Co-authored-by:
Aleksandr Malyshev <maleksan@amd.com>
-
Pleaplusone authored
[ROCm][Perf] Enable shuffle kv cache layout and assembly paged attention kernel for `AiterFlashAttentionBackend` (#29887) Signed-off-by:ganyi <ygan@amd.com>
-
- 13 Jan, 2026 1 commit
-
-
Roberto L. Castro authored
Signed-off-by:
LopezCastroRoberto <roberto.lopez.castro@udc.es> Signed-off-by:
Roberto L. Castro <38211239+LopezCastroRoberto@users.noreply.github.com> Signed-off-by:
LopezCastroRoberto <rocastro@redhat.com>
-
- 11 Jan, 2026 1 commit
-
-
Fadi Arafeh authored
Signed-off-by:Fadi Arafeh <fadi.arafeh@arm.com>
-
- 09 Jan, 2026 3 commits
-
-
Matthew Bonanni authored
Signed-off-by:Matthew Bonanni <mbonanni@redhat.com>
-
Michael Goin authored
Signed-off-by:mgoin <mgoin64@gmail.com>
-
inkcherry authored
Signed-off-by:inkcherry <mingzhi.liu@amd.com>
-
- 07 Jan, 2026 2 commits
-
-
Kate Cheng authored
Signed-off-by:
Kate Cheng <yunhsuanc@nvidia.com> Signed-off-by:
Jhao-Ting Chen <jhaotingc@nvidia.com> Co-authored-by:
Jhao-Ting Chen <jhaotingc@nvidia.com>
-
zhuwenwen authored
-
- 05 Jan, 2026 1 commit
-
-
Wentao Ye authored
Signed-off-by:yewentao256 <zhyanwentao@126.com>
-
- 19 Dec, 2025 2 commits
-
-
Seiji Eicher authored
Signed-off-by:Seiji Eicher <seiji@anyscale.com>
-
zhuwenwen authored
-
- 18 Dec, 2025 1 commit
-
-
Elizabeth Thomas authored
Signed-off-by:
Elizabeth Thomas <email2eliza@gmail.com> Signed-off-by:
Harry Mellor <19981378+hmellor@users.noreply.github.com> Co-authored-by:
Harry Mellor <19981378+hmellor@users.noreply.github.com>
-