- 07 Dec, 2025 12 commits
-
-
Isotr0py authored
Signed-off-by:Isotr0py <mozf@mail2.sysu.edu.cn>
-
Wentao Ye authored
[Perf] Deepgemm fused layout kernel for activations, 4.3% throughput improvement, 10.7% TTFT improvement. (#29546) Signed-off-by:yewentao256 <zhyanwentao@126.com>
-
Jee Jee Li authored
Signed-off-by:Jee Jee Li <pandaleefree@gmail.com>
-
Jinzhen Lin authored
Signed-off-by:
Jinzhen Lin <jinzhen.ljz@antgroup.com> Co-authored-by:
Jee Jee Li <pandaleefree@gmail.com>
-
Yifan Qiao authored
Signed-off-by:Yifan Qiao <yifanqiao@berkeley.edu>
-
Cyrus Leung authored
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
Luke authored
Signed-off-by:Luke <yq0536@gmail.com>
-
Wentao Ye authored
Signed-off-by:yewentao256 <zhyanwentao@126.com>
-
jeremyteboul authored
Signed-off-by:
Jeremy Teboul <jeremyteboul@fb.com> Co-authored-by:
Jeremy Teboul <jeremyteboul@fb.com>
-
Yanan Cao authored
Signed-off-by:Yanan Cao <gmagogsfm@gmail.com>
-
AuruTus authored
-
- 06 Dec, 2025 21 commits
-
-
Andrew Xia authored
Signed-off-by:
Andrew Xia <axia@fb.com> Co-authored-by:
Andrew Xia <axia@fb.com>
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
Viacheslav authored
Signed-off-by:Viacheslav Barinov <viacheslav.teh@gmail.com>
-
Chukwuma Nwaugha authored
Signed-off-by:Chukwuma Nwaugha <nwaughac@gmail.com>
-
Ye (Charlotte) Qi authored
Signed-off-by:Ye (Charlotte) Qi <yeq@meta.com>
-
Yu Jiaqi authored
Signed-off-by:piood <2477084691@qq.com>
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
redwrasse authored
Signed-off-by:redwrasse <mail@redwrasse.io>
-
kx authored
Signed-off-by:
01267596 <xiongkai123@cmbchina.com> Co-authored-by:
01267596 <xiongkai123@cmbchina.com>
-
Woosuk Kwon authored
Signed-off-by:Woosuk Kwon <woosuk.kwon@berkeley.edu>
-
Nick Hill authored
Signed-off-by:Nick Hill <nhill@redhat.com>
-
rasmith authored
[CI/Build][AMD] Use ROCM_ATTN instead of FLASH_ATTN test for test_register_kv_caches for ROCm and update test for TRITON_ATTN (#29985) Signed-off-by:
Randall Smith <ransmith@amd.com> Co-authored-by:
Randall Smith <ransmith@amd.com> Co-authored-by:
TJian <tunjian.tan@embeddedllm.com>
-
Rohan Potdar authored
Signed-off-by:Rohan138 <rohanpotdar138@gmail.com>
-
Peter Salas authored
Signed-off-by:Peter Salas <peter@fixie.ai>
-
Dongjie Zou authored
Signed-off-by:baonudesifeizhai <baonudesifeizhai@gmail.com>
-
yuttian1 authored
Signed-off-by:yuttian1 <yuttian@amd.com>
-
rasmith authored
[CI/Build][AMD] Skip marlin, machete, and hadacore tests since these require _C functions not defined for ROCm (#30109) Signed-off-by:
Randall Smith <ransmith@amd.com> Co-authored-by:
Randall Smith <ransmith@amd.com>
-
Harry Mellor authored
Better error when world size is larger than node and `distributed_executor_backend` is not set (#30140) Signed-off-by:Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
Samuel Shen authored
Signed-off-by:
Samuel Shen <slshen@uchicago.edu> Co-authored-by:
Samuel Shen <slshen@uchicago.edu>
-
rasmith authored
[CI/Build][AMD][Quantization] Fix test_int8_kernel.py by updating int8_utils to use hip.libdevice.round (#30151) Signed-off-by:
Randall Smith <ransmith@amd.com> Co-authored-by:
Randall Smith <ransmith@amd.com>
-
Deboleina authored
Signed-off-by:Debolina Roy <debroy@redhat.com>
-
- 05 Dec, 2025 7 commits
-
-
Wentao Ye authored
Signed-off-by:yewentao256 <zhyanwentao@126.com>
-
Bangsheng Tang authored
-
Divakar Verma authored
Signed-off-by:Divakar Verma <divakar.verma@amd.com>
-
Nicolò Lucchesi authored
Signed-off-by:NickLucche <nlucches@redhat.com>
-
Russell Bryant authored
Signed-off-by:Russell Bryant <rbryant@redhat.com>
-
Nicolò Lucchesi authored
Signed-off-by:NickLucche <nlucches@redhat.com>
-
Tova Movshovitz authored
Signed-off-by:
tovam <tovam@pliops.com> Signed-off-by:
Tova Movshovitz <tovam@pliops.com> Co-authored-by:
gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
-