- 08 Dec, 2025 5 commits
-
-
Zhiwei authored
Signed-off-by:ZhiweiYan-96 <zhiwei.yan@amd.com>
-
Nick Hill authored
Signed-off-by:Nick Hill <nhill@redhat.com>
-
Zhijian Jiang authored
Signed-off-by:Zhijian Jiang <Zhijian.Jiang@outlook.com>
-
Andrew Xia authored
Signed-off-by:
Andrew Xia <axia@meta.com> Signed-off-by:
Andrew Xia <axia@fb.com> Co-authored-by:
Andrew Xia <axia@fb.com> Co-authored-by:
Cyrus Leung <tlleungac@connect.ust.hk>
-
daniel-salib authored
Signed-off-by:Daniel Salib <danielsalib@meta.com>
-
- 07 Dec, 2025 14 commits
-
-
ElizaWszola authored
Signed-off-by:
ElizaWszola <ewszola@redhat.com> Signed-off-by:
yewentao256 <zhyanwentao@126.com> Co-authored-by:
yewentao256 <zhyanwentao@126.com>
-
Lucas Wilkinson authored
-
Isotr0py authored
Signed-off-by:Isotr0py <mozf@mail2.sysu.edu.cn>
-
Wentao Ye authored
[Perf] Deepgemm fused layout kernel for activations, 4.3% throughput improvement, 10.7% TTFT improvement. (#29546) Signed-off-by:yewentao256 <zhyanwentao@126.com>
-
Jee Jee Li authored
Signed-off-by:Jee Jee Li <pandaleefree@gmail.com>
-
Jinzhen Lin authored
Signed-off-by:
Jinzhen Lin <jinzhen.ljz@antgroup.com> Co-authored-by:
Jee Jee Li <pandaleefree@gmail.com>
-
Yifan Qiao authored
Signed-off-by:Yifan Qiao <yifanqiao@berkeley.edu>
-
Cyrus Leung authored
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
Luke authored
Signed-off-by:Luke <yq0536@gmail.com>
-
Wentao Ye authored
Signed-off-by:yewentao256 <zhyanwentao@126.com>
-
jeremyteboul authored
Signed-off-by:
Jeremy Teboul <jeremyteboul@fb.com> Co-authored-by:
Jeremy Teboul <jeremyteboul@fb.com>
-
Yanan Cao authored
Signed-off-by:Yanan Cao <gmagogsfm@gmail.com>
-
AuruTus authored
-
- 06 Dec, 2025 21 commits
-
-
Andrew Xia authored
Signed-off-by:
Andrew Xia <axia@fb.com> Co-authored-by:
Andrew Xia <axia@fb.com>
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
Viacheslav authored
Signed-off-by:Viacheslav Barinov <viacheslav.teh@gmail.com>
-
Chukwuma Nwaugha authored
Signed-off-by:Chukwuma Nwaugha <nwaughac@gmail.com>
-
Ye (Charlotte) Qi authored
Signed-off-by:Ye (Charlotte) Qi <yeq@meta.com>
-
Yu Jiaqi authored
Signed-off-by:piood <2477084691@qq.com>
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
redwrasse authored
Signed-off-by:redwrasse <mail@redwrasse.io>
-
kx authored
Signed-off-by:
01267596 <xiongkai123@cmbchina.com> Co-authored-by:
01267596 <xiongkai123@cmbchina.com>
-
Woosuk Kwon authored
Signed-off-by:Woosuk Kwon <woosuk.kwon@berkeley.edu>
-
Nick Hill authored
Signed-off-by:Nick Hill <nhill@redhat.com>
-
rasmith authored
[CI/Build][AMD] Use ROCM_ATTN instead of FLASH_ATTN test for test_register_kv_caches for ROCm and update test for TRITON_ATTN (#29985) Signed-off-by:
Randall Smith <ransmith@amd.com> Co-authored-by:
Randall Smith <ransmith@amd.com> Co-authored-by:
TJian <tunjian.tan@embeddedllm.com>
-
Rohan Potdar authored
Signed-off-by:Rohan138 <rohanpotdar138@gmail.com>
-
Peter Salas authored
Signed-off-by:Peter Salas <peter@fixie.ai>
-
Dongjie Zou authored
Signed-off-by:baonudesifeizhai <baonudesifeizhai@gmail.com>
-
yuttian1 authored
Signed-off-by:yuttian1 <yuttian@amd.com>
-
rasmith authored
[CI/Build][AMD] Skip marlin, machete, and hadacore tests since these require _C functions not defined for ROCm (#30109) Signed-off-by:
Randall Smith <ransmith@amd.com> Co-authored-by:
Randall Smith <ransmith@amd.com>
-
Harry Mellor authored
Better error when world size is larger than node and `distributed_executor_backend` is not set (#30140) Signed-off-by:Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
Samuel Shen authored
Signed-off-by:
Samuel Shen <slshen@uchicago.edu> Co-authored-by:
Samuel Shen <slshen@uchicago.edu>
-
rasmith authored
[CI/Build][AMD][Quantization] Fix test_int8_kernel.py by updating int8_utils to use hip.libdevice.round (#30151) Signed-off-by:
Randall Smith <ransmith@amd.com> Co-authored-by:
Randall Smith <ransmith@amd.com>
-
Deboleina authored
Signed-off-by:Debolina Roy <debroy@redhat.com>
-