- 13 Nov, 2025 20 commits
-
-
Huamin Li authored
Signed-off-by: Huamin Li <3ericli@gmail.com>
-
Pleaplusone authored
Signed-off-by: ganyi <ygan@amd.com>
-
zofia authored
Signed-off-by: Zhu, Zufang <zufang.zhu@intel.com>
-
baonudesifeizhai authored
Signed-off-by: baonudesifeizhai <baonudesifeizhai@gmail.com>
-
Zijing Liu authored
Signed-off-by: Zijing Liu <liuzijing2014@gmail.com>
-
Yong Hoon Shin authored
Signed-off-by: Yong Hoon Shin <yhshin@meta.com>
-
Chauncey authored
Signed-off-by: chaunceyjiang <chaunceyjiang@gmail.com>
-
Jiangyun Zhu authored
Signed-off-by: zjy0516 <riverclouds.zhu@qq.com>
-
tjandy98 authored
Signed-off-by: tjandy98 <3953059+tjandy98@users.noreply.github.com>
-
Pleaplusone authored
Signed-off-by: ganyi <ygan@amd.com>
-
Lucia Fang authored
Support DeepEP for Kimi-k2-thinking through enabling gemm selection for compressed-tensor marlin wna16 (#28574)
Signed-off-by: Lu Fang <fanglu@fb.com>
-
Fanli Lin authored
Signed-off-by: Fanli Lin <fanli.lin@intel.com>
-
Pleaplusone authored
Signed-off-by: ganyi <ygan@amd.com>
-
Andrew Xia authored
[Frontend][responsesAPI][1/n] convert responses API tool input to chat completions tool format (#28231)
Signed-off-by: Andrew Xia <axia@fb.com>
Co-authored-by: Andrew Xia <axia@fb.com>
Co-authored-by: Chauncey <chaunceyjiang@gmail.com>
-
Andrew Xia authored
Signed-off-by: Andrew Xia <axia@fb.com>
Co-authored-by: Andrew Xia <axia@fb.com>
-
Jialin Ouyang authored
Signed-off-by: Jialin Ouyang <Jialin.Ouyang@gmail.com>
-
Harry Mellor authored
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
Nick Hill authored
Signed-off-by: Nick Hill <nhill@redhat.com>
-
Michael Goin authored
Signed-off-by: mgoin <mgoin64@gmail.com>
-
wangxiyuan authored
Signed-off-by: wangxiyuan <wangxiyuan1007@gmail.com>
-
- 12 Nov, 2025 20 commits
-
-
Harry Mellor authored
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
Alexander Matveev authored
Signed-off-by: Alexander Matveev <amatveev@redhat.com>
-
ℍ𝕠𝕝𝕝𝕠𝕨 𝕄𝕒𝕟 authored
Signed-off-by: Hollow Man <hollowman@opensuse.org>
Signed-off-by: tjtanaa <tunjian.tan@embeddedllm.com>
Co-authored-by: tjtanaa <tunjian.tan@embeddedllm.com>
Co-authored-by: kliuae <kuanfu.liu@embeddedllm.com>
-
QiliangCui authored
Signed-off-by: Qiliang Cui <derrhein@gmail.com>
-
vllmellm authored
[ROCM] Fix ROCm warnings, environment flag access, and GEMM kernel naming for consistency in `_aiter_ops.py` (#28464)
Signed-off-by: vllmellm <vllm.ellm@embeddedllm.com>
-
Varun Sundar Rabindranath authored
Signed-off-by: Varun Sundar Rabindranath <vsundarr@redhat.com>
Co-authored-by: Varun Sundar Rabindranath <vsundarr@redhat.com>
-
Wei Wei authored
Signed-off-by: Wei Wei <wwei6@meta.com>
-
Andy Lo authored
Signed-off-by: Andy Lo <andy@mistral.ai>
-
Yihua Cheng authored
Signed-off-by: ApostaC <yihua98@uchicago.edu>
-
Thomas Parnell authored
Signed-off-by: Thomas Parnell <tpa@zurich.ibm.com>
-
PerryZhang01 authored
Signed-off-by: Perry Zhang <perzhang@amd.com>
Co-authored-by: Perry Zhang <perzhang@amd.com>
-
alberto authored
Signed-off-by: Alberto Perdomo <aperdomo@redhat.com>
Signed-off-by: alberto <aperdomo@redhat.com>
Co-authored-by: Or Ozeri <or@ozery.com>
-
Benjamin Chislett authored
[Perf] Refactor cudagraph_support to enable full CUDA graphs for spec decoding with FlashInfer (#28479)
Signed-off-by: Benjamin Chislett <bchislett@nvidia.com>
-
Harry Mellor authored
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
Nicolò Lucchesi authored
Signed-off-by: NickLucche <nlucches@redhat.com>
Signed-off-by: Mark McLoughlin <markmc@redhat.com>
Co-authored-by: Mark McLoughlin <markmc@redhat.com>
-
Canlin Guo authored
Signed-off-by: gcanlin <canlinguosdu@gmail.com>
-
Alexander Matveev authored
[Performance][Hopper] Avoid M dim padding to 4x for most cases (due to cuda graphs paddings) (#28492)
Signed-off-by: Alexander Matveev <amatveev@redhat.com>
-
Harry Mellor authored
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
wangxiyuan authored
Signed-off-by: wangxiyuan <wangxiyuan1007@gmail.com>
-
Jee Jee Li authored
Signed-off-by: Jee Jee Li <pandaleefree@gmail.com>
-