- 13 Nov, 2025 13 commits
-
-
Lucia Fang authored
Support DeepEP for Kimi-k2-thinking through enabling gemm selection for compressed-tensor marlin wna16 (#28574) Signed-off-by:Lu Fang <fanglu@fb.com>
-
Fanli Lin authored
Signed-off-by:Fanli Lin <fanli.lin@intel.com>
-
Pleaplusone authored
Signed-off-by:ganyi <ygan@amd.com>
-
Andrew Xia authored
[Frontend][responsesAPI][1/n] convert responses API tool input to chat completions tool format (#28231) Signed-off-by:
Andrew Xia <axia@fb.com> Co-authored-by:
Andrew Xia <axia@fb.com> Co-authored-by:
Chauncey <chaunceyjiang@gmail.com>
-
Andrew Xia authored
Signed-off-by:
Andrew Xia <axia@fb.com> Co-authored-by:
Andrew Xia <axia@fb.com>
-
Jialin Ouyang authored
Signed-off-by:Jialin Ouyang <Jialin.Ouyang@gmail.com>
-
Harry Mellor authored
Signed-off-by:Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
Harry Mellor authored
Signed-off-by:Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
Nick Hill authored
Signed-off-by:Nick Hill <nhill@redhat.com>
-
Michael Goin authored
Signed-off-by:mgoin <mgoin64@gmail.com>
-
wangxiyuan authored
Signed-off-by:wangxiyuan <wangxiyuan1007@gmail.com>
-
Gregory Shtrasberg authored
Signed-off-by:Gregory Shtrasberg <Gregory.Shtrasberg@amd.com>
-
Michael Goin authored
Signed-off-by:mgoin <mgoin64@gmail.com>
-
- 12 Nov, 2025 27 commits
-
-
Harry Mellor authored
Signed-off-by:Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
Alexander Matveev authored
Signed-off-by:Alexander Matveev <amatveev@redhat.com>
-
ℍ𝕠𝕝𝕝𝕠𝕨 𝕄𝕒𝕟 authored
Signed-off-by:
Hollow Man <hollowman@opensuse.org> Signed-off-by:
tjtanaa <tunjian.tan@embeddedllm.com> Co-authored-by:
tjtanaa <tunjian.tan@embeddedllm.com> Co-authored-by:
kliuae <kuanfu.liu@embeddedllm.com>
-
Michael Goin authored
-
QiliangCui authored
Signed-off-by:Qiliang Cui <derrhein@gmail.com>
-
vllmellm authored
[ROCM] Fix ROCm warnings, environment flag access, and GEMM kernel naming for consistency in `_aiter_ops.py` (#28464) Signed-off-by:vllmellm <vllm.ellm@embeddedllm.com>
-
Varun Sundar Rabindranath authored
Signed-off-by:
Varun Sundar Rabindranath <vsundarr@redhat.com> Co-authored-by:
Varun Sundar Rabindranath <vsundarr@redhat.com>
-
Wei Wei authored
Signed-off-by:Wei Wei <wwei6@meta.com>
-
Andy Lo authored
Signed-off-by:Andy Lo <andy@mistral.ai>
-
Yihua Cheng authored
Signed-off-by:ApostaC <yihua98@uchicago.edu>
-
Harry Mellor authored
[CI] Skip "Multi-Modal Models Test (Extended) 3" test that's broken in current Transformers (#28559) Signed-off-by:Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
Thomas Parnell authored
Signed-off-by:Thomas Parnell <tpa@zurich.ibm.com>
-
PerryZhang01 authored
Signed-off-by:
Perry Zhang <perzhang@amd.com> Co-authored-by:
Perry Zhang <perzhang@amd.com>
-
alberto authored
Signed-off-by:
Alberto Perdomo <aperdomo@redhat.com> Signed-off-by:
alberto <aperdomo@redhat.com> Co-authored-by:
Or Ozeri <or@ozery.com>
-
Benjamin Chislett authored
[Perf] Refactor cudagraph_support to enable full CUDA graphs for spec decoding with FlashInfer (#28479) Signed-off-by:Benjamin Chislett <bchislett@nvidia.com>
-
Harry Mellor authored
Signed-off-by:Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
Nicolò Lucchesi authored
Signed-off-by:
NickLucche <nlucches@redhat.com> Signed-off-by:
Mark McLoughlin <markmc@redhat.com> Co-authored-by:
Mark McLoughlin <markmc@redhat.com>
-
Canlin Guo authored
Signed-off-by:gcanlin <canlinguosdu@gmail.com>
-
Alexander Matveev authored
[Performance][Hopper] Avoid M dim padding to 4x for most cases (due to cuda graphs paddings) (#28492) Signed-off-by:Alexander Matveev <amatveev@redhat.com>
-
Harry Mellor authored
Signed-off-by:Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
wangxiyuan authored
Signed-off-by:wangxiyuan <wangxiyuan1007@gmail.com>
-
Jee Jee Li authored
Signed-off-by:Jee Jee Li <pandaleefree@gmail.com>
-
TJian authored
Signed-off-by:tjtanaa <tunjian.tan@embeddedllm.com>
-
ZhengHongming888 authored
Signed-off-by:Hongming Zheng <hongming.zheng@intel.com>
-
ziruiliu authored
Signed-off-by:
Zirui Liu <ziliu@ddn.com> Signed-off-by:
ziruiliu <ziliu@ddn.com> Co-authored-by:
gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com> Co-authored-by:
Nicolò Lucchesi <nlucches@redhat.com>
-
Chaojun Zhang authored
Signed-off-by:chaojun-zhang <chaojun.zhang@intel.com>
-
wuyaoxuehun authored
Signed-off-by:
wuao.scotty <wuao.scotty@bytedance.com> Co-authored-by:
wuao.scotty <wuao.scotty@bytedance.com>
-