- 15 Nov, 2025 8 commits
-
-
Cyrus Leung authored
Signed-off-by:
Jialin Ouyang <Jialin.Ouyang@gmail.com> Signed-off-by:
DarkLight1337 <tlleungac@connect.ust.hk> Co-authored-by:
Jialin Ouyang <Jialin.Ouyang@gmail.com>
-
Varun Sundar Rabindranath authored
Signed-off-by:
Varun Sundar Rabindranath <vsundarr@redhat.com> Co-authored-by:
Varun Sundar Rabindranath <vsundarr@redhat.com>
-
Chendi.Xue authored
Signed-off-by:
Chendi Xue <chendi.xue@intel.com> Signed-off-by:
Chendi.Xue <chendi.xue@intel.com> Co-authored-by:
Nicolò Lucchesi <nicolo.lucchesi@gmail.com>
-
Mohammad Othman authored
Signed-off-by:
Mohammad Othman <Mo@MohammadOthman.com> Co-authored-by:
Mohammad Othman <Mo@MohammadOthman.com>
-
Nick Hill authored
-
Lukas Geiger authored
Signed-off-by:
Lukas Geiger <lukas.geiger94@gmail.com> Co-authored-by:
Cyrus Leung <tlleungac@connect.ust.hk>
-
QiliangCui authored
Signed-off-by:Qiliang Cui <derrhein@gmail.com>
-
Jialin Ouyang authored
[Core] Performance: Use list[np.ndarray] instead of list[list[int]] for output tokens for GC optimization (#26368) Signed-off-by:Jialin Ouyang <Jialin.Ouyang@gmail.com>
-
- 14 Nov, 2025 32 commits
-
-
rasmith authored
Signed-off-by:
Randall Smith <ransmith@amd.com> Co-authored-by:
Randall Smith <ransmith@amd.com>
-
Thomas Parnell authored
Signed-off-by:Thomas Parnell <tpa@zurich.ibm.com>
-
Benjamin Chislett authored
Signed-off-by:Benjamin Chislett <bchislett@nvidia.com>
-
Alexander Matveev authored
[Bugfix] Fix incorrect use of hidden_states for shared_experts due to do_naive_dispatch_combine (#28740) Signed-off-by:Alexander Matveev <amatveev@redhat.com>
-
Laith Sakka authored
Signed-off-by:Laith Sakka <lsakka@meta.com>
-
Andrey Khalyavin authored
Signed-off-by:Andrey Khalyavin <halyavin@yandex-team.ru>
-
GuanH authored
Signed-off-by:
GuanH <guansdrailib@gmail.com> Signed-off-by:
Isotr0py <mozf@mail2.sysu.edu.cn> Co-authored-by:
Isotr0py <mozf@mail2.sysu.edu.cn>
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
TJian authored
Signed-off-by:tjtanaa <tunjian.tan@embeddedllm.com>
-
Fardin Hoque authored
Signed-off-by:
Fardin Hoque <kfhfar@amazon.com> Co-authored-by:
Wei Wei <wwei6@meta.com>
-
Chen Wang authored
Signed-off-by:
Chen Wang <Chen.Wang1@ibm.com> Signed-off-by:
Harry Mellor <19981378+hmellor@users.noreply.github.com> Co-authored-by:
Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
Mohammad Othman authored
Signed-off-by:Mohammad Othman <emranm226@hotmail.com>
-
Nicolò Lucchesi authored
Signed-off-by:
NickLucche <nlucches@redhat.com> Signed-off-by:
Harry Mellor <19981378+hmellor@users.noreply.github.com> Co-authored-by:
Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
Harry Mellor authored
Signed-off-by:Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
dongbo910220 authored
[Fix] improve aspect ratio in dummy image generation and add common VLM tests for PaddleOCR-VL (#28711) Signed-off-by:dongbo910220 <1275604947@qq.com>
-
Duncan Moss authored
Signed-off-by:
Duncan Moss <djm.moss@gmail.com> Co-authored-by:
Robert Shaw <114415538+robertgshaw2-redhat@users.noreply.github.com> Co-authored-by:
Wentao Ye <44945378+yewentao256@users.noreply.github.com>
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
Nicolò Lucchesi authored
Signed-off-by:NickLucche <nlucches@redhat.com>
-
zhaozx-cn authored
Signed-off-by:zhaozx-cn <zhaozx2116@163.com>
-
Lucas Wilkinson authored
-
Yong Hoon Shin authored
Signed-off-by:
Yong Hoon Shin <yhshin@meta.com> Co-authored-by:
Cyrus Leung <tlleungac@connect.ust.hk>
-
Jingchun Gao authored
Signed-off-by:
gaojc <1055866782@qq.com> Signed-off-by:
Jingchun Gao <gaojingchun1@huawei.com> Signed-off-by:
Jingchun Gao <63247409+gjc0824@users.noreply.github.com> Signed-off-by:
QiuChunshuo <qiuchunshuo@huawei.com> Co-authored-by:
gaojingchun (A) <g00955623@china.huawei.com> Co-authored-by:
Jingchun Gao <gaojingchun1@huawei.com> Co-authored-by:
QiuChunshuo <qiuchunshuo@huawei.com>
-
Shanshan Shen authored
Signed-off-by:
shen-shanshan <467638484@qq.com> Signed-off-by:
Isotr0py <mozf@mail2.sysu.edu.cn> Co-authored-by:
Isotr0py <mozf@mail2.sysu.edu.cn>
-
Srreyansh Sethi authored
Signed-off-by:
WorldExplored <srreyansh.sethi@gmail.com> Signed-off-by:
Srreyansh Sethi <107075589+WorldExplored@users.noreply.github.com> Signed-off-by:
vnadathur <glvikramn@gmail.com> Signed-off-by:
wang.yuqi <yuqi.wang@daocloud.io> Co-authored-by:
vnadathur <236933696+vnadathur@users.noreply.github.com> Co-authored-by:
gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com> Co-authored-by:
vnadathur <glvikramn@gmail.com> Co-authored-by:
wang.yuqi <noooop@126.com> Co-authored-by:
wang.yuqi <yuqi.wang@daocloud.io>
-
lyn610 authored
Add tracking and periodic logging for the number of preempted requests in the metrics logger. This helps monitor system behavior under load. Signed-off-by:Yining Liu <610lyn@gmail.com>
-
Nick Hill authored
Signed-off-by:Nick Hill <nhill@redhat.com>
-
Jiangyun Zhu authored
Signed-off-by:zjy0516 <riverclouds.zhu@qq.com>
-
Yan Ma authored
Signed-off-by:Yan Ma <yan.ma@intel.com>
-
rasmith authored
[Bugfix][CI/Test][Spec Decode] Fix illegal memory access in offline_inference/spec_decode.py (Issue 27619) (#28432) Signed-off-by:
Randall Smith <ransmith@amd.com> Co-authored-by:
Randall Smith <ransmith@amd.com> Co-authored-by:
TJian <tunjian.tan@embeddedllm.com>
-
haoyangli-amd authored
Signed-off-by:Haoyang Li <lihaoyang0109@gmail.com>
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
Hank_ authored
Signed-off-by:
Hank <hcc.mayday@gmail.com> Co-authored-by:
Michael Goin <mgoin64@gmail.com>
-