- 05 Dec, 2025 14 commits
-
amitz-nv authored
[Frontend][Model] Add 'float16' to possible mamba cache dtype values, override mamba SSM cache dtype value for NemotronH (#29978)
Signed-off-by: amitz-nv <203509407+amitz-nv@users.noreply.github.com>
-
Tiger Xu / Zhonghu Xu authored
Signed-off-by: Zhonghu Xu <xuzhonghu@huawei.com>
-
Micah Williamson authored
[AMD][CI] Add ray[default] Dependency On ROCm To Pass v1/metrics/test_engine_logger_apis.py (#30110)
Signed-off-by: Micah Williamson <micah.williamson@amd.com>
-
Chukwuma Nwaugha authored
Signed-off-by: nwaughac@gmail.com
-
Jingchun Gao authored
Signed-off-by: Jingchun Gao <gaojingchun1@huawei.com>
Signed-off-by: Jingchun Gao <63247409+gjc0824@users.noreply.github.com>
Co-authored-by: Jingchun Gao <gaojingchun1@huawei.com>
-
Charlie Fu authored
Signed-off-by: charlifu <charlifu@amd.com>
-
Laith Sakka authored
Signed-off-by: Laith Sakka <lsakka@meta.com>
-
Charlie Fu authored
Signed-off-by: charlifu <charlifu@amd.com>
-
Qiu authored
Signed-off-by: QiuChunshuo <qiuchunshuo@huawei.com>
Co-authored-by: Cyrus Leung <tlleungac@connect.ust.hk>
-
Hubert de La Jonquiere authored
Signed-off-by: hdlj-h <hubert@hcompany.ai>
-
Shengqi Chen authored
[CI] fix silent error in nightly wheel index generation script, add generation time to HTML index (#30060)
Signed-off-by: Shengqi Chen <harry-chen@outlook.com>
-
Zhewen Li authored
Signed-off-by: zhewenli <zhewenli@meta.com>
-
Zhewen Li authored
Signed-off-by: zhewenli <zhewenli@meta.com>
-
Alexander Matveev authored
Signed-off-by: Alexander Matveev <amatveev@redhat.com>
-
- 04 Dec, 2025 26 commits
-
TimWang authored
Signed-off-by: Tim <tim.wang03@sap.com>
-
Laith Sakka authored
Signed-off-by: Laith Sakka <lsakka@meta.com>
-
Lucas Wilkinson authored
Signed-off-by: Lucas Wilkinson <lwilkins@redhat.com>
-
Peng-YM authored
[Bugfix] Missing tokens in `return_token_ids` when tool parsers are enabled in streaming mode (#29074)
Signed-off-by: Peng-YM <1048217874pengym@gmail.com>
-
Mercykid-bash authored
Signed-off-by: Che Ruan <cr623@ic.ac.uk>
Signed-off-by: mengxingkongzhouhan <117415539+mengxingkongzhouhan@users.noreply.github.com>
Signed-off-by: Mercykid-bash <ruanche0218@gmail.com>
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
Co-authored-by: Che Ruan <cr623@ic.ac.uk>
Co-authored-by: mengxingkongzhouhan <117415539+mengxingkongzhouhan@users.noreply.github.com>
Co-authored-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
Harry Mellor authored
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
Kuntai Du authored
[KVConnector] Remove v0-related kv connector components such as kv pipe and kv lookup buffer (#29705)
Signed-off-by: KuntaiDu <kuntai@uchicago.edu>
-
Jee Jee Li authored
Signed-off-by: Jee Jee Li <pandaleefree@gmail.com>
-
Tao Yun authored
Signed-off-by: taoyun <1069423820@qq.com>
Signed-off-by: Cyrus Leung <cyrus.tl.leung@gmail.com>
Co-authored-by: Cyrus Leung <cyrus.tl.leung@gmail.com>
-
Qiu authored
Signed-off-by: QiuChunshuo <qiuchunshuo@huawei.com>
-
Cyrus Leung authored
Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
-
Shengqi Chen authored
Signed-off-by: Shengqi Chen <harry-chen@outlook.com>
-
Doug Smith authored
Signed-off-by: dougbtv <dosmith@redhat.com>
-
Woosuk Kwon authored
Signed-off-by: Woosuk Kwon <woosuk.kwon@berkeley.edu>
-
Harry Mellor authored
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
Yongtao Huang authored
Signed-off-by: Yongtao Huang <yongtaoh2022@gmail.com>
-
Harry Mellor authored
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
wang.yuqi authored
Signed-off-by: wang.yuqi <noooop@126.com>
Signed-off-by: wang.yuqi <yuqi.wang@daocloud.io>
Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
-
Kevin H. Luu authored
Signed-off-by: Kevin H. Luu <khluu000@gmail.com>
Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
-
Chauncey authored
Signed-off-by: chaunceyjiang <chaunceyjiang@gmail.com>
Signed-off-by: Chauncey <chaunceyjiang@gmail.com>
Co-authored-by: Cyrus Leung <cyrus.tl.leung@gmail.com>
-
Andreas Karatzas authored
Signed-off-by: Andreas Karatzas <akaratza@amd.com>
-
Noa Neria authored
Signed-off-by: Noa Neria <noa@run.ai>
-
dtc authored
Signed-off-by: Tianchen Ding <dtcccc@linux.alibaba.com>
Signed-off-by: dtc <dtcccc@linux.alibaba.com>
Co-authored-by: Nicolò Lucchesi <nicolo.lucchesi@gmail.com>
-
rasmith authored
[CI/Build][AMD] Skip test_hybrid_attention_mamba_tensor_shapes on ROCm, requires FLASHINFER (#29995)
Signed-off-by: Randall Smith <ransmith@amd.com>
Co-authored-by: Randall Smith <ransmith@amd.com>
-
Arpit Khandelwal authored
Signed-off-by: arpitkh101 <arpit5khandelwal@gmail.com>
Co-authored-by: Cyrus Leung <tlleungac@connect.ust.hk>
-
Xu Wenqing authored
Signed-off-by: 许文卿 <xwq391974@alibaba-inc.com>
-