- 17 Jul, 2025 6 commits
-
Michael Goin authored
Signed-off-by: mgoin <mgoin64@gmail.com>
-
XiongfeiWei authored
Signed-off-by: Xiongfei Wei <isaacwxf23@gmail.com>
-
Michael Goin authored
Signed-off-by: mgoin <mgoin64@gmail.com>
-
Kevin_Xiong authored
Signed-off-by: KevinXiong-C <kevin_xiong1997@outlook.com>
-
Michael Goin authored
Signed-off-by: mgoin <mgoin64@gmail.com>
-
QiliangCui authored
Signed-off-by: Qiliang Cui <derrhein@gmail.com>
-
- 16 Jul, 2025 27 commits
-
Nir David authored
Support FP8 Quantization and Inference Run on Intel Gaudi (HPU) using INC (Intel Neural Compressor) (#12010)
Signed-off-by: Nir David <ndavid@habana.ai>
Signed-off-by: Uri Livne <ulivne@habana.ai>
Co-authored-by: Uri Livne <ulivne@habana.ai>
-
Cyrus Leung authored
Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
-
Harry Mellor authored
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
Avshalom Manevich authored
Signed-off-by: h-avsha <avshalom.manevich@hcompany.ai>
-
Mac Misiura authored
feat - add a new endpoint `get_tokenizer_info` to provide tokenizer/chat-template information (#20575)
Signed-off-by: m-misiura <mmisiura@redhat.com>
-
Cyrus Leung authored
Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
-
Michael Yao authored
Signed-off-by: windsonsea <haifeng.yao@daocloud.io>
-
Seiji Eicher authored
Signed-off-by: Seiji Eicher <seiji@anyscale.com>
-
Lucas Wilkinson authored
Signed-off-by: Lucas Wilkinson <lwilkins@redhat.com>
-
Chengji Yao authored
Signed-off-by: Chengji Yao <chengjiyao@google.com>
-
zhiweiz authored
Signed-off-by: qizixi <qizixi@meta.com>
Co-authored-by: qizixi <qizixi@meta.com>
-
Peter Pan authored
Signed-off-by: Peter Pan <Peter.Pan@daocloud.io>
-
Maximilien de Bayser authored
Signed-off-by: Max de Bayser <mbayser@br.ibm.com>
-
Patrick von Platen authored
Signed-off-by: Patrick von Platen <patrick.v.platen@gmail.com>
Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
-
Wentao Ye authored
Signed-off-by: yewentao256 <zhyanwentao@126.com>
-
Michael Goin authored
Signed-off-by: mgoin <mgoin64@gmail.com>
-
Chendi.Xue authored
Signed-off-by: Chendi.Xue <chendi.xue@intel.com>
-
Doug Smith authored
Signed-off-by: dougbtv <dosmith@redhat.com>
-
Ming Yang authored
Signed-off-by: Ming Yang <minos.future@gmail.com>
-
Ricardo Decal authored
Signed-off-by: Ricardo Decal <rdecal@anyscale.com>
-
Reid authored
Signed-off-by: reidliu41 <reid201711@gmail.com>
-
Brayden Zhong authored
Signed-off-by: Brayden Zhong <b8zhong@uwaterloo.ca>
-
Chauncey authored
Signed-off-by: chaunceyjiang <chaunceyjiang@gmail.com>
-
Thomas Parnell authored
[Model] Add ModelConfig class for GraniteMoeHybrid to override default max_seq_len_to_capture (#20923)
Signed-off-by: Thomas Parnell <tpa@zurich.ibm.com>
-
Michael Goin authored
Signed-off-by: mgoin <mgoin64@gmail.com>
-
Chauncey authored
Signed-off-by: chaunceyjiang <chaunceyjiang@gmail.com>
-
Elfie Guo authored
Signed-off-by: Elfie Guo <elfieg@nvidia.com>
Co-authored-by: Elfie Guo <eflieg@nvidia.com>
-
- 15 Jul, 2025 7 commits
-
Chen LI authored
Signed-off-by: Chen Li <lcpingping@gmail.com>
Co-authored-by: Russell Bryant <rbryant@redhat.com>
Signed-off-by: Russell Bryant <rbryant@redhat.com>
-
Marko Rosenmueller authored
Signed-off-by: Marko Rosenmueller <5467316+dr75@users.noreply.github.com>
-
Tuan, Hoang-Trong authored
[BugFix] fix 3 issues: (1) using metadata for causal-conv1d, (2) indexing overflow in v1 vLLM, and (3) init_states in v0 (#20838)
Signed-off-by: Tuan M. Hoang-Trong <tmhoangt@us.ibm.com>
Co-authored-by: Tuan M. Hoang-Trong <tmhoangt@us.ibm.com>
-
Gregory Shtrasberg authored
Signed-off-by: Gregory Shtrasberg <Gregory.Shtrasberg@amd.com>
-
Harry Mellor authored
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
Harry Mellor authored
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
Harry Mellor authored
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
-