- 17 Jul, 2025 14 commits
-
-
kYLe authored
Signed-off-by:
Kyle Huang <kylhuang@nvidia.com> Co-authored-by:
Cyrus Leung <cyrus.tl.leung@gmail.com>
-
Asher authored
Signed-off-by:Asher Zhang <asherszhang@tencent.com>
-
Varun Sundar Rabindranath authored
Signed-off-by:
Varun Sundar Rabindranath <vsundarr@redhat.com> Co-authored-by:
Varun Sundar Rabindranath <vsundarr@redhat.com>
-
Chauncey authored
Signed-off-by:chaunceyjiang <chaunceyjiang@gmail.com>
-
Jee Jee Li authored
Signed-off-by:Jee Jee Li <pandaleefree@gmail.com>
-
David Ben-David authored
Signed-off-by:
David Ben-David <davidb@pliops.com> Co-authored-by:
David Ben-David <davidb@pliops.com>
-
Zhonghua Deng authored
Signed-off-by:Abatom <abzhonghua@gmail.com>
-
Lucas Wilkinson authored
Signed-off-by:Lucas Wilkinson <lwilkins@redhat.com>
-
Michael Goin authored
Signed-off-by:mgoin <mgoin64@gmail.com>
-
XiongfeiWei authored
Signed-off-by:Xiongfei Wei <isaacwxf23@gmail.com>
-
Michael Goin authored
Signed-off-by:mgoin <mgoin64@gmail.com>
-
Kevin_Xiong authored
Signed-off-by:KevinXiong-C <kevin_xiong1997@outlook.com>
-
Michael Goin authored
Signed-off-by:mgoin <mgoin64@gmail.com>
-
QiliangCui authored
Signed-off-by:Qiliang Cui <derrhein@gmail.com>
-
- 16 Jul, 2025 26 commits
-
-
Nir David authored
Support FP8 Quantization and Inference Run on Intel Gaudi (HPU) using INC (Intel Neural Compressor) (#12010) Signed-off-by:
Nir David <ndavid@habana.ai> Signed-off-by:
Uri Livne <ulivne@habana.ai> Co-authored-by:
Uri Livne <ulivne@habana.ai>
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
Harry Mellor authored
Signed-off-by:Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
Avshalom Manevich authored
Signed-off-by:h-avsha <avshalom.manevich@hcompany.ai>
-
Mac Misiura authored
feat - add a new endpoint `get_tokenizer_info` to provide tokenizer/chat-template information (#20575) Signed-off-by:m-misiura <mmisiura@redhat.com>
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
Michael Yao authored
Signed-off-by:windsonsea <haifeng.yao@daocloud.io>
-
Seiji Eicher authored
Signed-off-by:Seiji Eicher <seiji@anyscale.com>
-
Lucas Wilkinson authored
Signed-off-by:Lucas Wilkinson <lwilkins@redhat.com>
-
Chengji Yao authored
Signed-off-by:Chengji Yao <chengjiyao@google.com>
-
zhiweiz authored
Signed-off-by:
qizixi <qizixi@meta.com> Co-authored-by:
qizixi <qizixi@meta.com>
-
Peter Pan authored
Signed-off-by:Peter Pan <Peter.Pan@daocloud.io>
-
Maximilien de Bayser authored
Signed-off-by:Max de Bayser <mbayser@br.ibm.com>
-
Patrick von Platen authored
Signed-off-by:
Patrick von Platen <patrick.v.platen@gmail.com> Co-authored-by:
gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
-
Wentao Ye authored
Signed-off-by:yewentao256 <zhyanwentao@126.com>
-
Michael Goin authored
Signed-off-by:mgoin <mgoin64@gmail.com>
-
Chendi.Xue authored
Signed-off-by:Chendi.Xue <chendi.xue@intel.com>
-
Doug Smith authored
Signed-off-by:dougbtv <dosmith@redhat.com>
-
Ming Yang authored
Signed-off-by:Ming Yang <minos.future@gmail.com>
-
Ricardo Decal authored
Signed-off-by:Ricardo Decal <rdecal@anyscale.com>
-
Reid authored
Signed-off-by:reidliu41 <reid201711@gmail.com>
-
Brayden Zhong authored
Signed-off-by:Brayden Zhong <b8zhong@uwaterloo.ca>
-
Chauncey authored
Signed-off-by:chaunceyjiang <chaunceyjiang@gmail.com>
-
Thomas Parnell authored
[Model] Add ModelConfig class for GraniteMoeHybrid to override default max_seq_len_to_capture (#20923) Signed-off-by:Thomas Parnell <tpa@zurich.ibm.com>
-
Michael Goin authored
Signed-off-by:mgoin <mgoin64@gmail.com>
-
Chauncey authored
Signed-off-by:chaunceyjiang <chaunceyjiang@gmail.com>
-