- 16 May, 2025 1 commit
-
-
kliuae authored
Signed-off-by: kf <kuanfu.liu@embeddedllm.com>
-
- 15 May, 2025 4 commits
-
-
Thomas Parnell authored
Signed-off-by: Thomas Parnell <tpa@zurich.ibm.com>
Co-authored-by: Lucas Wilkinson <lwilkinson@neuralmagic.com>
-
omahs authored
Signed-off-by: omahs <73983677+omahs@users.noreply.github.com>
-
Mark McLoughlin authored
Signed-off-by: Mark McLoughlin <markmc@redhat.com>
-
Chen Zhang authored
Signed-off-by: Chen Zhang <zhangch99@outlook.com>
-
- 14 May, 2025 8 commits
-
-
Aaron Pham authored
Signed-off-by: Aaron Pham <contact@aarnphm.xyz>
Co-authored-by: Russell Bryant <rbryant@redhat.com>
-
bnellnm authored
-
Ekagra Ranjan authored
[V1][Spec Decode] Share input embedding of target model with EAGLE draft model to free ~1GB for llama 3 model (#17326)
Co-authored-by: root <root@ekagra-8xh100.us-east5-a.c.serving-efficiency-poc.internal>
Co-authored-by: Woosuk Kwon <woosuk.kwon@berkeley.edu>
-
Nick Hill authored
-
Michael Goin authored
-
youkaichao authored
Signed-off-by: youkaichao <youkaichao@gmail.com>
-
Roger Wang authored
Signed-off-by: Roger Wang <hey@rogerw.me>
-
Chen Zhang authored
Signed-off-by: Chen Zhang <zhangch99@outlook.com>
-
- 13 May, 2025 6 commits
-
-
Nick Hill authored
Signed-off-by: Nick Hill <nhill@redhat.com>
-
Cyrus Leung authored
Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
-
Jin Huang authored
Signed-off-by: Jin Huang <jinhun@amazon.com>
Co-authored-by: Jin Huang <jinhun@amazon.com>
-
Chen Zhang authored
Signed-off-by: Chen Zhang <zhangch99@outlook.com>
Co-authored-by: Woosuk Kwon <woosuk.kwon@berkeley.edu>
-
Chauncey authored
[Feature][V1] Support `tool_choice: required` when using Xgrammar as the `StructuredOutputBackend`. (#17845)
Signed-off-by: chaunceyjiang <chaunceyjiang@gmail.com>
-
Cyrus Leung authored
Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
-
- 12 May, 2025 7 commits
-
-
wwl2755 authored
Signed-off-by: wwl2755 <wangwenlong2755@gmail.com>
-
Robert Shaw authored
Signed-off-by: rshaw@neuralmagic.com <robertgshaw2@gmail.com>
-
Chen Zhang authored
Signed-off-by: Chen Zhang <zhangch99@outlook.com>
-
Jade Zheng authored
Signed-off-by: Jade Zheng <zheng.shoujian@outlook.com>
-
Robert Shaw authored
Signed-off-by: ApostaC <yihua98@uchicago.edu>
Signed-off-by: Tyler Michael Smith <tyler@neuralmagic.com>
Signed-off-by: rshaw@neuralmagic.com <robertgshaw2@gmail.com>
Signed-off-by: Robert Shaw <rshaw@neuralmagic.com>
Signed-off-by: mgoin <mgoin64@gmail.com>
Signed-off-by: Nick Hill <nhill@redhat.com>
Signed-off-by: Brent Salisbury <bsalisbu@redhat.com>
Co-authored-by: Tyler Michael Smith <tyler@neuralmagic.com>
Co-authored-by: ApostaC <yihua98@uchicago.edu>
Co-authored-by: Robert Shaw <rshaw@neuralmagic.com>
Co-authored-by: mgoin <mgoin64@gmail.com>
Co-authored-by: Nick Hill <nhill@redhat.com>
Co-authored-by: Tyler Michael Smith <tysmith@redhat.com>
Co-authored-by: Brent Salisbury <bsalisbu@redhat.com>
-
Siyuan Liu authored
Signed-off-by: Siyuan Liu <lsiyuan@google.com>
-
Cheng Kuan Yong Jason authored
[Bugfix] validate grammar and throw 400 error instead of crashing the engine when xgrammar validation fails (#17623)
Signed-off-by: Jason Cheng <jasoncky96@gmail.com>
Co-authored-by: Russell Bryant <rbryant@redhat.com>
-
- 11 May, 2025 3 commits
-
-
TJian authored
Signed-off-by: tjtanaa <tunjian.tan@embeddedllm.com>
-
Gregory Shtrasberg authored
Signed-off-by: Gregory Shtrasberg <Gregory.Shtrasberg@amd.com>
-
Ben Browning authored
Signed-off-by: Ben Browning <bbrownin@redhat.com>
-
- 10 May, 2025 3 commits
-
-
Chen Zhang authored
Signed-off-by: Chen Zhang <zhangch99@outlook.com>
-
Chen Zhang authored
Signed-off-by: Chen Zhang <zhangch99@outlook.com>
-
Mark McLoughlin authored
Signed-off-by: Mark McLoughlin <markmc@redhat.com>
-
- 09 May, 2025 6 commits
-
-
Mark McLoughlin authored
Signed-off-by: Mark McLoughlin <markmc@redhat.com>
-
Chen Zhang authored
Signed-off-by: Chen Zhang <zhangch99@outlook.com>
-
vllmellm authored
Signed-off-by: vllmellm <vllm.ellm@embeddedllm.com>
-
Ning Xie authored
-
Lucas Wilkinson authored
Signed-off-by: Lucas Wilkinson <lwilkinson@neuralmagic.com>
-
vllmellm authored
Signed-off-by: vllmellm <vllm.ellm@embeddedllm.com>
Co-authored-by: qli88 <qiang.li2@amd.com>
Co-authored-by: Hongxia Yang <62075498+hongxiayang@users.noreply.github.com>
-
- 08 May, 2025 2 commits
-
-
Russell Bryant authored
Signed-off-by: Russell Bryant <rbryant@redhat.com>
-
Jevin Jiang authored
-