- 14 May, 2025 22 commits
-
-
Harry Mellor authored
Signed-off-by:Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
Harry Mellor authored
Signed-off-by:Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
TJian authored
Signed-off-by:
tjtanaa <tunjian.tan@embeddedllm.com> Co-authored-by:
Gregory Shtrasberg <Gregory.Shtrasberg@amd.com>
-
Andrzej Kotłowski authored
Signed-off-by:Andrzej Kotłowski <akotlowski@habana.ai>
-
rongfu.leng authored
Signed-off-by:rongfu.leng <rongfu.leng@daocloud.io>
-
Jee Jee Li authored
Signed-off-by:Jee Jee Li <pandaleefree@gmail.com>
-
wang.yuqi authored
-
lkchen authored
Signed-off-by:Linkun <github@lkchen.net>
-
qli88 authored
Signed-off-by:Qiang Li <qiang.li2@amd.com>
-
Charlie Fu authored
Signed-off-by:charlifu <charlifu@amd.com>
-
vllmellm authored
Signed-off-by:
tjtanaa <tunjian.tan@embeddedllm.com> Co-authored-by:
tjtanaa <tunjian.tan@embeddedllm.com>
-
Michael Goin authored
-
Michael Goin authored
Signed-off-by:mgoin <mgoin64@gmail.com>
-
youkaichao authored
Signed-off-by:youkaichao <youkaichao@gmail.com>
-
Jon Gill authored
Signed-off-by:Jon Gill <jon@yurts.ai>
-
Roger Wang authored
Signed-off-by:Roger Wang <hey@rogerw.me>
-
Nick Hill authored
Signed-off-by:Nick Hill <nhill@redhat.com>
-
Pavani Majety authored
Signed-off-by:Pavani Majety <pmajety@nvidia.com>
-
Luka Govedič authored
Signed-off-by:Luka Govedič <lgovedic@redhat.com>
-
Chen Zhang authored
Signed-off-by:Chen Zhang <zhangch99@outlook.com>
-
vllmellm authored
Signed-off-by:
vllmellm <vllm.ellm@embeddedllm.com> Signed-off-by:
tjtanaa <tunjian.tan@embeddedllm.com> Co-authored-by:
tjtanaa <tunjian.tan@embeddedllm.com>
-
- 13 May, 2025 18 commits
-
-
Nick Hill authored
Signed-off-by:Nick Hill <nhill@redhat.com>
-
Harry Mellor authored
Signed-off-by:Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
Harry Mellor authored
Signed-off-by:Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
Harry Mellor authored
Signed-off-by:Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
Harry Mellor authored
Signed-off-by:Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
Harry Mellor authored
Signed-off-by:Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
Aaron Pham authored
Signed-off-by:Aaron Pham <contact@aarnphm.xyz>
-
Michael Goin authored
Signed-off-by:mgoin <mgoin64@gmail.com>
-
Woosuk Kwon authored
Signed-off-by:Woosuk Kwon <woosuk.kwon@berkeley.edu>
-
Jin Huang authored
Signed-off-by:
Jin Huang <jinhun@amazon.com> Co-authored-by:
Jin Huang <jinhun@amazon.com>
-
Chen Zhang authored
Signed-off-by:
Chen Zhang <zhangch99@outlook.com> Co-authored-by:
Woosuk Kwon <woosuk.kwon@berkeley.edu>
-
Calvin Chen authored
Signed-off-by:calvin chen <120380290@qq.com>
-
Chauncey authored
[Feature][V1] Support `tool_choice: required` when using Xgrammar as the `StructuredOutputBackend`. (#17845) Signed-off-by:chaunceyjiang <chaunceyjiang@gmail.com>
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
Michael Goin authored
Signed-off-by:mgoin <mgoin64@gmail.com>
-
Jee Jee Li authored
Signed-off-by:Jee Jee Li <pandaleefree@gmail.com>
-
Tao He authored
Implements dual-chunk-flash-attn backend for dual chunk attention with sparse attention support (#11844)
-