"vllm/vscode:/vscode.git/clone" did not exist on "a4bcf959ab1b097c9abd3667e5b7a4a6bf07c9ca"
- 16 Aug, 2025 6 commits
-
-
afeldman-nm authored
Signed-off-by:
Andrew Feldman <afeldman@redhat.com> Signed-off-by:
Andrew Feldman <afeld2012@gmail.com> Signed-off-by:
Harry Mellor <19981378+hmellor@users.noreply.github.com> Signed-off-by:
Nick Hill <nhill@redhat.com> Co-authored-by:
Nick Hill <nhill@redhat.com> Co-authored-by:
Andrew Feldman <afeld2012@gmail.com> Co-authored-by:
Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
Michael Goin authored
Signed-off-by:mgoin <mgoin64@gmail.com>
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
Calvin Chen authored
Signed-off-by:
calvin chen <wen.chen@dynamia.ai> Co-authored-by:
Nick Hill <nhill@redhat.com>
-
Nicolò Lucchesi authored
Signed-off-by:NickLucche <nlucches@redhat.com>
-
Nick Hill authored
Signed-off-by:Nick Hill <nhill@redhat.com>
-
- 15 Aug, 2025 14 commits
-
-
Yong Hoon Shin authored
Signed-off-by:Yong Hoon Shin <yhshin@meta.com>
-
Or Ozeri authored
Signed-off-by:Or Ozeri <oro@il.ibm.com>
-
Nick Hill authored
Signed-off-by:
Nick Hill <nhill@redhat.com> Co-authored-by:
Yinghai Lu <yinghai@thinkingmachines.ai>
-
rishitdholakia13 authored
[Structured Outputs] [Bug] Fix misalignment in apply_grammar_bitmask causing unintended masking and NaN logits (#22963) Signed-off-by:rishitdholakia13 <rishit+github@cohere.com>
-
eigen authored
-
Zebing Lin authored
Signed-off-by:linzebing <linzebing1995@gmail.com>
-
Wentao Ye authored
Signed-off-by:yewentao256 <zhyanwentao@126.com>
-
JartX authored
Signed-off-by:JartX <sagformas@epdcenter.es>
-
fhl2000 authored
[Core] Allow full cudagraph with separate attention routines and orthogonal to compilation, add support for FA2 and FlashInfer (#20059) Signed-off-by:
fhl <2410591650@qq.com> Signed-off-by:
fhl2000 <63384265+fhl2000@users.noreply.github.com> Signed-off-by:
Lucas Wilkinson <lwilkins@redhat.com> Signed-off-by:
Lucas Wilkinson <LucasWilkinson@users.noreply.github.com> Co-authored-by:
Luka Govedič <ProExpertProg@users.noreply.github.com> Co-authored-by:
Lucas Wilkinson <lwilkins@redhat.com> Co-authored-by:
Lucas Wilkinson <LucasWilkinson@users.noreply.github.com>
-
Thomas Parnell authored
Signed-off-by:
Daniel Afrimi <danielafrimi8@gmail.com> Signed-off-by:
Thomas Parnell <tpa@zurich.ibm.com> Signed-off-by:
Chen Zhang <zhangch99@outlook.com> Co-authored-by:
Daniel Afrimi <danielafrimi8@gmail.com> Co-authored-by:
Burkhard Ringlein <ngl@zurich.ibm.com> Co-authored-by:
Chen Zhang <zhangch99@outlook.com>
-
Roger Wang authored
Signed-off-by:
Roger Wang <hey@rogerw.me> Co-authored-by:
gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
-
amirai21 authored
Signed-off-by:
amirk <amirk@ai21.com> Signed-off-by:
asafg <asafg@ai21.com> Co-authored-by:
asafg <asafg@ai21.com> Co-authored-by:
Asaf Joseph Gardin <39553475+Josephasafg@users.noreply.github.com>
-
Asaf Joseph Gardin authored
Signed-off-by:
asafg <asafg@ai21.com> Co-authored-by:
asafg <asafg@ai21.com>
-
Wentao Ye authored
Signed-off-by:yewentao256 <zhyanwentao@126.com>
-
- 14 Aug, 2025 6 commits
-
-
Nick Hill authored
Signed-off-by:Nick Hill <nhill@redhat.com>
-
Nick Hill authored
Signed-off-by:Nick Hill <nhill@redhat.com>
-
nvjullin authored
Signed-off-by:
Julien Lin <jullin@nvidia.com> Signed-off-by:
mgoin <mgoin64@gmail.com> Co-authored-by:
mgoin <mgoin64@gmail.com>
-
Lucas Wilkinson authored
Signed-off-by:Lucas Wilkinson <lwilkins@redhat.com>
-
iAmir97 authored
Signed-off-by:
iAmir97 <Amir.balwel@embeddedllm.com> Signed-off-by:
iAmir97 <71513472+iAmir97@users.noreply.github.com> Co-authored-by:
iAmir97 <Amir.balwel@embeddedllm.com> Co-authored-by:
gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
-
Nick Hill authored
Signed-off-by:
Nick Hill <nhill@redhat.com> Co-authored-by:
Isotr0py <mozf@mail2.sysu.edu.cn>
-
- 13 Aug, 2025 5 commits
-
-
Jialin Ouyang authored
Signed-off-by:Jialin Ouyang <Jialin.Ouyang@gmail.com>
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
Chen Zhang authored
Signed-off-by:Chen Zhang <zhangch99@outlook.com>
-
Giancarlo Delfin authored
Signed-off-by:Giancarlo Delfin <gdelfin@meta.com>
-
Michael Goin authored
Signed-off-by:mgoin <mgoin64@gmail.com>
-
- 12 Aug, 2025 3 commits
-
-
Xiaozhu Meng authored
Signed-off-by:
Xiaozhu <mxz297@gmail.com> Signed-off-by:
Michael Goin <mgoin64@gmail.com> Co-authored-by:
Michael Goin <mgoin64@gmail.com> Co-authored-by:
gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
-
Rahul Tuli authored
Signed-off-by:Rahul Tuli <rtuli@redhat.com>
-
wang.yuqi authored
[Bugfix] Fix ModernBert load & Enable sliding window attention for bidirectional attention. (#22637) Signed-off-by:
wang.yuqi <noooop@126.com> Signed-off-by:
Max de Bayser <mbayser@br.ibm.com> Co-authored-by:
Max de Bayser <mbayser@br.ibm.com>
-
- 11 Aug, 2025 3 commits
-
-
wang.yuqi authored
Signed-off-by:wang.yuqi <noooop@126.com>
-
Maximilien de Bayser authored
Signed-off-by:Max de Bayser <mbayser@br.ibm.com>
-
Nick Hill authored
Signed-off-by:Nick Hill <nhill@redhat.com>
-
- 10 Aug, 2025 2 commits
-
-
Chengji Yao authored
[TPU] kv cache update kernel doesn't need to be padded slices to multiple of num_slices_per_block (#22394) Signed-off-by:
Chengji Yao <chengjiyao@gmail.com> Co-authored-by:
Chengji Yao <chengjiyao@gmail.com>
-
Thomas Parnell authored
Signed-off-by:Thomas Parnell <tpa@zurich.ibm.com>
-
- 09 Aug, 2025 1 commit
-
-
Or Ozeri authored
Signed-off-by:Or Ozeri <oro@il.ibm.com>
-