- 16 Aug, 2025 3 commits
-
-
Benjamin Chislett authored
Signed-off-by:Benjamin Chislett <benjamin.chislett@centml.ai>
-
Nicolò Lucchesi authored
Signed-off-by:NickLucche <nlucches@redhat.com>
-
Nick Hill authored
Signed-off-by:Nick Hill <nhill@redhat.com>
-
- 15 Aug, 2025 37 commits
-
-
Yichen Yan authored
Signed-off-by: <wenji.yyc@alibaba-inc.com> Signed-off-by:Yichen Yan <wenji.yyc@alibaba-inc.com>
-
Yong Hoon Shin authored
Signed-off-by:Yong Hoon Shin <yhshin@meta.com>
-
Or Ozeri authored
Signed-off-by:Or Ozeri <oro@il.ibm.com>
-
Nick Hill authored
Signed-off-by:
Nick Hill <nhill@redhat.com> Co-authored-by:
Yinghai Lu <yinghai@thinkingmachines.ai>
-
rishitdholakia13 authored
[Structured Outputs] [Bug] Fix misalignment in apply_grammar_bitmask causing unintended masking and NaN logits (#22963) Signed-off-by:rishitdholakia13 <rishit+github@cohere.com>
-
Eli Uriegas authored
Signed-off-by:Eli Uriegas <eliuriegas@meta.com>
-
Thomas Parnell authored
Signed-off-by:Thomas Parnell <tpa@zurich.ibm.com>
-
Lucas Wilkinson authored
Signed-off-by:Lucas Wilkinson <lwilkins@redhat.com>
-
eigen authored
-
Seiji Eicher authored
Signed-off-by:Seiji Eicher <seiji@anyscale.com>
-
shixianc authored
Signed-off-by:
Shixian Cui <shixian@amazon.com> Co-authored-by:
Shixian Cui <shixian@amazon.com>
-
Michael Goin authored
Signed-off-by:mgoin <mgoin64@gmail.com>
-
Michael Goin authored
Signed-off-by:Michael Goin <mgoin64@gmail.com>
-
nvjullin authored
Signed-off-by:Julien Lin <jullin@nvidia.com>
-
Zebing Lin authored
Signed-off-by:linzebing <linzebing1995@gmail.com>
-
Wentao Ye authored
Signed-off-by:yewentao256 <zhyanwentao@126.com>
-
Chih-Chieh Yang authored
Signed-off-by:Chih-Chieh-Yang <7364402+cyang49@users.noreply.github.com>
-
bnellnm authored
[Kernels] Clean up FusedMoeMethodBase and modular kernel setup. Remove extra arguments from modular kernel methods. (#22035) Signed-off-by:
Bill Nell <bnell@redhat.com> Co-authored-by:
Michael Goin <mgoin64@gmail.com>
-
Shanshan Shen authored
Signed-off-by:shen-shanshan <467638484@qq.com>
-
Chenheli Hua authored
Signed-off-by:Chenheli Hua <huachenheli@outlook.com>
-
JartX authored
Signed-off-by:JartX <sagformas@epdcenter.es>
-
sstamenk authored
[BugFix] Skip the Q component for QKVParallelLinear in the case of QKVCrossParallelLinear since its width is 0 (#22369) Signed-off-by:sstamenk <sstamenk@amd.com>
-
Woosuk Kwon authored
Signed-off-by:Woosuk Kwon <woosuk.kwon@berkeley.edu>
-
fhl2000 authored
[Core] Allow full cudagraph with separate attention routines and orthogonal to compilation, add support for FA2 and FlashInfer (#20059) Signed-off-by:
fhl <2410591650@qq.com> Signed-off-by:
fhl2000 <63384265+fhl2000@users.noreply.github.com> Signed-off-by:
Lucas Wilkinson <lwilkins@redhat.com> Signed-off-by:
Lucas Wilkinson <LucasWilkinson@users.noreply.github.com> Co-authored-by:
Luka Govedič <ProExpertProg@users.noreply.github.com> Co-authored-by:
Lucas Wilkinson <lwilkins@redhat.com> Co-authored-by:
Lucas Wilkinson <LucasWilkinson@users.noreply.github.com>
-
Csrayz authored
Signed-off-by:
Csrayz <jover@cmbchina.com> Co-authored-by:
Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
Harry Mellor authored
Signed-off-by:Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
Jee Jee Li authored
Signed-off-by:Jee Jee Li <pandaleefree@gmail.com>
-
Thomas Parnell authored
Signed-off-by:
Daniel Afrimi <danielafrimi8@gmail.com> Signed-off-by:
Thomas Parnell <tpa@zurich.ibm.com> Signed-off-by:
Chen Zhang <zhangch99@outlook.com> Co-authored-by:
Daniel Afrimi <danielafrimi8@gmail.com> Co-authored-by:
Burkhard Ringlein <ngl@zurich.ibm.com> Co-authored-by:
Chen Zhang <zhangch99@outlook.com>
-
Staszek Paśko authored
Signed-off-by:Staszek Pasko <staszek@gmail.com>
-
Roger Wang authored
Signed-off-by:
Roger Wang <hey@rogerw.me> Co-authored-by:
gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
-
Jinzhen Lin authored
Signed-off-by:Jinzhen Lin <jinzhen.ljz@antgroup.com>
-
Sayandip Dutta authored
Signed-off-by:
Sayandip Dutta <sayandip199309@gmail.com> Co-authored-by:
Cyrus Leung <cyrus.tl.leung@gmail.com>
-
amirai21 authored
Signed-off-by:
amirk <amirk@ai21.com> Signed-off-by:
asafg <asafg@ai21.com> Co-authored-by:
asafg <asafg@ai21.com> Co-authored-by:
Asaf Joseph Gardin <39553475+Josephasafg@users.noreply.github.com>
-
wang.yuqi authored
Signed-off-by:wang.yuqi <noooop@126.com>
-
frankie authored
Signed-off-by:
frankie-ys <yongshengwang@cmbchina.com> Signed-off-by:
frankie <wangyongsheng686@gmail.com> Co-authored-by:
Cyrus Leung <cyrus.tl.leung@gmail.com> Co-authored-by:
Kuntai Du <kuntai@uchicago.edu>
-
TJian authored
Signed-off-by:
vllmellm <vllm.ellm@embeddedllm.com> Co-authored-by:
vllmellm <vllm.ellm@embeddedllm.com>
-
Asaf Joseph Gardin authored
Signed-off-by:
asafg <asafg@ai21.com> Co-authored-by:
asafg <asafg@ai21.com>
-