- 14 May, 2025 7 commits
-
-
wang.yuqi authored
-
qli88 authored
Signed-off-by:Qiang Li <qiang.li2@amd.com>
-
vllmellm authored
Signed-off-by:
tjtanaa <tunjian.tan@embeddedllm.com> Co-authored-by:
tjtanaa <tunjian.tan@embeddedllm.com>
-
Michael Goin authored
Signed-off-by:mgoin <mgoin64@gmail.com>
-
youkaichao authored
Signed-off-by:youkaichao <youkaichao@gmail.com>
-
Pavani Majety authored
Signed-off-by:Pavani Majety <pmajety@nvidia.com>
-
vllmellm authored
Signed-off-by:
vllmellm <vllm.ellm@embeddedllm.com> Signed-off-by:
tjtanaa <tunjian.tan@embeddedllm.com> Co-authored-by:
tjtanaa <tunjian.tan@embeddedllm.com>
-
- 13 May, 2025 6 commits
-
-
Harry Mellor authored
Signed-off-by:Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
Michael Goin authored
Signed-off-by:mgoin <mgoin64@gmail.com>
-
Michael Goin authored
Signed-off-by:mgoin <mgoin64@gmail.com>
-
Jee Jee Li authored
Signed-off-by:Jee Jee Li <pandaleefree@gmail.com>
-
Tao He authored
Implements dual-chunk-flash-attn backend for dual chunk attention with sparse attention support (#11844)
-
Michael Goin authored
Signed-off-by:
mgoin <mgoin64@gmail.com> Signed-off-by:
Dipika <dipikasikka1@gmail.com> Co-authored-by:
Dipika <dipikasikka1@gmail.com>
-
- 12 May, 2025 1 commit
-
-
Michael Goin authored
Signed-off-by:mgoin <mgoin64@gmail.com>
-
- 11 May, 2025 3 commits
-
-
TJian authored
Signed-off-by:tjtanaa <tunjian.tan@embeddedllm.com>
-
Dipika Sikka authored
Signed-off-by:
Dipika Sikka <dipikasikka1@gmail.com> Signed-off-by:
Dipika <dipikasikka1@gmail.com>
-
Jinzhen Lin authored
Signed-off-by:Jinzhen Lin <linjinzhen@hotmail.com>
-
- 09 May, 2025 6 commits
-
-
Pavani Majety authored
-
Michael Goin authored
Signed-off-by:mgoin <mgoin64@gmail.com>
-
Michael Goin authored
-
qli88 authored
Signed-off-by:Qiang Li <qiang.li2@amd.com>
-
Lucas Wilkinson authored
Signed-off-by:Lucas Wilkinson <lwilkinson@neuralmagic.com>
-
vllmellm authored
Signed-off-by:
vllmellm <vllm.ellm@embeddedllm.com> Co-authored-by:
qli88 <qiang.li2@amd.com> Co-authored-by:
Hongxia Yang <62075498+hongxiayang@users.noreply.github.com>
-
- 08 May, 2025 6 commits
-
-
Shu Wang authored
Signed-off-by:Shu Wang <shuw@nvidia.com>
-
Michael Goin authored
Signed-off-by:mgoin <mgoin64@gmail.com>
-
fxmarty-amd authored
Signed-off-by:
Felix Marty <felmarty@amd.com> Signed-off-by:
kewang2 <kewang2@amd.com> Co-authored-by:
kewang2 <kewang2@amd.com>
-
xsank authored
Signed-off-by:
唯勤 <xsank.mz@alibaba-inc.com> Co-authored-by:
唯勤 <xsank.mz@alibaba-inc.com>
-
Ximingwang-09 authored
-
Hashem Hashemi authored
Signed-off-by:
Hashem Hashemi <hashem.hashemi@amd.com> Signed-off-by:
charlifu <charlifu@amd.com> Co-authored-by:
charlifu <charlifu@amd.com>
-
- 07 May, 2025 6 commits
-
-
Bowen Bao authored
-
Yong Hoon Shin authored
Signed-off-by:Yong Hoon Shin <yhshin@meta.com>
-
Szymon Ożóg authored
Signed-off-by:
SzymonOzog <szymon.ozog@aleph-alpha.com> Signed-off-by:
SzymonOzog <szymon.ozog@gmail.com> Signed-off-by:
Isotr0py <2037008807@qq.com> Co-authored-by:
Isotr0py <2037008807@qq.com>
-
Michael Goin authored
Signed-off-by:Michael Goin <mgoin64@gmail.com>
-
Chih-Chieh Yang authored
[Model] Mamba2 causal conv1d Refactor to Split Prefill and Decode Requests for Corresponding Kernels (#17146) Signed-off-by:Chih-Chieh-Yang <7364402+cyang49@users.noreply.github.com>
-
Hongxia Yang authored
Signed-off-by:Hongxia Yang <hongxia.yang@amd.com>
-
- 06 May, 2025 1 commit
-
-
Mengqing Cao authored
Signed-off-by:Mengqing Cao <cmq0113@163.com>
-
- 05 May, 2025 1 commit
-
-
Jinzhen Lin authored
Signed-off-by:Jinzhen Lin <linjinzhen@hotmail.com>
-
- 04 May, 2025 1 commit
-
-
Harry Mellor authored
Signed-off-by:Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
- 03 May, 2025 2 commits
-
-
rasmith authored
Signed-off-by:Randall Smith <Randall.Smith@amd.com>
-
Eric Hartford authored
-