- 11 May, 2025 1 commit
-
-
Jinzhen Lin authored
Signed-off-by: Jinzhen Lin <linjinzhen@hotmail.com>
-
- 10 May, 2025 1 commit
-
-
tracelogfb authored
Co-authored-by: Stephen Chen <tracelog@meta.com>
-
- 09 May, 2025 3 commits
-
-
Pavani Majety authored
-
Cyrus Leung authored
Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
-
vllmellm authored
Signed-off-by: vllmellm <vllm.ellm@embeddedllm.com>
Co-authored-by: qli88 <qiang.li2@amd.com>
Co-authored-by: Hongxia Yang <62075498+hongxiayang@users.noreply.github.com>
-
- 08 May, 2025 2 commits
-
-
Shu Wang authored
Signed-off-by: Shu Wang <shuw@nvidia.com>
-
Hashem Hashemi authored
Signed-off-by: Hashem Hashemi <hashem.hashemi@amd.com>
Signed-off-by: charlifu <charlifu@amd.com>
Co-authored-by: charlifu <charlifu@amd.com>
-
- 07 May, 2025 3 commits
-
-
Yong Hoon Shin authored
Signed-off-by: Yong Hoon Shin <yhshin@meta.com>
-
Szymon Ożóg authored
Signed-off-by: SzymonOzog <szymon.ozog@aleph-alpha.com>
Signed-off-by: SzymonOzog <szymon.ozog@gmail.com>
Signed-off-by: Isotr0py <2037008807@qq.com>
Co-authored-by: Isotr0py <2037008807@qq.com>
-
Chih-Chieh Yang authored
[Model] Mamba2 causal conv1d Refactor to Split Prefill and Decode Requests for Corresponding Kernels (#17146)
Signed-off-by: Chih-Chieh-Yang <7364402+cyang49@users.noreply.github.com>
-
- 06 May, 2025 3 commits
-
-
Thomas Parnell authored
Signed-off-by: Thomas Parnell <tpa@zurich.ibm.com>
Signed-off-by: Lucas Wilkinson <lwilkinson@neuralmagic.com>
Co-authored-by: Lucas Wilkinson <lwilkinson@neuralmagic.com>
-
Mengqing Cao authored
Signed-off-by: Mengqing Cao <cmq0113@163.com>
-
Lucas Wilkinson authored
Signed-off-by: Lucas Wilkinson <lwilkinson@neuralmagic.com>
-
- 05 May, 2025 1 commit
-
-
Jinzhen Lin authored
Signed-off-by: Jinzhen Lin <linjinzhen@hotmail.com>
-
- 02 May, 2025 1 commit
-
-
Caleb_Du authored
Signed-off-by: Caleb_Du <Caleb_Du@zju.edu.cn>
-
- 01 May, 2025 1 commit
-
-
Sage Moore authored
[torch.compile] Add torch inductor pass for fusing silu_and_mul with subsequent scaled_fp8_quant operations (#10867)
Signed-off-by: Sage Moore <sage@neuralmagic.com>
-
- 27 Apr, 2025 2 commits
-
-
Kaixi Hou authored
Signed-off-by: kaixih <kaixih@nvidia.com>
-
rasmith authored
[Kernel][Triton][FP8] Adding fp8 and variable length sequence support to Triton FAv2 kernel (#12591)
Signed-off-by: Randall Smith <Randall.Smith@amd.com>
-
- 26 Apr, 2025 2 commits
-
-
Happy authored
Signed-off-by: ShuaibinLi <lishuaibin@live.cn>
-
Shu Wang authored
Signed-off-by: shuw <shuw@nvidia.com>
-
- 24 Apr, 2025 1 commit
-
-
Michael Goin authored
Signed-off-by: mgoin <mgoin64@gmail.com>
-
- 23 Apr, 2025 1 commit
-
-
Michael Goin authored
Signed-off-by: mgoin <mgoin64@gmail.com>
-
- 22 Apr, 2025 3 commits
-
-
vllmellm authored
Signed-off-by: vllmellm <vllm.ellm@embeddedllm.com>
Co-authored-by: qli88 <qiang.li2@amd.com>
-
Charlie Fu authored
Signed-off-by: charlifu <charlifu@amd.com>
Co-authored-by: Tyler Michael Smith <tysmith@redhat.com>
-
Varun Sundar Rabindranath authored
Signed-off-by: varun sundar rabindranath <vsundarr@redhat.com>
Co-authored-by: varun sundar rabindranath <vsundarr@redhat.com>
-
- 15 Apr, 2025 2 commits
-
-
Tyler Michael Smith authored
Signed-off-by: Tyler Michael Smith <tyler@neuralmagic.com>
-
Jinzhen Lin authored
Signed-off-by: Jinzhen Lin <linjinzhen@hotmail.com>
Co-authored-by: Michael Goin <michael@neuralmagic.com>
Co-authored-by: mgoin <mgoin64@gmail.com>
-
- 11 Apr, 2025 2 commits
-
-
Michael Goin authored
[Kernel] Support W8A8 channel-wise weights and per-token activations in triton fused_moe_kernel (#16366)
Signed-off-by: mgoin <mgoin64@gmail.com>
-
DefTruth authored
Signed-off-by: DefTruth <qiustudent_r@163.com>
-
- 08 Apr, 2025 1 commit
-
-
Kebe authored
Signed-off-by: Kebe <mail@kebe7jun.com>
-
- 03 Apr, 2025 3 commits
-
-
bnellnm authored
Signed-off-by: Bill Nell <bnell@redhat.com>
-
bnellnm authored
Signed-off-by: Bill Nell <bnell@redhat.com>
-
Aleksandr Malyshev authored
Signed-off-by: Aleksandr Malyshev <maleksan@amd.com>
Signed-off-by: root <root@banff-cyxtera-s65-4.amd.com>
Co-authored-by: Aleksandr Malyshev <maleksan@amd.com>
Co-authored-by: root <root@banff-cyxtera-s65-4.amd.com>
-
- 02 Apr, 2025 1 commit
-
-
LukasBluebaum authored
Signed-off-by: lukas.bluebaum <lukas.bluebaum@aleph-alpha.com>
-
- 01 Apr, 2025 2 commits
-
-
Gerald authored
Signed-off-by: qscqesze <475517977@qq.com>
Co-authored-by: qingjun <qingjun@minimaxi.com>
Co-authored-by: qscqesze <475517977@qq.com>
-
bnellnm authored
Signed-off-by: Bill Nell <bnell@redhat.com>
-
- 31 Mar, 2025 1 commit
-
-
youkaichao authored
Signed-off-by: youkaichao <youkaichao@gmail.com>
-
- 27 Mar, 2025 1 commit
-
-
ElizaWszola authored
Signed-off-by: ElizaWszola <eliza@neuralmagic.com>
Signed-off-by: ElizaWszola <ewszola@redhat.com>
Co-authored-by: Lucas Wilkinson <wilkinson.lucas@gmail.com>
-
- 26 Mar, 2025 1 commit
-
-
vllmellm authored
Signed-off-by: vllmellm <vllm.ellm@embeddedllm.com>
Signed-off-by: tjtanaa <tunjian.tan@embeddedllm.com>
Co-authored-by: tjtanaa <tunjian.tan@embeddedllm.com>
-
- 25 Mar, 2025 1 commit
-
-
Thien Tran authored
Signed-off-by: Thien Tran <gau.nernst@yahoo.com.sg>
-