- 22 May, 2025 1 commit
-
-
Tyler Michael Smith authored
Signed-off-by: Tyler Michael Smith <tyler@neuralmagic.com>
Signed-off-by: Lucas Wilkinson <lwilkinson@neuralmagic.com>
Signed-off-by: Tyler Michael Smith <tysmith@redhat.com>
Co-authored-by: Lucas Wilkinson <lwilkinson@neuralmagic.com>
-
- 15 May, 2025 1 commit
-
-
Lucas Wilkinson authored
Signed-off-by: Lucas Wilkinson <lwilkinson@neuralmagic.com>
-
- 13 May, 2025 1 commit
-
-
Tao He authored
Implements dual-chunk-flash-attn backend for dual chunk attention with sparse attention support (#11844)
-
- 09 May, 2025 1 commit
-
-
Pavani Majety authored
-
- 08 May, 2025 1 commit
-
-
Shu Wang authored
Signed-off-by: Shu Wang <shuw@nvidia.com>
-
- 05 May, 2025 2 commits
-
-
Jinzhen Lin authored
Signed-off-by: Jinzhen Lin <linjinzhen@hotmail.com>
-
Tyler Michael Smith authored
Signed-off-by: Tyler Michael Smith <tyler@neuralmagic.com>
-
- 03 May, 2025 1 commit
-
-
Tyler Michael Smith authored
Signed-off-by: Tyler Michael Smith <tyler@neuralmagic.com>
-
- 02 May, 2025 1 commit
-
-
Caleb_Du authored
Signed-off-by: Caleb_Du <Caleb_Du@zju.edu.cn>
-
- 01 May, 2025 1 commit
-
-
Sage Moore authored
[torch.compile] Add torch inductor pass for fusing silu_and_mul with subsequent scaled_fp8_quant operations (#10867)
Signed-off-by: Sage Moore <sage@neuralmagic.com>
-
- 30 Apr, 2025 1 commit
-
-
Huy Do authored
-
- 27 Apr, 2025 1 commit
-
-
Kaixi Hou authored
Signed-off-by: kaixih <kaixih@nvidia.com>
-
- 22 Apr, 2025 1 commit
-
-
Charlie Fu authored
Signed-off-by: charlifu <charlifu@amd.com>
Co-authored-by: Tyler Michael Smith <tysmith@redhat.com>
-
- 15 Apr, 2025 1 commit
-
-
Jinzhen Lin authored
Signed-off-by: Jinzhen Lin <linjinzhen@hotmail.com>
Co-authored-by: Michael Goin <michael@neuralmagic.com>
Co-authored-by: mgoin <mgoin64@gmail.com>
-
- 11 Apr, 2025 1 commit
-
-
DefTruth authored
Signed-off-by: DefTruth <qiustudent_r@163.com>
-
- 01 Apr, 2025 1 commit
-
-
Ilya Markov authored
Signed-off-by: ilmarkov <imarkov@redhat.com>
Co-authored-by: ilmarkov <imarkov@redhat.com>
-
- 31 Mar, 2025 2 commits
-
-
Harry Mellor authored
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
youkaichao authored
Signed-off-by: youkaichao <youkaichao@gmail.com>
-
- 29 Mar, 2025 1 commit
-
-
Gregory Shtrasberg authored
Signed-off-by: Gregory Shtrasberg <Gregory.Shtrasberg@amd.com>
-
- 27 Mar, 2025 1 commit
-
-
ElizaWszola authored
Signed-off-by: ElizaWszola <eliza@neuralmagic.com>
Signed-off-by: ElizaWszola <ewszola@redhat.com>
Co-authored-by: Lucas Wilkinson <wilkinson.lucas@gmail.com>
-
- 14 Mar, 2025 2 commits
-
-
Michael Goin authored
Signed-off-by: mgoin <michael@neuralmagic.com>
Signed-off-by: mgoin <mgoin64@gmail.com>
Signed-off-by: luka <luka@neuralmagic.com>
Signed-off-by: Tyler Michael Smith <tyler@neuralmagic.com>
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
Co-authored-by: Luka Govedič <ProExpertProg@users.noreply.github.com>
Co-authored-by: DarkLight1337 <tlleungac@connect.ust.hk>
Co-authored-by: Tyler Michael Smith <tyler@neuralmagic.com>
Co-authored-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
Yajie Wang authored
Signed-off-by: wyj371990 <wyj371990@alibaba-inc.com>
-
- 12 Mar, 2025 2 commits
-
-
TJian authored
[FEAT] [ROCm] [Embedding] Add encoder-only model support into ROCm Flash Attention to enable embedding models. (#14664)
Signed-off-by: tjtanaa <tunjian.tan@embeddedllm.com>
-
Sage Moore authored
Signed-off-by: Sage Moore <sage@neuralmagic.com>
-
- 11 Mar, 2025 2 commits
-
-
Lucas Wilkinson authored
Signed-off-by: Lucas Wilkinson <lwilkins@redhat.com>
-
Jinzhen Lin authored
Signed-off-by: Jinzhen Lin <linjinzhen@hotmail.com>
Co-authored-by: mgoin <mgoin64@gmail.com>
-
- 08 Mar, 2025 1 commit
-
-
Lucas Wilkinson authored
Signed-off-by: Lucas Wilkinson <lwilkins@redhat.com>
Signed-off-by: Lucas Wilkinson <lwilkinson@neuralmagic.com>
Signed-off-by: Tyler Michael Smith <tyler@neuralmagic.com>
Co-authored-by: Tyler Michael Smith <tyler@neuralmagic.com>
-
- 05 Mar, 2025 1 commit
-
-
Michael Goin authored
Signed-off-by: mgoin <mgoin64@gmail.com>
-
- 04 Mar, 2025 1 commit
-
-
kushanam authored
-
- 01 Mar, 2025 1 commit
-
-
YajieWang authored
-
- 27 Feb, 2025 1 commit
-
-
Lucas Wilkinson authored
Signed-off-by: Lucas Wilkinson <lwilkinson@neuralmagic.com>
Signed-off-by: Lucas Wilkinson <lwilkins@redhat.com>
-
- 26 Feb, 2025 1 commit
-
-
Henry Tsang authored
-
- 25 Feb, 2025 1 commit
-
-
Gregory Shtrasberg authored
-
- 22 Feb, 2025 1 commit
-
-
Kaixi Hou authored
-
- 20 Feb, 2025 1 commit
-
-
Gregory Shtrasberg authored
-
- 14 Feb, 2025 1 commit
-
-
Tyler Michael Smith authored
Signed-off-by: Tyler Michael Smith <tyler@neuralmagic.com>
-
- 13 Feb, 2025 1 commit
-
-
Kaixi Hou authored
-
- 11 Feb, 2025 1 commit
-
-
Yuhong Guo authored
Signed-off-by: YuhongGuo <yuhong.gyh@antgroup.com>
Co-authored-by: Tyler Michael Smith <tyler@neuralmagic.com>
-
- 07 Feb, 2025 1 commit
-
-
Lucas Wilkinson authored
-
- 31 Jan, 2025 1 commit
-
-
Lucas Wilkinson authored
-