- 26 Sep, 2025 1 commit
-
-
fhl2000 authored
Signed-off-by:
fhl2000 <63384265+fhl2000@users.noreply.github.com> Co-authored-by:
Luka Govedič <ProExpertProg@users.noreply.github.com>
-
- 15 Aug, 2025 1 commit
-
-
fhl2000 authored
[Core] Allow full cudagraph with separate attention routines and orthogonal to compilation, add support for FA2 and FlashInfer (#20059) Signed-off-by:
fhl <2410591650@qq.com> Signed-off-by:
fhl2000 <63384265+fhl2000@users.noreply.github.com> Signed-off-by:
Lucas Wilkinson <lwilkins@redhat.com> Signed-off-by:
Lucas Wilkinson <LucasWilkinson@users.noreply.github.com> Co-authored-by:
Luka Govedič <ProExpertProg@users.noreply.github.com> Co-authored-by:
Lucas Wilkinson <lwilkins@redhat.com> Co-authored-by:
Lucas Wilkinson <LucasWilkinson@users.noreply.github.com>
-
- 13 Jun, 2025 1 commit
-
-
Luka Govedič authored
Signed-off-by:luka <luka@neuralmagic.com>
-
- 08 Jun, 2025 1 commit
-
-
Richard Zou authored
Signed-off-by:Richard Zou <zou3519@gmail.com>
-
- 03 Jun, 2025 1 commit
-
-
Simon Mo authored
Signed-off-by:simon-mo <simon.mo@hey.com>
-
- 22 May, 2025 1 commit
-
-
Mengqing Cao authored
Signed-off-by:
Mengqing Cao <cmq0113@163.com> Co-authored-by:
youkaichao <youkaichao@gmail.com>
-