- 02 Feb, 2026 1 commit
yugong333 authored
Reduce the kernel overhead when num of active loras is smaller than max loras. Multiple cuda graphs are captured for each num of active-loras. (#32005)
Signed-off-by: Yu Gong <yu3.gong@gmail.com>
- 23 Jan, 2026 1 commit
Lucas Wilkinson authored
Signed-off-by: Lucas Wilkinson <lwilkins@redhat.com>
- 20 Jan, 2026 1 commit
Lucas Wilkinson authored
Signed-off-by: Lucas Wilkinson <lwilkins@redhat.com>
Signed-off-by: Matthew Bonanni <mbonanni@redhat.com>
Co-authored-by: Matthew Bonanni <mbonanni@redhat.com>
- 18 Dec, 2025 1 commit
Elizabeth Thomas authored
Signed-off-by: Elizabeth Thomas <email2eliza@gmail.com>
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
Co-authored-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
- 09 Dec, 2025 1 commit
Lucas Wilkinson authored
Signed-off-by: Lucas Wilkinson <lwilkins@redhat.com>
- 02 Dec, 2025 1 commit
Harry Mellor authored
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
- 26 Nov, 2025 1 commit
Lucas Wilkinson authored
- 25 Oct, 2025 1 commit
Jiangyun Zhu authored
Signed-off-by: zjy0516 <riverclouds.zhu@qq.com>
- 15 Oct, 2025 1 commit
Morrison Turnansky authored
[Frontend][torch.compile] CompilationConfig Overhaul (#20283): name change compilation level to compilation mode, deprecation compilation level (#26355)
Signed-off-by: morrison-turnansky <mturnans@redhat.com>
Signed-off-by: Morrison Turnansky <mturnans@redhat.com>
Co-authored-by: Luka Govedič <ProExpertProg@users.noreply.github.com>
- 05 Oct, 2025 1 commit
Harry Mellor authored
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
- 26 Sep, 2025 1 commit
fhl2000 authored
Signed-off-by: fhl2000 <63384265+fhl2000@users.noreply.github.com>
Co-authored-by: Luka Govedič <ProExpertProg@users.noreply.github.com>
- 15 Aug, 2025 1 commit
fhl2000 authored
[Core] Allow full cudagraph with separate attention routines and orthogonal to compilation, add support for FA2 and FlashInfer (#20059)
Signed-off-by: fhl <2410591650@qq.com>
Signed-off-by: fhl2000 <63384265+fhl2000@users.noreply.github.com>
Signed-off-by: Lucas Wilkinson <lwilkins@redhat.com>
Signed-off-by: Lucas Wilkinson <LucasWilkinson@users.noreply.github.com>
Co-authored-by: Luka Govedič <ProExpertProg@users.noreply.github.com>
Co-authored-by: Lucas Wilkinson <lwilkins@redhat.com>
Co-authored-by: Lucas Wilkinson <LucasWilkinson@users.noreply.github.com>