- 22 Apr, 2026 1 commit
-
-
wangmin6 authored
-
- 18 Apr, 2026 1 commit
-
-
wangmin6 authored
-
- 05 Feb, 2026 1 commit
-
-
zhuwenwen authored
-
- 03 Feb, 2026 2 commits
-
-
Richard Zou authored
[torch.compile] Don't do the fast moe cold start optimization if there is speculative decoding (#33624) Signed-off-by:
Richard Zou <zou3519@gmail.com> Co-authored-by:
Robert Shaw <114415538+robertgshaw2-redhat@users.noreply.github.com> (cherry picked from commit 5eac9a1b)
-
Richard Zou authored
Signed-off-by:
Richard Zou <zou3519@gmail.com> (cherry picked from commit d9aa39a3)
-
- 24 Jan, 2026 1 commit
-
-
ElizaWszola authored
Signed-off-by:
ElizaWszola <ewszola@redhat.com> Signed-off-by:
mgoin <mgoin64@gmail.com> Signed-off-by:
Matthew Bonanni <mbonanni@redhat.com> Signed-off-by:
Luka Govedič <luka.govedic@gmail.com> Signed-off-by:
Lucas Wilkinson <lwilkins@redhat.com> Signed-off-by:
Luka Govedič <ProExpertProg@users.noreply.github.com> Signed-off-by:
Luka Govedič <lgovedic@redhat.com> Co-authored-by:
mgoin <mgoin64@gmail.com> Co-authored-by:
Varun Sundar Rabindranath <varunsundar08@gmail.com> Co-authored-by:
Matthew Bonanni <mbonanni@redhat.com> Co-authored-by:
Luka Govedič <ProExpertProg@users.noreply.github.com> Co-authored-by:
Luka Govedič <luka.govedic@gmail.com> Co-authored-by:
Lucas Wilkinson <lwilkins@redhat.com> Co-authored-by:
Luka Govedič <lgovedic@redhat.com>
-
- 22 Jan, 2026 1 commit
-
-
Richard Zou authored
Signed-off-by:Richard Zou <zou3519@gmail.com>
-
- 19 Jan, 2026 1 commit
-
-
Nicolò Lucchesi authored
Signed-off-by:NickLucche <nlucches@redhat.com>
-
- 09 Jan, 2026 1 commit
-
-
Matthew Bonanni authored
Signed-off-by:Matthew Bonanni <mbonanni@redhat.com>
-
- 08 Jan, 2026 1 commit
-
-
Ronald authored
Signed-off-by:Ronald1995 <ronaldautomobile@163.com>
-
- 05 Jan, 2026 1 commit
-
-
zzzzwwjj authored
Signed-off-by:
zzzzwwjj <1183291235@qq.com> Signed-off-by:
zzzzwwjj <34335947+zzzzwwjj@users.noreply.github.com> Co-authored-by:
Cyrus Leung <cyrus.tl.leung@gmail.com>
-
- 02 Jan, 2026 1 commit
-
-
Nick Hill authored
Signed-off-by:
Nick Hill <nhill@redhat.com> Signed-off-by:
njhill <nickhill123@gmail.com>
-
- 23 Dec, 2025 1 commit
-
-
Weida Hong authored
Signed-off-by:Weida Hong <wdhongtw@google.com>
-
- 09 Dec, 2025 1 commit
-
-
Lucas Wilkinson authored
Signed-off-by:Lucas Wilkinson <lwilkins@redhat.com>
-
- 27 Nov, 2025 1 commit
-
-
Matthew Bonanni authored
Signed-off-by:Matthew Bonanni <mbonanni@redhat.com>
-
- 26 Nov, 2025 1 commit
-
-
Lucas Wilkinson authored
-
- 24 Nov, 2025 1 commit
-
-
Didier Durand authored
Signed-off-by:
Didier Durand <durand.didier@gmail.com> Signed-off-by:
Harry Mellor <19981378+hmellor@users.noreply.github.com> Co-authored-by:
Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
- 15 Nov, 2025 1 commit
-
-
Varun Sundar Rabindranath authored
Signed-off-by:
Varun Sundar Rabindranath <vsundarr@redhat.com> Co-authored-by:
Varun Sundar Rabindranath <vsundarr@redhat.com>
-
- 11 Nov, 2025 1 commit
-
-
Canlin Guo authored
Signed-off-by:gcanlin <canlinguosdu@gmail.com>
-
- 20 Oct, 2025 1 commit
-
-
Andy Lo authored
Signed-off-by:
Andy Lo <andy@mistral.ai> Co-authored-by:
Jee Jee Li <pandaleefree@gmail.com>
-
- 12 Oct, 2025 1 commit
-
-
Harry Mellor authored
Signed-off-by:Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
- 11 Oct, 2025 1 commit
-
-
zhuwenwen authored
-
- 10 Oct, 2025 2 commits
-
-
Sage Moore authored
Signed-off-by:Sage Moore <sage@neuralmagic.com>
-
Lucas Wilkinson authored
Signed-off-by:
Lucas Wilkinson <lwilkins@redhat.com> Signed-off-by:
Lucas Wilkinson <LucasWilkinson@users.noreply.github.com> Co-authored-by:
Benjamin Chislett <chislett.ben@gmail.com>
-
- 09 Oct, 2025 1 commit
-
-
Sage Moore authored
Signed-off-by:Sage Moore <sage@neuralmagic.com>
-
- 07 Oct, 2025 1 commit
-
-
Sage Moore authored
Signed-off-by:
Sage Moore <sage@neuralmagic.com> Signed-off-by:
mgoin <mgoin64@gmail.com> Co-authored-by:
mgoin <mgoin64@gmail.com>
-
- 05 Oct, 2025 1 commit
-
-
Harry Mellor authored
Signed-off-by:Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
- 28 Sep, 2025 1 commit
-
-
Tyler Michael Smith authored
Signed-off-by:
Tyler Michael Smith <tlrmchlsmth@gmail.com> Signed-off-by:
simon-mo <simon.mo@hey.com>
-
- 27 Sep, 2025 1 commit
-
-
Tyler Michael Smith authored
Signed-off-by:Tyler Michael Smith <tlrmchlsmth@gmail.com>
-
- 26 Sep, 2025 1 commit
-
-
fhl2000 authored
Signed-off-by:
fhl2000 <63384265+fhl2000@users.noreply.github.com> Co-authored-by:
Luka Govedič <ProExpertProg@users.noreply.github.com>
-
- 16 Sep, 2025 1 commit
-
-
Sage Moore authored
Signed-off-by:
Lucas Wilkinson <lwilkins@redhat.com> Signed-off-by:
Sage Moore <sage@neuralmagic.com> Signed-off-by:
Lucas Wilkinson <lwilkinson@neuralmagic.com> Signed-off-by:
yewentao256 <zhyanwentao@126.com> Co-authored-by:
Lucas Wilkinson <lwilkins@redhat.com> Co-authored-by:
Lucas Wilkinson <lwilkinson@neuralmagic.com> Co-authored-by:
yewentao256 <zhyanwentao@126.com> Co-authored-by:
Lucas Wilkinson <LucasWilkinson@users.noreply.github.com> Co-authored-by:
Robert Shaw <114415538+robertgshaw2-redhat@users.noreply.github.com>
-
- 15 Sep, 2025 1 commit
-
-
Sage Moore authored
Signed-off-by:Sage Moore <sage@neuralmagic.com>
-
- 19 Aug, 2025 1 commit
-
-
zhuwenwen authored
-
- 18 Aug, 2025 1 commit
-
-
zhuwenwen authored
-
- 15 Aug, 2025 1 commit
-
-
fhl2000 authored
[Core] Allow full cudagraph with separate attention routines and orthogonal to compilation, add support for FA2 and FlashInfer (#20059) Signed-off-by:
fhl <2410591650@qq.com> Signed-off-by:
fhl2000 <63384265+fhl2000@users.noreply.github.com> Signed-off-by:
Lucas Wilkinson <lwilkins@redhat.com> Signed-off-by:
Lucas Wilkinson <LucasWilkinson@users.noreply.github.com> Co-authored-by:
Luka Govedič <ProExpertProg@users.noreply.github.com> Co-authored-by:
Lucas Wilkinson <lwilkins@redhat.com> Co-authored-by:
Lucas Wilkinson <LucasWilkinson@users.noreply.github.com>
-
- 08 Aug, 2025 1 commit
-
-
Shu Wang authored
Signed-off-by:Shu Wang <shuw@nvidia.com>
-
- 11 Jul, 2025 1 commit
-
-
lizhigong authored
-
- 13 Jun, 2025 1 commit
-
-
Luka Govedič authored
Signed-off-by:luka <luka@neuralmagic.com>
-
- 03 Jun, 2025 1 commit
-
-
Simon Mo authored
Signed-off-by:simon-mo <simon.mo@hey.com>
-
- 01 Jun, 2025 1 commit
-
-
zhrrr authored
[Misc] reuse num_tokens_across_dp of get_dp_padding to avoid unnecessary dp all reduce in set_forward_context (#18935) Signed-off-by:
Tyler Michael Smith <tysmith@redhat.com> Co-authored-by:
zhuhaoran <zhuhaoran.zhr@alibaba-inc.com> Co-authored-by:
Tyler Michael Smith <tysmith@redhat.com>
-