- 11 Aug, 2025 1 commit
-
-
zhuwenwen authored
Change the default full_cuda_graph launch mode to false
-
- 08 Aug, 2025 1 commit
-
-
zhuwenwen authored
-
- 07 Aug, 2025 1 commit
-
-
zhuwenwen authored
-
- 01 Aug, 2025 3 commits
- 31 Jul, 2025 3 commits
- 24 Jul, 2025 2 commits
- 21 Jul, 2025 1 commit
-
-
Lucas Wilkinson authored
Signed-off-by: Lucas Wilkinson <lwilkins@redhat.com>
Co-authored-by: Michael Goin <mgoin64@gmail.com>
-
- 20 Jul, 2025 1 commit
-
-
Chengji Yao authored
Signed-off-by: Chengji Yao <chengjiyao@google.com>
-
- 19 Jul, 2025 3 commits
-
-
Woosuk Kwon authored
Signed-off-by: Woosuk Kwon <woosuk.kwon@berkeley.edu>
-
Lucas Wilkinson authored
Signed-off-by: Lucas Wilkinson <lwilkins@redhat.com>
-
Lucia Fang authored
Signed-off-by: Lucia Fang <fanglu@fb.com>
Signed-off-by: Lu Fang <fanglu@meta.com>
Signed-off-by: Lu Fang <fanglu@fb.com>
Co-authored-by: Lu Fang <fanglu@meta.com>
-
- 18 Jul, 2025 2 commits
-
-
Lucas Wilkinson authored
-
elvischenv authored
[Bugfix] Fix the tensor non-contiguous issue for Flashinfer TRT-LLM backend attention kernel (#21133)
-
- 17 Jul, 2025 2 commits
-
-
Lucas Wilkinson authored
Signed-off-by: Lucas Wilkinson <lwilkins@redhat.com>
-
QiliangCui authored
Signed-off-by: Qiliang Cui <derrhein@gmail.com>
-
- 16 Jul, 2025 3 commits
-
-
Chengji Yao authored
Signed-off-by: Chengji Yao <chengjiyao@google.com>
-
Peter Pan authored
Signed-off-by: Peter Pan <Peter.Pan@daocloud.io>
-
Elfie Guo authored
Signed-off-by: Elfie Guo <elfieg@nvidia.com>
Co-authored-by: Elfie Guo <eflieg@nvidia.com>
-
- 15 Jul, 2025 2 commits
-
-
Yifei Teng authored
Signed-off-by: Yifei Teng <tengyifei88@gmail.com>
-
Alexander Matveev authored
Signed-off-by: Alexander Matveev <amatveev@redhat.com>
-
- 14 Jul, 2025 1 commit
-
-
Cyrus Leung authored
Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
-
- 12 Jul, 2025 1 commit
-
-
王敏 authored
-
- 11 Jul, 2025 3 commits
-
-
Pavani Majety authored
Signed-off-by: Pavani Majety <pmajety@nvidia.com>
Signed-off-by: mgoin <mgoin64@gmail.com>
Co-authored-by: shuw <shuw@nvidia.com>
Co-authored-by: mgoin <mgoin64@gmail.com>
-
nopperl authored
Signed-off-by: nopperl <54780682+nopperl@users.noreply.github.com>
-
Alexander Matveev authored
-
- 09 Jul, 2025 2 commits
-
-
Tuan, Hoang-Trong authored
Signed-off-by: Tuan M. Hoang-Trong <tmhoangt@us.ibm.com>
Co-authored-by: Tuan M. Hoang-Trong <tmhoangt@us.ibm.com>
Co-authored-by: Tyler Michael Smith <tysmith@redhat.com>
Co-authored-by: Tyler Michael Smith <tyler@neuralmagic.com>
-
Akash kaothalkar authored
Signed-off-by: Akash Kaothalkar <akash.kaothalkar@ibm.com>
Co-authored-by: Akash Kaothalkar <akash.kaothalkar@ibm.com>
Co-authored-by: Nikhil Gupta <nikhil.gupta2@arm.com>
-
- 08 Jul, 2025 2 commits
-
-
Chenyaaang authored
Signed-off-by: Chenyaaang <chenyangli@google.com>
-
Li, Jiang authored
Signed-off-by: jiang1.li <jiang1.li@intel.com>
-
- 07 Jul, 2025 4 commits
- 06 Jul, 2025 1 commit
-
-
Cyrus Leung authored
Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
-
- 05 Jul, 2025 1 commit
-
-
Isotr0py authored
Signed-off-by: Isotr0py <2037008807@qq.com>
Signed-off-by: Isotr0py <mozf@mail2.sysu.edu.cn>
-