- 26 Mar, 2026 3 commits
-
-
Wentao Ye authored
Signed-off-by:yewentao256 <zhyanwentao@126.com>
-
Vadim Gimpelson authored
Signed-off-by:Vadim Gimpelson <vadim.gimpelson@gmail.com>
-
Andreas Karatzas authored
Signed-off-by:Andreas Karatzas <akaratza@amd.com>
-
- 25 Mar, 2026 2 commits
-
-
Yongye Zhu authored
-
Chauncey authored
[Revert] Remove CUDA torch fallbacks for fp8_mqa_logits/fp8_paged_mqa_logits_torch function (#37968) Signed-off-by:chaunceyjiang <chaunceyjiang@gmail.com>
-
- 24 Mar, 2026 1 commit
-
-
Li, Jiang authored
Signed-off-by:jiang1.li <jiang1.li@intel.com>
-
- 23 Mar, 2026 2 commits
-
-
Ranran authored
Signed-off-by:
Ranran <1012869439@qq.com> Signed-off-by:
Ranran <hzz5361@psu.edu> Signed-off-by:
ran <hzz5361@psu.edu> Co-authored-by:
Wentao Ye <44945378+yewentao256@users.noreply.github.com>
-
yzong-rh authored
[Bug][MoE] Strengthen _supports_current_device() checks in the TRTLLM FP8, NVFP4, and FlashInfer CuteDSL MoE experts (#36728) Signed-off-by:Yifan Zong <yzong@redhat.com>
-
- 20 Mar, 2026 2 commits
-
-
SherryC41 authored
Co-authored-by:sherryC41 <sherry.c.c41@gmail.com>
-
Itay Alroy authored
Signed-off-by:Itay Alroy <ialroy@nvidia.com>
-
- 18 Mar, 2026 1 commit
-
-
Xin Yang authored
Signed-off-by:Xin Yang <xyangx@amazon.com>
-
- 16 Mar, 2026 2 commits
-
-
Kunshang Ji authored
Signed-off-by:Kunshang Ji <jikunshang95@gmail.com>
-
leo-cf-tian authored
Signed-off-by:
wzhao18 <wzhao18.sz@gmail.com> Signed-off-by:
Leo Tian <lctian@nvidia.com> Co-authored-by:
wzhao18 <wzhao18.sz@gmail.com> Co-authored-by:
Stefano Castagnetta <scastagnetta@nvidia.com> Co-authored-by:
root <root@lyris0267.lyris.clusters.nvidia.com>
-
- 13 Mar, 2026 1 commit
-
-
Itay Alroy authored
Signed-off-by:
Itay Alroy <ialroy@nvidia.com> Co-authored-by:
Yongji Wu <wuyongji317@gmail.com> Co-authored-by:
Ron Tourgeman <rtourgeman@nvidia.com>
-
- 12 Mar, 2026 2 commits
-
-
Kunshang Ji authored
Signed-off-by:
Kunshang Ji <jikunshang95@gmail.com> Signed-off-by:
Kunshang Ji <kunshang.ji@intel.com>
-
Nick Hill authored
Signed-off-by:Nick Hill <nickhill123@gmail.com>
-
- 10 Mar, 2026 1 commit
-
-
Vadim Gimpelson authored
Signed-off-by:Vadim Gimpelson <vadim.gimpelson@gmail.com>
-
- 09 Mar, 2026 1 commit
-
-
wang.yuqi authored
Signed-off-by:
wang.yuqi <yuqi.wang@daocloud.io> Signed-off-by:
wang.yuqi <noooop@126.com> Co-authored-by:
gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com> Co-authored-by:
Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
- 06 Mar, 2026 1 commit
-
-
Walter Beller-Morales authored
Signed-off-by:walterbm <walter.beller.morales@gmail.com>
-
- 04 Mar, 2026 1 commit
-
-
Kunshang Ji authored
Signed-off-by:
Kunshang Ji <kunshang.ji@intel.com> Signed-off-by:
Kunshang Ji <jikunshang95@gmail.com> Co-authored-by:
Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
- 03 Mar, 2026 1 commit
-
-
Robert Shaw authored
Signed-off-by:
Robert Shaw <robshaw@redhat.com> Signed-off-by:
Robert Shaw <rshaw@neuralmagic.com> Signed-off-by:
Robert Shaw <robertgshaw2@gmail.com> Co-authored-by:
Robert Shaw <robshaw@redhat.com> Co-authored-by:
Robert Shaw <rshaw@neuralmagic.com>
-
- 01 Mar, 2026 1 commit
-
-
Richard Zou authored
Signed-off-by:Richard Zou <zou3519@gmail.com>
-
- 28 Feb, 2026 2 commits
-
-
Chauncey authored
Signed-off-by:chaunceyjiang <chaunceyjiang@gmail.com>
-
Andreas Karatzas authored
Signed-off-by:Andreas Karatzas <akaratza@amd.com>
-
- 27 Feb, 2026 1 commit
-
-
Michael Goin authored
-
- 26 Feb, 2026 1 commit
-
-
Tyler Michael Smith authored
Signed-off-by:
Tyler Michael Smith <tlrmchlsmth@gmail.com> Co-authored-by:
Claude Opus 4.6 <noreply@anthropic.com>
-
- 25 Feb, 2026 2 commits
-
-
Xinyu Chen authored
Signed-off-by:
Xinyu Chen <xinyu1.chen@intel.com> Co-authored-by:
chzhang <chaojun.zhang@intel.com> Co-authored-by:
zhenwei-intel <zhenwei.liu@intel.com> Co-authored-by:
Kunshang Ji <kunshang.ji@intel.com>
-
Kunshang Ji authored
Signed-off-by:
Kunshang Ji <kunshang.ji@intel.com> Signed-off-by:
Kunshang Ji <jikunshang95@gmail.com>
-
- 24 Feb, 2026 1 commit
-
-
danisereb authored
Signed-off-by:Daniel Serebrenik <daserebrenik@nvidia.com>
-
- 23 Feb, 2026 2 commits
-
-
Harry Mellor authored
Signed-off-by:Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
Neil Schemenauer authored
Signed-off-by:Neil Schemenauer <nas@arctrix.com>
-
- 19 Feb, 2026 1 commit
-
-
Manrique Vargas authored
Signed-off-by:machov <mv1742@nyu.edu>
-
- 18 Feb, 2026 1 commit
-
-
ElizaWszola authored
Signed-off-by:
ElizaWszola <ewszola@redhat.com> Co-authored-by:
Luka Govedič <ProExpertProg@users.noreply.github.com>
-
- 17 Feb, 2026 2 commits
-
-
Jongseok Park authored
Signed-off-by:
js_park <cakeng@naver.com> Signed-off-by:
Jongseok Park <37990712+cakeng@users.noreply.github.com> Signed-off-by:
Sunga Kim <sunga.kim@berkeley.edu> Signed-off-by:
Nick Hill <nickhill123@gmail.com> Co-authored-by:
Sunga Kim <sunga.kim@berkeley.edu> Co-authored-by:
Nick Hill <nickhill123@gmail.com>
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
- 14 Feb, 2026 1 commit
-
-
Andreas Karatzas authored
Signed-off-by:Andreas Karatzas <akaratza@amd.com>
-
- 13 Feb, 2026 1 commit
-
-
Wei Zhao authored
[Feature] Support CPU Offloading without Pytorch Pinned Memory that leads to doubled allocation (#32993) Signed-off-by:
wzhao18 <wzhao18.sz@gmail.com> Co-authored-by:
Michael Goin <mgoin64@gmail.com>
-
- 11 Feb, 2026 1 commit
-
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
- 09 Feb, 2026 1 commit
-
-
wang.yuqi authored
Signed-off-by:wang.yuqi <yuqi.wang@daocloud.io>
-
- 06 Feb, 2026 1 commit
-
-
Wentao Ye authored
Signed-off-by:yewentao256 <zhyanwentao@126.com>
-