- 16 Apr, 2026 1 commit
-
-
vllmellm authored
Signed-off-by:vllmellm <vllm.ellm@embeddedllm.com>
-
- 15 Apr, 2026 3 commits
-
-
zofia authored
Signed-off-by:Zhu, Zufang <zufang.zhu@intel.com>
-
Zhenzhong Xu authored
Signed-off-by:
Zhenzhong1 <zhenzhong.xu@intel.com> Signed-off-by:
Zhenzhong Xu <zhenzhong.xu@intel.com>
-
Vibhav Agarwal authored
Signed-off-by:
vibhavagarwal5 <vibhavagarwal5@gmail.com> Signed-off-by:
Michael Goin <mgoin64@gmail.com> Co-authored-by:
Xinyu Chen <xinyu1.chen@intel.com> Co-authored-by:
Michael Goin <mgoin64@gmail.com>
-
- 14 Apr, 2026 9 commits
-
-
Michael Goin authored
Signed-off-by:
mgoin <mgoin64@gmail.com> Co-authored-by:
Claude <noreply@anthropic.com>
-
bnellnm authored
Signed-off-by:Bill Nell <bnell@redhat.com>
-
Jackmin801 authored
Signed-off-by:
Robert Shaw <robertgshaw2@gmail.com> Signed-off-by:
Jackmin801 <ongjackm@gmail.com> Co-authored-by:
Robert Shaw <robertgshaw2@gmail.com>
-
roikoren755 authored
Signed-off-by:Roi Koren <roik@nvidia.com>
-
danielafrimi authored
Signed-off-by:
root <root@lyris0017.lyris.clusters.nvidia.com> Signed-off-by:
Daniel Afrimi <dafrimi@nvidia.com> Co-authored-by:
root <root@lyris0017.lyris.clusters.nvidia.com>
-
Albert Cheng authored
Signed-off-by:
Michael Goin <mgoin64@gmail.com> Co-authored-by:
Albert Cheng (Engrg-Hardware 1) <albecheng@login-lyris02.lyris.clusters.nvidia.com> Co-authored-by:
Michael Goin <mgoin64@gmail.com> Co-authored-by:
gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
-
Hexiang Wang authored
Signed-off-by:whx-sjtu <2952154980@qq.com>
-
bnellnm authored
Signed-off-by:
Bill Nell <bnell@redhat.com> Co-authored-by:
Robert Shaw <114415538+robertgshaw2-redhat@users.noreply.github.com>
-
fxmarty-amd authored
[fix][MOE] Fix MOE experts `intermediate_size` dimension not being narrowed before weight loading (#39688) Signed-off-by:Felix Marty <Felix.Marty@amd.com>
-
- 13 Apr, 2026 6 commits
-
-
Monishver authored
Signed-off-by:
Monishver Chandrasekaran <monishverchandrasekaran@gmail.com> Co-authored-by:
mergify[bot] <37929162+mergify[bot]@users.noreply.github.com> Co-authored-by:
Nicolò Lucchesi <nlucches@redhat.com>
-
Tyler Michael Smith authored
[Bugfix] Reject non-nvfp4 dtypes when using the flashinfer_nvlink_one_sided all2all backend (#39717) Signed-off-by:
Tyler Michael Smith <tlrmchlsmth@gmail.com> Co-authored-by:
Claude Opus 4.6 (1M context) <noreply@anthropic.com>
-
Yongye Zhu authored
Signed-off-by:Yongye Zhu <zyy1102000@gmail.com>
-
Santino Ramos authored
Signed-off-by:Santino Ramos <santinor@inferact.ai>
-
Yi Liu authored
Signed-off-by:yiliu30 <yi4.liu@intel.com>
-
Jesus Federico authored
Signed-off-by:
Jesus Federico <jefp@amazon.com> Signed-off-by:
wang.yuqi <yuqi.wang@daocloud.io> Co-authored-by:
Claude Opus 4.6 (1M context) <noreply@anthropic.com> Co-authored-by:
wang.yuqi <yuqi.wang@daocloud.io>
-
- 12 Apr, 2026 1 commit
-
-
Le Yang authored
-
- 11 Apr, 2026 2 commits
-
-
EdalatiAli authored
Signed-off-by:EdalatiAli <aliedalati@cohere.com>
-
Vibhav Agarwal authored
Signed-off-by:
Vibhav Agarwal <vibhavagarwal5@gmail.com> Co-authored-by:
vibhav-agarwal <vibhav.agarwal@glance.com> Co-authored-by:
Claude Opus 4.6 <noreply@anthropic.com>
-
- 10 Apr, 2026 9 commits
-
-
Michael Goin authored
Signed-off-by:mgoin <mgoin64@gmail.com>
-
Manu authored
Signed-off-by:manu <fortin.emmanuel@gmail.com>
-
Jesus Federico authored
Signed-off-by:
Jesus Federico <jefp@amazon.com> Co-authored-by:
Claude Opus 4.6 (1M context) <noreply@anthropic.com>
-
Jee Jee Li authored
Signed-off-by:Jee Jee Li <pandaleefree@gmail.com>
-
wang.yuqi authored
Signed-off-by:wang.yuqi <yuqi.wang@daocloud.io>
-
Hexiang Wang authored
Signed-off-by:
whx-sjtu <2952154980@qq.com> Co-authored-by:
mergify[bot] <37929162+mergify[bot]@users.noreply.github.com>
-
Richard Zou authored
Signed-off-by:Richard Zou <zou3519@gmail.com>
-
Kunshang Ji authored
Signed-off-by:
Kunshang Ji <kunshang.ji@intel.com> Signed-off-by:
Kunshang Ji <jikunshang95@gmail.com>
-
Ibrahim Arshad authored
Signed-off-by:Ibrahim Arshad <38925737+ibrahim1023@users.noreply.github.com>
-
- 09 Apr, 2026 7 commits
-
-
Michael Goin authored
Signed-off-by:mgoin <mgoin64@gmail.com>
-
PikaPikachu authored
Signed-off-by:kangletian <Letian.Kang@amd.com>
-
Richard Zou authored
Signed-off-by:Richard Zou <zou3519@gmail.com>
-
Yongye Zhu authored
-
Wei Zhao authored
Signed-off-by:
wzhao18 <wzhao18.sz@gmail.com> Co-authored-by:
Jee Jee Li <pandaleefree@gmail.com>
-
Maral authored
[W8A8 Block Linear Refactor][2/N] Remove W8A8Fp8BlockLinearOp and adopt Fp8 block linear kernel selections. (#33892) Signed-off-by:
maral <maralbahari.98@gmail.com> Signed-off-by:
Maral <maralbahari.98@gmail.com>
-
Benjamin Chislett authored
Signed-off-by:
Benjamin Chislett <bchislett@nvidia.com> Signed-off-by:
Benjamin Chislett <chislett.ben@gmail.com> Co-authored-by:
gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
-
- 08 Apr, 2026 2 commits
-
-
Wentao Ye authored
Signed-off-by:yewentao256 <zhyanwentao@126.com>
-
Jackmin801 authored
Signed-off-by:
Jackmin801 <ongjackm@gmail.com> Signed-off-by:
Robert Shaw <robshaw@redhat.com> Co-authored-by:
Robert Shaw <robshaw@redhat.com> Co-authored-by:
Robert Shaw <114415538+robertgshaw2-redhat@users.noreply.github.com>
-