- 09 Apr, 2026 2 commits
-
-
Maral authored
[W8A8 Block Linear Refactor][2/N] Remove W8A8Fp8BlockLinearOp and adopt Fp8 block linear kernel selections. (#33892) Signed-off-by:
maral <maralbahari.98@gmail.com> Signed-off-by:
Maral <maralbahari.98@gmail.com>
-
Benjamin Chislett authored
Signed-off-by:
Benjamin Chislett <bchislett@nvidia.com> Signed-off-by:
Benjamin Chislett <chislett.ben@gmail.com> Co-authored-by:
gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
-
- 08 Apr, 2026 18 commits
-
-
Wentao Ye authored
Signed-off-by:yewentao256 <zhyanwentao@126.com>
-
triangleXIV authored
[BugFix] --max-model-len=-1 causes over-limit requests to hang and starve the entire service (#39102) Signed-off-by:
triangle14 <y1019026570@gmail.com> Signed-off-by:
mgoin <mgoin64@gmail.com> Co-authored-by:
mgoin <mgoin64@gmail.com>
-
Rishi Puri authored
Signed-off-by:
Rishi Puri <riship@nvidia.com> Signed-off-by:
Rishi Puri <puririshi98@berkeley.edu> Signed-off-by:
sfeng33 <4florafeng@gmail.com> Co-authored-by:
Benjamin Chislett <chislett.ben@gmail.com> Co-authored-by:
Flora Feng <4florafeng@gmail.com>
-
Wentao Ye authored
Signed-off-by:yewentao256 <zhyanwentao@126.com>
-
Jackmin801 authored
Signed-off-by:
Jackmin801 <ongjackm@gmail.com> Signed-off-by:
Robert Shaw <robshaw@redhat.com> Co-authored-by:
Robert Shaw <robshaw@redhat.com> Co-authored-by:
Robert Shaw <114415538+robertgshaw2-redhat@users.noreply.github.com>
-
Ben Browning authored
Signed-off-by:
Ben Browning <bbrownin@redhat.com> Co-authored-by:
Claude Opus 4.6 <noreply@anthropic.com> Co-authored-by:
Cursor <cursoragent@cursor.com>
-
Roberto L. Castro authored
[Perf][Kernel] Persistent TopK scheduler: unified CUDAGraph-safe kernel with dynamic per-row dispatch - DeepSeek-V3.2 DSA decode (#37421) Signed-off-by:
LopezCastroRoberto <rocastro@redhat.com> Signed-off-by:
Roberto L. Castro <38211239+LopezCastroRoberto@users.noreply.github.com> Co-authored-by:
Claude Sonnet 4.5 <noreply@anthropic.com> Co-authored-by:
Lucas Wilkinson <LucasWilkinson@users.noreply.github.com>
-
Shengqi Chen authored
Signed-off-by:
Shengqi Chen <harry-chen@outlook.com> Co-authored-by:
Jason Li <jasonlizhengjian@gmail.com> Co-authored-by:
Roger Wang <hey@rogerw.io>
-
Or Ozeri authored
Signed-off-by:Or Ozeri <oro@il.ibm.com>
-
Flora Feng authored
Signed-off-by:sfeng33 <4florafeng@gmail.com>
-
Gregory Shtrasberg authored
Signed-off-by:Gregory Shtrasberg <Gregory.Shtrasberg@amd.com>
-
haosdent authored
Signed-off-by:haosdent <haosdent@gmail.com>
-
wang.yuqi authored
Signed-off-by:wang.yuqi <yuqi.wang@daocloud.io>
-
yoke authored
Signed-off-by:yoke233 <yoke2012@gmail.com>
-
Flora Feng authored
Signed-off-by:
sfeng33 <4florafeng@gmail.com> Signed-off-by:
Andrew Xia <axia@meta.com>
-
Flora Feng authored
Signed-off-by:sfeng33 <4florafeng@gmail.com>
-
Andrey Talman authored
-
Giancarlo Delfin authored
Signed-off-by:Giancarlo Delfin <gdelfin@inferact.ai>
-
- 07 Apr, 2026 7 commits
-
-
Lucas Wilkinson authored
Signed-off-by:Lucas Wilkinson <lwilkins@redhat.com>
-
ibifrost authored
Signed-off-by:
wuchenxin <wuchenxin.wcx@alibaba-inc.com> Signed-off-by:
ibifrost <47308427+ibifrost@users.noreply.github.com> Co-authored-by:
Simon Mo <simon.mo@hey.com>
-
Ilya Boytsov authored
Signed-off-by:Ilya Boytsov <ilyaboytsov1805@gmail.com>
-
Ronen Schaffer authored
Signed-off-by:Ronen Schaffer <ronen.schaffer@ibm.com>
-
Jiangyun Zhu authored
Signed-off-by:
zjy0516 <riverclouds.zhu@qq.com> Signed-off-by:
Jiangyun Zhu <riverclouds.zhu@qq.com> Co-authored-by:
Luka Govedič <ProExpertProg@users.noreply.github.com>
-
Andreas Karatzas authored
Signed-off-by:Andreas Karatzas <akaratza@amd.com>
-
Andreas Karatzas authored
Signed-off-by:Andreas Karatzas <akaratza@amd.com>
-
- 06 Apr, 2026 9 commits
-
-
fxmarty-amd authored
[NVFP4] Support NVFP4 dense models from `modelopt` and `compressed-tensors` on AMD Instinct MI300, MI355X and Hopper through emulation (#35733) Signed-off-by:
Felix Marty <Felix.Marty@amd.com> Signed-off-by:
fxmarty-amd <felmarty@amd.com> Co-authored-by:
Kyle Sayers <kylesayrs@gmail.com>
-
Yongye Zhu authored
Signed-off-by:
Yongye Zhu <zyy1102000@gmail.com> Co-authored-by:
Michael Goin <mgoin64@gmail.com> Co-authored-by:
Roger Wang <hey@rogerw.io>
-
bnellnm authored
Signed-off-by:
Bill Nell <bnell@redhat.com> Co-authored-by:
Robert Shaw <114415538+robertgshaw2-redhat@users.noreply.github.com>
-
zhanqiuhu authored
-
Walter Beller-Morales authored
Signed-off-by:walterbm <walter.beller.morales@gmail.com>
-
Julien Denize authored
Signed-off-by:juliendenize <julien.denize@mistral.ai>
-
bhargav-patel-29 authored
Signed-off-by:
bhargav-patel-29 <bhargav.patel@tihiitb.org> Co-authored-by:
gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
-
liuchenbing2026 authored
Signed-off-by:
liuchenbing <chenliumail@163.com> Signed-off-by:
liucb <liuchengbao_work@163.com> Co-authored-by:
liuchenbing <chenliumail@163.com>
-
Micah Williamson authored
Signed-off-by:Micah Williamson <micah.williamson@amd.com>
-
- 05 Apr, 2026 3 commits
-
-
Greg Pereira authored
Signed-off-by:
greg pereira <grpereir@redhat.com> Signed-off-by:
Robert Shaw <114415538+robertgshaw2-redhat@users.noreply.github.com> Co-authored-by:
Robert Shaw <114415538+robertgshaw2-redhat@users.noreply.github.com>
-
Greg Pereira authored
Signed-off-by:
greg pereira <grpereir@redhat.com> Co-authored-by:
Robert Shaw <114415538+robertgshaw2-redhat@users.noreply.github.com>
-
Aaron Batilo authored
Signed-off-by:Aaron Batilo <abatilo@coreweave.com>
-
- 03 Apr, 2026 1 commit
-
-
Jeffrey Wang authored
Signed-off-by:Jeffrey Wang <jeffreywang@anyscale.com>
-