"vscode:/vscode.git/clone" did not exist on "06e16a27eb251805c8e07b3a2e3bbd980fcf1592"
- 22 Apr, 2026 3 commits
-
-
Jhao-Ting Chen authored
Signed-off-by:Jhao-Ting Chen <jhaotingc@nvidia.com>
-
Khushali Desai authored
Signed-off-by:khushali9 <khushali.desai9@gmail.com>
-
TJian authored
[ROCm] [Wheel] [Bugfix] [Critical] Remove any packages installed from github from rocm.txt e.g `fastsafetensors` as it is incompatible with `uv pip` (#40461) Signed-off-by:tjtanaa <tunjian.tan@embeddedllm.com>
-
- 21 Apr, 2026 37 commits
-
-
Matthew Bonanni authored
Signed-off-by:
Matthew Bonanni <mbonanni@redhat.com> Co-authored-by:
Tyler Michael Smith <tlrmchlsmth@gmail.com>
-
bnellnm authored
Signed-off-by:
Bill Nell <bnell@redhat.com> Signed-off-by:
Robert Shaw <114415538+robertgshaw2-redhat@users.noreply.github.com> Co-authored-by:
Robert Shaw <114415538+robertgshaw2-redhat@users.noreply.github.com>
-
bnellnm authored
Signed-off-by:Bill Nell <bnell@redhat.com>
-
Wentao Ye authored
Signed-off-by:yewentao256 <zhyanwentao@126.com>
-
Wentao Ye authored
Signed-off-by:yewentao256 <zhyanwentao@126.com>
-
Jakub Zakrzewski authored
[Bugfix][Kernel] nvfp4 cutlass MoE: fix nvfp4 experts quant out-of-bounds read for expert counts not divisible by 4 or 16 (#40351) Signed-off-by:Jakub Zakrzewski <jzakrzewski@nvidia.com>
-
Fergus authored
Signed-off-by:
Fergus <fergus.barratt00@gmail.com> Signed-off-by:
fergus barratt <fergus.barratt00@gmail.com> Co-authored-by:
Chauncey <chaunceyjiang@gmail.com>
-
Rishi Puri authored
Signed-off-by:
Stefano Castagnetta <scastagnetta@nvidia.com> Signed-off-by:
Rishi Puri <puririshi98@berkeley.edu> Co-authored-by:
Claude Opus 4.6 (1M context) <noreply@anthropic.com> Co-authored-by:
Stefano Castagnetta <scastagnetta@nvidia.com> Co-authored-by:
Matthew Bonanni <mbonanni@redhat.com> Co-authored-by:
Benjamin Chislett <bchislett@nvidia.com>
-
Zijing Liu authored
[MRv2]fix: model accuracy regression caused by reusing the stale last_sampled_tokens and draft_tokens (#39833) Signed-off-by:Zijing Liu <liuzijing2014@gmail.com>
-
Isotr0py authored
Co-authored-by:Roger Wang <hey@rogerw.io>
-
Harry Mellor authored
Signed-off-by:Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
Wentao Ye authored
Signed-off-by:
yewentao256 <zhyanwentao@126.com> Signed-off-by:
Wentao Ye <44945378+yewentao256@users.noreply.github.com> Co-authored-by:
Nick Hill <nickhill123@gmail.com>
-
Vadim Gimpelson authored
Signed-off-by:
Vadim Gimpelson <vadim.gimpelson@gmail.com> Signed-off-by:
Vadim Gimpelson <156319763+vadiklyutiy@users.noreply.github.com>
-
Matthew Bonanni authored
Signed-off-by:Matthew Bonanni <mbonanni@redhat.com>
-
roikoren755 authored
Default to 'align' mamba cache mode for Mamba-based models when speculative decoding is enabled (#40454) Signed-off-by:Roi Koren <roik@nvidia.com>
-
Shanshan Shen authored
[MM][CG] Optimize default `max_frames_per_batch` auto-infer for ViT CUDA graph video inference (#40445) Signed-off-by:shen-shanshan <467638484@qq.com>
-
xiangdong authored
Signed-off-by:
zengxian <xiangdong.zeng@intel.com> Co-authored-by:
Kunshang Ji <kunshang.ji@intel.com>
-
ℍ𝕠𝕝𝕝𝕠𝕨 𝕄𝕒𝕟 authored
Signed-off-by:Hollow Man <hollowman@opensuse.org>
-
Yusuf Mohammad authored
Signed-off-by:
Yusuf <yusufmohammad@live.com> Co-authored-by:
mergify[bot] <37929162+mergify[bot]@users.noreply.github.com>
-
Talor Abramovich authored
Signed-off-by:
talora <talora@nvidia.com> Signed-off-by:
Talor Abramovich <talor19@gmail.com> Co-authored-by:
gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
-
artem-spector authored
Signed-off-by:
Artem Spector <artems@il.ibm.com> Signed-off-by:
artemspector <artems@il.ibm.com> Co-authored-by:
artemspector <artems@il.ibm.com> Co-authored-by:
Claude Opus 4.6 <noreply@anthropic.com>
-
wang.yuqi authored
Signed-off-by:
wang.yuqi <yuqi.wang@daocloud.io> Signed-off-by:
wang.yuqi <noooop@126.com> Co-authored-by:
Cyrus Leung <cyrus.tl.leung@gmail.com> Co-authored-by:
mergify[bot] <37929162+mergify[bot]@users.noreply.github.com>
-
Kris Hung authored
Signed-off-by:
Krish Hung <krishung5@gmail.com> Signed-off-by:
krishung5 <krish@nvidia.com> Co-authored-by:
Claude Opus 4.6 (1M context) <noreply@anthropic.com>
-
Jhao-Ting Chen authored
Signed-off-by:Jhao-Ting Chen <jhaotingc@nvidia.com>
-
wang.yuqi authored
Revert "[Startup] Parallelize torch/transformers import + weight prefetch + forkserver prewarm" (#40438) Signed-off-by:wang.yuqi <yuqi.wang@daocloud.io>
-
Zeyu Zhang authored
Signed-off-by:Alchuang22-dev <2584829494@qq.com>
-
hangy-amd authored
Signed-off-by:Hang Yang <hangy@amd.com>
-
milesial authored
-
Shanshan Shen authored
Signed-off-by:shen-shanshan <467638484@qq.com>
-
SeongJun Lee authored
Signed-off-by:lesj0610 <lesj0610@gmail.com>
-
Wentao Ye authored
Signed-off-by:yewentao256 <zhyanwentao@126.com>
-
Simon Mo authored
Signed-off-by:simon-mo <simon@inferact.ai>
-
Shanshan Shen authored
Signed-off-by:shen-shanshan <467638484@qq.com>
-
Luciano Martins authored
Signed-off-by:
Luciano Martins <lucianommartins@users.noreply.github.com> Co-authored-by:
Luciano Martins <lucianommartins@users.noreply.github.com>
-
Chauncey authored
Signed-off-by:chaunceyjiang <chaunceyjiang@gmail.com>
-
Misa authored
[Bugfix] Fix `_CONFIG_REGISTRY` types getting wrong config class when on-disk model_type differs (#39554) Signed-off-by:
Misa <misaAle@users.noreply.github.com> Signed-off-by:
Misael Casarez <misacasa@amazon.com> Co-authored-by:
Misael Casarez <misacasa@amazon.com>
-
Yanan Cao authored
Signed-off-by:
Yanan Cao <gmagogsfm@gmail.com> Co-authored-by:
Theresa Shan <Theresa.Shan@amd.com> Co-authored-by:
Claude Opus 4.6 (1M context) <noreply@anthropic.com>
-