- 30 Jul, 2025 18 commits
-
-
Po-Han Huang (NVIDIA) authored
Signed-off-by:Po-Han Huang <pohanh@nvidia.com>
-
Ruixiang Tan authored
Signed-off-by:tanruixiang <tanruixiang0104@gmail.com>
Signed-off-by:Ruixiang Tan <819464715@qq.com>
Signed-off-by:GitHub <noreply@github.com>
-
youkaichao authored
-
Yan Pashkovsky authored
Signed-off-by:Yan Pashkovsky <yanp.bugz@gmail.com>
-
rongfu.leng authored
Signed-off-by:rongfu.leng <rongfu.leng@daocloud.io>
-
aladerran authored
Signed-off-by:aladerran <aladerran@gmail.com>
-
Peter Pan authored
Signed-off-by:Peter Pan <Peter.Pan@daocloud.io>
-
Jee Jee Li authored
Signed-off-by:Jee Jee Li <pandaleefree@gmail.com>
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
Kunshang Ji authored
Signed-off-by:Kunshang Ji <kunshang.ji@intel.com>
-
wang.yuqi authored
Signed-off-by:wang.yuqi <noooop@126.com>
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
MingzhenHan authored
Signed-off-by:MingzhenHan <hanmingzhen2002@outlook.com>
-
Areeb Syed authored
[Bugfix] Fix shape mismatch assertion error when loading Gemma3n model with BitsAndBytes quantization (#21808)
Signed-off-by:sydarb <areebsyed237@gmail.com>
-
Csrayz authored
Signed-off-by:Csrayz <33659823+Csrayz@users.noreply.github.com>
-
Harry Mellor authored
Signed-off-by:Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
Chen Zhang authored
Signed-off-by:Chen Zhang <zhangch99@outlook.com>
-
milesial authored
Signed-off-by:Alexandre Milesi <30204471+milesial@users.noreply.github.com>
Co-authored-by:Alexandre Milesi <30204471+milesial@users.noreply.github.com>
-
- 29 Jul, 2025 13 commits
-
-
Yong Hoon Shin authored
Signed-off-by:Yong Hoon Shin <yhshin@meta.com>
-
Doug Smith authored
Signed-off-by:dougbtv <dosmith@redhat.com>
-
elvischenv authored
Signed-off-by:elvischenv <219235043+elvischenv@users.noreply.github.com>
-
Wenhua Cheng authored
Signed-off-by:Wenhua Cheng <wenhua.cheng@intel.com>
-
Richard Zou authored
Signed-off-by:Richard Zou <zou3519@gmail.com>
-
Chen Zhang authored
Signed-off-by:Chen Zhang <zhangch99@outlook.com>
-
Jee Jee Li authored
Signed-off-by:Jee Jee Li <pandaleefree@gmail.com>
-
Reza Barazesh authored
Signed-off-by:Reza Barazesh <rezabarazesh@meta.com>
Signed-off-by:Harry Mellor <19981378+hmellor@users.noreply.github.com>
Co-authored-by:Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
Isotr0py authored
Signed-off-by:isotr0py <2037008807@qq.com>
Signed-off-by:Isotr0py <2037008807@qq.com>
Signed-off-by:Isotr0py <mozf@mail2.sysu.edu.cn>
-
Nick Hill authored
Signed-off-by:Nick Hill <nhill@redhat.com>
-
Benji Beck authored
Signed-off-by:Benji Beck <benjibeck@meta.com>
-
Calvin Chen authored
Signed-off-by:calvin chen <wen.chen@dynamia.ai>
-
Wentao Ye authored
[Refactor] Merge Compressed Tensor FP8 `CompressedTensorsW8A8Fp8MoEMethod` and `CompressedTensorsW8A8Fp8MoECutlassMethod` (#21775)
Signed-off-by:yewentao256 <zhyanwentao@126.com>
-
- 28 Jul, 2025 9 commits
-
-
Lucas Wilkinson authored
Signed-off-by:Lucas Wilkinson <lwilkins@redhat.com>
-
Nikhil Gupta authored
Signed-off-by:Nikhil Gupta <nikhil.gupta2@arm.com>
Co-authored-by:mgoin <mgoin64@gmail.com>
-
Clayton Coleman authored
Signed-off-by:Clayton Coleman <smarterclayton@gmail.com>
Co-authored-by:mgoin <mgoin64@gmail.com>
-
Kuntai Du authored
Signed-off-by:KuntaiDu <kuntai@uchicago.edu>
-
Wentao Ye authored
[Bug] Enforce contiguous input for `dynamic_scaled_fp8_quant` and `static_scaled_fp8_quant` (#21773)
Signed-off-by:yewentao256 <zhyanwentao@126.com>
-
rasmith authored
[AMD][BugFix] Fix omission of wvSplitK kernel for small batch sizes (1-4) due to torch.compile (#21350)
Signed-off-by:Randall Smith <Randall.Smith@amd.com>
-
Harry Mellor authored
Signed-off-by:Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
Nick Hill authored
Signed-off-by:Nick Hill <nhill@redhat.com>
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-