- 30 Oct, 2025 7 commits
-
-
Paul Zhang authored
Signed-off-by: PaulZhang12 <paulzhan@fb.com>
Co-authored-by: Wentao Ye <44945378+yewentao256@users.noreply.github.com>
-
Roger Meier authored
Signed-off-by: Roger Meier <r.meier@siemens.com>
-
Li, Jiang authored
Signed-off-by: jiang1.li <jiang1.li@intel.com>
-
Zhiyuan Li authored
Signed-off-by: lizhiyuan <lizhiyuan@moonshot.cn>
Signed-off-by: Zhiyuan Li <uniartisan2017@gmail.com>
-
wang.yuqi authored
Signed-off-by: wang.yuqi <noooop@126.com>
Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
Co-authored-by: Cyrus Leung <cyrus.tl.leung@gmail.com>
-
Bram Wasti authored
Signed-off-by: Bram Wasti <bwasti@meta.com>
Signed-off-by: Bram Wasti <bwasti@fb.com>
Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
-
Yan Ma authored
Signed-off-by: Yan Ma <yan.ma@intel.com>
-
- 29 Oct, 2025 4 commits
-
-
Wentao Ye authored
Signed-off-by: yewentao256 <zhyanwentao@126.com>
-
Wentao Ye authored
[Bug] Fix DeepEP low latency `assert self.batched_router_logits.size(-1) == full_router_logits.size(-1)` Bug (#27682)
Signed-off-by: yewentao256 <zhyanwentao@126.com>
-
Roger Young authored
Signed-off-by: xuebi <xuebi@minimaxi.com>
Co-authored-by: xuebi <xuebi@minimaxi.com>
-
Zhewen Li authored
Signed-off-by: zhewenli <zhewenli@meta.com>
-
- 28 Oct, 2025 6 commits
-
-
Wentao Ye authored
Signed-off-by: yewentao256 <zhyanwentao@126.com>
-
Zhiyuan Li authored
Signed-off-by: lizhiyuan <lizhiyuan@moonshot.cn>
-
Wentao Ye authored
Signed-off-by: yewentao256 <zhyanwentao@126.com>
Co-authored-by: Cyrus Leung <tlleungac@connect.ust.hk>
-
Matthew Bonanni authored
Signed-off-by: Matthew Bonanni <mbonanni@redhat.com>
-
Li, Jiang authored
[Bugfix][CPU] Fallback oneDNN linear to torch linear to fix half gemm support on legacy platforms (#27526)
Signed-off-by: jiang1.li <jiang1.li@intel.com>
Co-authored-by: Isotr0py <mozf@mail2.sysu.edu.cn>
-
Eric Yue authored
Signed-off-by: minatoaquaMK2 <jiacheng.yue@foxmail.com>
-
- 27 Oct, 2025 3 commits
-
-
Varun Sundar Rabindranath authored
Signed-off-by: Varun Sundar Rabindranath <vsundarr@redhat.com>
Co-authored-by: Varun Sundar Rabindranath <vsundarr@redhat.com>
-
Cyrus Leung authored
Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
-
Danielle Robinson authored
Signed-off-by: Danielle Robinson <dmmaddix@amazon.com>
Signed-off-by: Danielle Robinson <dcmaddix@gmail.com>
Co-authored-by: Danielle Robinson <dmmaddix@amazon.com>
Co-authored-by: Jee Jee Li <pandaleefree@gmail.com>
-
- 26 Oct, 2025 1 commit
-
-
Yeshwanth N authored
Signed-off-by: Yeshwanth Surya <yeshsurya@gmail.com>
Signed-off-by: Yeshwanth N <yeshsurya@gmail.com>
Signed-off-by: yeshsurya <yeshsurya@gmail.com>
-
- 24 Oct, 2025 4 commits
-
-
Varun Sundar Rabindranath authored
Signed-off-by: Varun Sundar Rabindranath <vsundarr@redhat.com>
Co-authored-by: Varun Sundar Rabindranath <vsundarr@redhat.com>
-
Wentao Ye authored
Signed-off-by: yewentao256 <zhyanwentao@126.com>
-
fhl2000 authored
Signed-off-by: fhl <2410591650@qq.com>
Signed-off-by: fhl2000 <63384265+fhl2000@users.noreply.github.com>
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
Co-authored-by: Luka Govedič <ProExpertProg@users.noreply.github.com>
Co-authored-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
Xiangyu Li authored
[Kernel] Add GPTQv2 format support for low-bit or asymmetric quantization, by adapting gptq_gemm (#26092)
-
- 23 Oct, 2025 6 commits
-
-
Akash kaothalkar authored
Signed-off-by: Akash Kaothalkar <akash.kaothalkar@ibm.com>
Co-authored-by: Akash Kaothalkar <akash.kaothalkar@ibm.com>
-
Isotr0py authored
Signed-off-by: Isotr0py <mozf@mail2.sysu.edu.cn>
-
Gregory Shtrasberg authored
Signed-off-by: Gregory Shtrasberg <Gregory.Shtrasberg@amd.com>
-
Alexander Matveev authored
Signed-off-by: Alexander Matveev <amatveev@redhat.com>
-
wang.yuqi authored
Signed-off-by: wang.yuqi <noooop@126.com>
Signed-off-by: Christian Pinto <christian.pinto@ibm.com>
Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
Co-authored-by: Christian Pinto <christian.pinto@ibm.com>
-
tomeras91 authored
Signed-off-by: Tomer Asida <57313761+tomeras91@users.noreply.github.com>
-
- 22 Oct, 2025 4 commits
-
-
Reinforce-II authored
Signed-off-by: Reinforce-II <fate@eastal.com>
Co-authored-by: Wentao Ye <44945378+yewentao256@users.noreply.github.com>
-
Wentao Ye authored
Signed-off-by: yewentao256 <zhyanwentao@126.com>
-
dongbo910220 authored
Signed-off-by: dongbo910220 <1275604947@qq.com>
Signed-off-by: dongbo910220 <32610838+dongbo910220@users.noreply.github.com>
Co-authored-by: Wentao Ye <44945378+yewentao256@users.noreply.github.com>
Co-authored-by: Cyrus Leung <tlleungac@connect.ust.hk>
-
Jiangyun Zhu authored
Signed-off-by: zjy0516 <riverclouds.zhu@qq.com>
-
- 21 Oct, 2025 5 commits
-
-
Alexander Matveev authored
[Performance] Dual stream execution of "shared_experts" and "selected_experts" inside FusedMoE (#26440)
Signed-off-by: Alexander Matveev <amatveev@redhat.com>
-
Wentao Ye authored
Signed-off-by: yewentao256 <zhyanwentao@126.com>
-
JartX authored
Signed-off-by: JartX <sagformas@epdcenter.es>
-
Varun Sundar Rabindranath authored
Signed-off-by: Varun Sundar Rabindranath <vsundarr@redhat.com>
Co-authored-by: Varun Sundar Rabindranath <vsundarr@redhat.com>
Co-authored-by: Michael Goin <mgoin64@gmail.com>
-
Shu Wang authored
Signed-off-by: Shu Wang <shuw@nvidia.com>
Signed-off-by: Shu Wang. <shuw@nvidia.com>
Co-authored-by: Michael Goin <mgoin64@gmail.com>
-