- 10 Feb, 2026 3 commits
-
-
Qi Wang authored
Signed-off-by: Qi Wang <qiwa@nvidia.com>
-
Phúc H. Lê Khắc authored
-
Cyrus Leung authored
Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
-
- 09 Feb, 2026 3 commits
-
-
Roger Wang authored
Signed-off-by: Roger Wang <hey@rogerw.io>
-
JJJYmmm authored
Signed-off-by: JJJYmmm <1650675829@qq.com>
Signed-off-by: JJJYmmm <92386084+JJJYmmm@users.noreply.github.com>
Signed-off-by: Roger Wang <hey@rogerw.io>
Co-authored-by: wulipc <wulipc@users.noreply.github.com>
Co-authored-by: ywang96 <ywang96@users.noreply.github.com>
Co-authored-by: Isotr0py <Isotr0py@users.noreply.github.com>
Co-authored-by: Isotr0py <2037008807@qq.com>
Co-authored-by: Roger Wang <hey@rogerw.io>
-
Jee Jee Li authored
-
- 08 Feb, 2026 3 commits
-
-
danisereb authored
Signed-off-by: Daniel Serebrenik <daserebrenik@nvidia.com>
-
Richard Zou authored
Signed-off-by: Richard Zou <zou3519@gmail.com>
-
Reagan Lee authored
Signed-off-by: Reagan Lee <“reaganjlee@gmail.com”>
Signed-off-by: Reagan Lee <reaganjlee@gmail.com>
Signed-off-by: Reagan Lee <96998476+reaganjlee@users.noreply.github.com>
Co-authored-by: Reagan Lee <“reaganjlee@gmail.com”>
Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
-
- 07 Feb, 2026 3 commits
-
-
Mohammad Miadh Angkad authored
Signed-off-by: Mohammad Miadh Angkad <176301910+mmangkad@users.noreply.github.com>
Signed-off-by: Luka Govedič <ProExpertProg@users.noreply.github.com>
Co-authored-by: Luka Govedič <ProExpertProg@users.noreply.github.com>
-
Wentao Ye authored
Signed-off-by: yewentao256 <zhyanwentao@126.com>
-
rasmith authored
Signed-off-by: Randall Smith <Randall.Smith@amd.com>
-
- 06 Feb, 2026 5 commits
-
-
Harry Mellor authored
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
SorenDreano authored
Co-authored-by: Soren Dreano <soren@numind.ai>
Co-authored-by: Wentao Ye <44945378+yewentao256@users.noreply.github.com>
-
Luka Govedič authored
Signed-off-by: Luka Govedič <lgovedic@redhat.com>
Signed-off-by: ProExpertProg <luka.govedic@gmail.com>
Signed-off-by: Luka Govedič <ProExpertProg@users.noreply.github.com>
-
chengchengpei authored
Signed-off-by: Chengcheng Pei <chengchengpei@outlook.com>
Signed-off-by: chengchengpei <5881383+chengchengpei@users.noreply.github.com>
Co-authored-by: chengchengpei <5881383+chengchengpei@users.noreply.github.com>
Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
-
emricksini-h authored
-
- 05 Feb, 2026 4 commits
-
-
Benjamin Chislett authored
Signed-off-by: Benjamin Chislett <bchislett@nvidia.com>
-
Aaron Hao authored
Signed-off-by: ahao-anyscale <ahao@anyscale.com>
Signed-off-by: Aaron Hao <ahao@anyscale.com>
Co-authored-by: SumanthRH <sumanthrh99@gmail.com>
-
Luka Govedič authored
Signed-off-by: Luka Govedič <lgovedic@redhat.com>
Signed-off-by: ProExpertProg <luka.govedic@gmail.com>
Signed-off-by: Luka Govedič <ProExpertProg@users.noreply.github.com>
-
Ilya Boytsov authored
Signed-off-by: Ilya Boytsov <ilyaboytsov1805@gmail.com>
Signed-off-by: Ilya Boytsov <boytsovpanamera@mail.ru>
Co-authored-by: Cyrus Leung <cyrus.tl.leung@gmail.com>
Co-authored-by: wang.yuqi <yuqi.wang@daocloud.io>
-
- 04 Feb, 2026 2 commits
-
-
Zhengxu Chen authored
Signed-off-by: zhxchen17 <zhxchen17@fb.com>
-
Wentao Ye authored
Signed-off-by: yewentao256 <zhyanwentao@126.com>
-
- 03 Feb, 2026 5 commits
-
-
Harry Mellor authored
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
杨朱 · Kiki authored
Signed-off-by: carlory <baofa.fan@daocloud.io>
Co-authored-by: Claude Opus 4.5 <noreply@anthropic.com>
-
杨朱 · Kiki authored
Signed-off-by: carlory <baofa.fan@daocloud.io>
Co-authored-by: Claude Opus 4.5 <noreply@anthropic.com>
-
Kunshang Ji authored
Signed-off-by: Kunshang Ji <kunshang.ji@intel.com>
-
Richard Zou authored
[torch.compile] Don't do the fast moe cold start optimization if there is speculative decoding (#33624)
Signed-off-by: Richard Zou <zou3519@gmail.com>
Co-authored-by: Robert Shaw <114415538+robertgshaw2-redhat@users.noreply.github.com>
-
- 02 Feb, 2026 2 commits
-
-
yugong333 authored
Reduce the kernel overhead when num of active loras is smaller than max loras. Multiple cuda graphs are captured for each num of active-loras. (#32005)
Signed-off-by: Yu Gong <yu3.gong@gmail.com>
-
csy0225 authored
Signed-off-by: Jee Jee Li <pandaleefree@gmail.com>
Co-authored-by: i-zhangmingming <i-zhangmingming@stepfun.com>
Co-authored-by: xiewuxun <xiewuxun@stepfun.com>
Co-authored-by: zetaohong <i-hongzetao@stepfun.com>
Co-authored-by: Jee Jee Li <pandaleefree@gmail.com>
-
- 31 Jan, 2026 4 commits
-
-
Cyrus Leung authored
Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
-
Luka Govedič authored
[fix][torch.compile] Fix cold-start compilation time increase by adding kv cache update to splitting ops (#33441)
Signed-off-by: Luka Govedič <lgovedic@redhat.com>
Co-authored-by: Richard Zou <zou3519@gmail.com>
-
Cyrus Leung authored
Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
-
AutumnAurelium authored
Signed-off-by: AutumnAurelium <88015631+AutumnAurelium@users.noreply.github.com>
-
- 30 Jan, 2026 6 commits
-
-
Harry Mellor authored
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
Frank Wang authored
Signed-off-by: frankwang28 <frank.wbb@hotmail.com>
Signed-off-by: Frank Wang <41319051+frankwang28@users.noreply.github.com>
Co-authored-by: Wentao Ye <44945378+yewentao256@users.noreply.github.com>
-
Julien Denize authored
Signed-off-by: Julien Denize <julien.denize@mistral.ai>
Signed-off-by: juliendenize <julien.denize@mistral.ai>
Co-authored-by: Michael Goin <mgoin64@gmail.com>
-
Harry Mellor authored
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
Harry Huang authored
Signed-off-by: huanghaoyan.hhy <huanghaoyan.hhy@alibaba-inc.com>
-
Harry Mellor authored
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
-