"ssh:/git@developer.sourcefind.cn:2222/OpenDAS/vllm_cscc.git" did not exist on "89f1f253109961aea9c09062e66ae28134fa4746"
- 11 Mar, 2026 2 commits
-
-
tunglinwood authored
Signed-off-by:
tunglinwood <tunglinwood@gmail.com> Signed-off-by:
tunglinwood <tomwu.tunglin@gmail.com> Signed-off-by:
tunglinwood <113751333+tunglinwood@users.noreply.github.com>
-
Ning Xie authored
Signed-off-by:Andy Xie <andy.xning@gmail.com>
-
- 10 Mar, 2026 14 commits
-
-
Nick Hill authored
Signed-off-by:Nick Hill <nickhill123@gmail.com>
-
Nick Hill authored
Signed-off-by:Nick Hill <nickhill123@gmail.com>
-
Harry Mellor authored
Signed-off-by:Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
Hashem Hashemi authored
Signed-off-by:Hashem Hashemi <hashem.hashemi@amd.com>
-
Srinivasoo7 authored
feat(kv-offload): Strategy A — StoreReusedOffloadingManager gates CPU stores on reuse frequency (#35342) Signed-off-by:
srinivas_oo7 <Sriusa4414@gmail.com> Signed-off-by: Sriusa4414@gmail.com Signed-off-by:
Srinivasoo7 <158864704+Srinivasoo7@users.noreply.github.com> Co-authored-by:
srinivas_oo7 <sklinkedin0120@gmail.com> Co-authored-by:
Srinivasoo7 <158864704+Srinivasoo7@users.noreply.github.com> Co-authored-by:
Or Ozeri <oro@il.ibm.com>
-
Jiangyun Zhu authored
Signed-off-by:zjy0516 <riverclouds.zhu@qq.com>
-
wang.yuqi authored
Signed-off-by:wang.yuqi <yuqi.wang@daocloud.io>
-
Mark McLoughlin authored
Signed-off-by:Mark McLoughlin <markmc@redhat.com>
-
Harry Mellor authored
Signed-off-by:Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
Chang Su authored
feat(grpc): extract gRPC servicer into smg-grpc-servicer package, add --grpc flag to vllm serve (#36169) Signed-off-by:
Chang Su <chang.s.su@oracle.com> Co-authored-by:
Nick Hill <nhill@redhat.com>
-
Harry Mellor authored
Signed-off-by:Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
Wentao Ye authored
[Perf] Compute maxsim in worker side, reducing redundant copies, 2.7% E2E throughput improvement (#36159) Signed-off-by:yewentao256 <zhyanwentao@126.com>
-
Hojin Yang authored
Signed-off-by:
effortprogrammer <yhjhoward7@gmail.com> Co-authored-by:
Cyrus Leung <tlleungac@connect.ust.hk>
-
Andreas Karatzas authored
Signed-off-by:Andreas Karatzas <akaratza@amd.com>
-
- 09 Mar, 2026 16 commits
-
-
Shaun Kotek authored
Signed-off-by:
Shaun Kotek - Nvidia <skotek@nvidia.com> Co-authored-by:
root <root@gpu-259.slurm-workers-slurm.slurm.svc.cluster.local>
-
Micah Williamson authored
Signed-off-by:Micah Williamson <micah.williamson@amd.com>
-
Copilot authored
Signed-off-by:
Luka Govedič <ProExpertProg@users.noreply.github.com> Co-authored-by:
copilot-swe-agent[bot] <198982749+Copilot@users.noreply.github.com> Co-authored-by:
ProExpertProg <11367180+ProExpertProg@users.noreply.github.com> Co-authored-by:
Luka Govedič <ProExpertProg@users.noreply.github.com>
-
Andreas Karatzas authored
Signed-off-by:Andreas Karatzas <akaratza@amd.com>
-
Andreas Karatzas authored
[ROCm][CI] Fix ROCm attention backend validation for head sizes, block sizes, and compute capability checks (#36292) Signed-off-by:Andreas Karatzas <akaratza@amd.com>
-
Roberto L. Castro authored
[Attention][Perf][Kernel] Replace torch.cat with vectorized CUDA kernel MLA query concat - DeepSeek-V3.2 (#34917) Signed-off-by:
LopezCastroRoberto <rocastro@redhat.com> Signed-off-by:
Roberto L. Castro <38211239+LopezCastroRoberto@users.noreply.github.com>
-
Roberto L. Castro authored
Signed-off-by:
LopezCastroRoberto <rocastro@redhat.com> Co-authored-by:
Claude <noreply@anthropic.com>
-
Matthew Bonanni authored
Signed-off-by:Matthew Bonanni <mbonanni@redhat.com>
-
Isotr0py authored
Signed-off-by:Isotr0py <mozf@mail2.sysu.edu.cn>
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
Xin Yang authored
Signed-off-by:Xin Yang <xyangx@amazon.com>
-
liuzhenwei authored
Signed-off-by:zhenwei-intel <zhenwei.liu@intel.com>
-
Alex Brooks authored
Signed-off-by:Alex Brooks <albrooks@redhat.com>
-
wang.yuqi authored
Signed-off-by:
wang.yuqi <yuqi.wang@daocloud.io> Signed-off-by:
wang.yuqi <noooop@126.com>
-
wang.yuqi authored
Signed-off-by:
wang.yuqi <yuqi.wang@daocloud.io> Signed-off-by:
wang.yuqi <noooop@126.com> Co-authored-by:
gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com> Co-authored-by:
Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
Jiangyun Zhu authored
Signed-off-by:zjy0516 <riverclouds.zhu@qq.com>
-
- 08 Mar, 2026 2 commits
-
-
danisereb authored
Signed-off-by:Daniel Serebrenik <daserebrenik@nvidia.com>
-
Andreas Karatzas authored
Signed-off-by:Andreas Karatzas <akaratza@amd.com>
-
- 07 Mar, 2026 6 commits
-
-
Wei Zhao authored
-
PatchyTIS authored
-
Micah Williamson authored
-
Micah Williamson authored
-
qli88 authored
-
rahul-sarvam authored
Signed-off-by:rahul-sarvam <140298821+rahul-sarvam@users.noreply.github.com>
-