- 27 Jan, 2026 1 commit
-
-
Robert Shaw authored
Signed-off-by:
Robert Shaw <robshaw@redhat.com> Signed-off-by:
Amir Klein <203507526+amirkl94@users.noreply.github.com> Co-authored-by:
Robert Shaw <robshaw@redhat.com> Co-authored-by:
amirkl94 <203507526+amirkl94@users.noreply.github.com>
-
- 26 Jan, 2026 3 commits
-
-
Robert Shaw authored
Signed-off-by:
Robert Shaw <robshaw@redhat.com> Co-authored-by:
Robert Shaw <robshaw@redhat.com> (cherry picked from commit 43a013c3)
-
Robert Shaw authored
Signed-off-by:
Robert Shaw <robshaw@redhat.com> Co-authored-by:
Robert Shaw <robshaw@redhat.com>
-
Chauncey authored
Signed-off-by:chaunceyjiang <chaunceyjiang@gmail.com>
-
- 22 Jan, 2026 4 commits
-
-
Fadi Arafeh authored
[CPU Backend] [Perf] Accelerate tensor-parallel/data-parallel inference across NUMA domains on Arm (#32792) Signed-off-by:Fadi Arafeh <fadi.arafeh@arm.com>
-
liranschour authored
Signed-off-by:
Liran Schour <lirans@il.ibm.com> Signed-off-by:
liranschour <liranschour@users.noreply.github.com> Co-authored-by:
Or Ozeri <or@ozery.com>
-
Alex Sun authored
Signed-off-by:Alex Sun <alex.s@amd.com>
-
knlnguyen1802 authored
Signed-off-by:knlnguyen1802 <knlnguyen1802@gmail.com>
-
- 21 Jan, 2026 1 commit
-
-
Or Ozeri authored
Signed-off-by:Or Ozeri <oro@il.ibm.com>
-
- 20 Jan, 2026 2 commits
-
-
YiSheng5 authored
Signed-off-by:yisheng <yi.sheng@intel.com>
-
Walter Beller-Morales authored
Signed-off-by:walterbm <walter.beller.morales@gmail.com>
-
- 19 Jan, 2026 2 commits
-
-
qli88 authored
Signed-off-by:
Qiang Li <qiang.li2@amd.com> Signed-off-by:
Matthew Wong <Matthew.Wong2@amd.com>
-
Nicolò Lucchesi authored
Add a new metric to track the number of requests that had their KV blocks expire. The scenario is particularly important to surface and track as it is a vital indicator of the health of the deployment. Currently we're resorting to track these failures through unstructured log parsing (which is, among other thing, error string dependent); current main: > Releasing expired KV blocks for request cmpl-071d which were retrieved by 0 decode worker(s) within 0 seconds. Signed-off-by:NickLucche <nlucches@redhat.com>
-
- 18 Jan, 2026 2 commits
-
-
Deming authored
-
bnellnm authored
Signed-off-by:Bill Nell <bnell@redhat.com>
-
- 16 Jan, 2026 2 commits
-
-
Ilya Markov authored
Signed-off-by:
ilmarkov <markovilya197@gmail.com> Signed-off-by:
Tyler Michael Smith <tlrmchlsmth@gmail.com> Co-authored-by:
Tyler Michael Smith <tlrmchlsmth@gmail.com>
-
zhuwenwen authored
区分pcie和hglink custom allreduce的使用 vllm:export VLLM_CUSTOM_CACHE=1 dtk:export HIP_KERNEL_EVENT_SYSTENFENCE=1 set VLLM_USE_FUSED_RMS_ROPE=1 add SUPPORT_MOE_MARLIN_W16A16 to use moe marlin on bw support fa kvcache fp8 (todo: add VLLM_USE_QUERY_QUANT to not use q quant) update moe_align_block_size
-
- 15 Jan, 2026 1 commit
-
-
kzwrime authored
Signed-off-by:kunzh <zhikun.wu@outlook.com>
-
- 14 Jan, 2026 1 commit
-
-
Angela Yi authored
Signed-off-by:angelayi <yiangela7@gmail.com>
-
- 13 Jan, 2026 5 commits
-
-
Martin Hickey authored
Signed-off-by:
Martin Hickey <martin.hickey@ie.ibm.com> Co-authored-by:
gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
-
Sage Moore authored
Signed-off-by:Sage Moore <sage@neuralmagic.com>
-
Matthew Bonanni authored
Signed-off-by:
Matthew Bonanni <mbonanni@redhat.com> Co-authored-by:
Luka Govedič <ProExpertProg@users.noreply.github.com>
-
Mathis Felardos authored
Signed-off-by:Mathis Felardos <mathis@mistral.ai>
-
Martin Hickey authored
Signed-off-by:
Martin Hickey <martin.hickey@ie.ibm.com> Co-authored-by:
gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
-
- 12 Jan, 2026 6 commits
-
-
Nicolò Lucchesi authored
Signed-off-by:NickLucche <nlucches@redhat.com>
-
Lucas Kabela authored
Signed-off-by:Lucas Kabela <lucaskabela@meta.com>
-
Ilya Markov authored
Signed-off-by:ilmarkov <markovilya197@gmail.com>
-
Nicolò Lucchesi authored
Signed-off-by:NickLucche <nlucches@redhat.com>
-
Or Ozeri authored
Signed-off-by:Or Ozeri <oro@il.ibm.com>
-
dtc authored
Signed-off-by:
Tianchen Ding <dtcccc@linux.alibaba.com> Co-authored-by:
Nicolò Lucchesi <nicolo.lucchesi@gmail.com>
-
- 11 Jan, 2026 1 commit
-
-
Or Ozeri authored
Signed-off-by:Or Ozeri <oro@il.ibm.com>
-
- 10 Jan, 2026 1 commit
-
-
Or Ozeri authored
Signed-off-by:Or Ozeri <oro@il.ibm.com>
-
- 09 Jan, 2026 6 commits
-
-
Lucas Kabela authored
-
Chendi.Xue authored
-
Matthew Bonanni authored
Signed-off-by:Matthew Bonanni <mbonanni@redhat.com>
-
inkcherry authored
Signed-off-by:inkcherry <mingzhi.liu@amd.com>
-
Bofeng Xue authored
Signed-off-by:
Bofeng BF1 Xue <xuebf1@Lenovo.com> Co-authored-by:
Bofeng BF1 Xue <xuebf1@Lenovo.com>
-
zhuwenwen authored
-
- 08 Jan, 2026 2 commits
-
-
Lucas Wilkinson authored
[Misc] Fix `Current vLLM config is not set.` warnings, assert to avoid issues in the future (#31747) Signed-off-by:
Lucas Wilkinson <lwilkins@redhat.com> Signed-off-by:
Lucas Wilkinson <LucasWilkinson@users.noreply.github.com> Co-authored-by:
Luka Govedič <ProExpertProg@users.noreply.github.com>
-
zhuwenwen authored
-