- 03 Jun, 2025 1 commit
-
-
Tyler Michael Smith authored
Signed-off-by:
Tyler Michael Smith <tysmith@redhat.com> Signed-off-by:
Tyler Michael Smith <tyler@neuralmagic.com>
-
- 31 May, 2025 2 commits
-
-
Charlie Fu authored
Signed-off-by:charlifu <charlifu@amd.com>
-
vllmellm authored
Signed-off-by:vllmellm <vllm.ellm@embeddedllm.com>
-
- 30 May, 2025 4 commits
-
-
rongfu.leng authored
Signed-off-by:rongfu.leng <rongfu.leng@daocloud.io>
-
Michael Goin authored
Signed-off-by:mgoin <mgoin64@gmail.com>
-
Wenhua Cheng authored
Signed-off-by:wenhuach21 <wenhua.cheng@intel.com>
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
- 29 May, 2025 3 commits
-
-
Chengji Yao authored
Signed-off-by:Chengji Yao <chengjiyao@google.com>
-
Chengji Yao authored
Signed-off-by:Chengji Yao <chengjiyao@google.com>
-
Maximilien de Bayser authored
Signed-off-by:
Max de Bayser <mbayser@br.ibm.com> Co-authored-by:
Cyrus Leung <cyrus.tl.leung@gmail.com>
-
- 28 May, 2025 1 commit
-
-
Varun Sundar Rabindranath authored
Signed-off-by:
Varun Sundar Rabindranath <varun@neuralmagic.com> Co-authored-by:
Varun Sundar Rabindranath <varun@neuralmagic.com>
-
- 27 May, 2025 5 commits
-
-
Satyajith Chilappagari authored
Signed-off-by:Satyajith Chilappagari <satchill@amazon.com>
-
Hyogeun Oh (오효근) authored
[Doc] Convert Sphinx directives ( `{class}`, `{meth}`, `{attr}`, ...) to MkDocs format for better documentation linking (#18663) Signed-off-by:Zerohertz <ohg3417@gmail.com>
-
almersawi authored
Signed-off-by:
Islam Almersawi <islam.almersawi@openinnovation.ai> Co-authored-by:
Islam Almersawi <islam.almersawi@openinnovation.ai>
-
vllmellm authored
Signed-off-by:vllmellm <vllm.ellm@embeddedllm.com>
-
Isotr0py authored
Signed-off-by:
Isotr0py <mozf@mail2.sysu.edu.cn> Signed-off-by:
Isotr0py <2037008807@qq.com>
-
- 25 May, 2025 1 commit
-
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
- 24 May, 2025 2 commits
-
-
wangxiyuan authored
Signed-off-by:wangxiyuan <wangxiyuan1007@gmail.com>
-
Wenhua Cheng authored
Signed-off-by:wenhuach21 <wenhua.cheng@intel.com>
-
- 23 May, 2025 5 commits
-
-
Feng XiaoLong authored
Signed-off-by:
Crucifixion-Fxl <xmufxl@gmail.com> Co-authored-by:
Crucifixion-Fxl <xmufxl@gmail.com>
-
Pavani Majety authored
[ModelOpt] Introduce VLLM_MAX_TOKENS_PER_EXPERT_FP4_MOE env var to control blockscale tensor allocation (#18160) Signed-off-by:Pavani Majety <pmajety@nvidia.com>
-
youkaichao authored
Signed-off-by:youkaichao <youkaichao@gmail.com>
-
Harry Mellor authored
Signed-off-by:Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
Kay Yan authored
[Hardware][CPU] Update intel_extension_for_pytorch 2.7.0 and move to `requirements/cpu.txt` (#18542) Signed-off-by:Kay Yan <kay.yan@daocloud.io>
-
- 22 May, 2025 2 commits
-
-
Tyler Michael Smith authored
Signed-off-by:
Tyler Michael Smith <tyler@neuralmagic.com> Signed-off-by:
Lucas Wilkinson <lwilkinson@neuralmagic.com> Signed-off-by:
Tyler Michael Smith <tysmith@redhat.com> Co-authored-by:
Lucas Wilkinson <lwilkinson@neuralmagic.com>
-
Dhia Eddine Rhaiem authored
Signed-off-by:
dhia.rhaiem <dhia.rhaiem@tii.ae> Co-authored-by:
younesbelkada <younesbelkada@gmail.com> Co-authored-by:
Ilyas Chahed <ilyas.chahed@tii.ae> Co-authored-by:
Jingwei Zuo <jingwei.zuo@tii.ae>
-
- 21 May, 2025 3 commits
-
-
GiantCroc authored
Signed-off-by:giantcroc <1204449533@qq.com>
-
Dhia Eddine Rhaiem authored
Signed-off-by:
dhia.rhaiem <dhia.rhaiem@tii.ae> Co-authored-by:
younesbelkada <younesbelkada@gmail.com> Co-authored-by:
Ilyas Chahed <ilyas.chahed@tii.ae> Co-authored-by:
Jingwei Zuo <jingwei.zuo@tii.ae>
-
Michael Goin authored
Signed-off-by:Michael Goin <mgoin64@gmail.com>
-
- 20 May, 2025 2 commits
-
-
Michael Goin authored
Signed-off-by:mgoin <mgoin64@gmail.com>
-
Random Fly authored
Signed-off-by:rand-fly <randfly@outlook.com>
-
- 19 May, 2025 2 commits
-
-
sunyicode0012 authored
Add files via uploadAdd fused MoE kernel tuning configs (fp8_w8a8) for DeepSeek V3/R1 on a single-node 8x NVIDIA H20 96GB setup (#18337)
-
Wenhua Cheng authored
Signed-off-by:wenhuach21 <wenhua.cheng@intel.com>
-
- 18 May, 2025 1 commit
-
-
22quinn authored
Signed-off-by:22quinn <33176974+22quinn@users.noreply.github.com>
-
- 16 May, 2025 2 commits
-
-
Bowen Wang authored
Signed-off-by:
Bowen Wang <abmfy@icloud.com> Co-authored-by:
mgoin <mgoin64@gmail.com>
-
Lain authored
-
- 15 May, 2025 3 commits
-
-
TJian authored
[Bugfix] [ROCm]: Remove assertion logic when using AITER fused moe in unquantizedMethod to reenable LLama4 BF16 (#18205) Signed-off-by:tjtanaa <tunjian.tan@embeddedllm.com>
-
omahs authored
Signed-off-by:omahs <73983677+omahs@users.noreply.github.com>
-
Mengqing Cao authored
Signed-off-by:Mengqing Cao <cmq0113@163.com>
-
- 14 May, 2025 1 commit
-
-
Jerry Zhang authored
Signed-off-by:Jerry Zhang <jerryzh168@gmail.com>
-