- 20 May, 2025 3 commits
-
-
Michael Goin authored
Signed-off-by: mgoin <mgoin64@gmail.com>
-
Random Fly authored
Signed-off-by: rand-fly <randfly@outlook.com>
-
Isotr0py authored
Signed-off-by: Isotr0py <2037008807@qq.com>
-
- 19 May, 2025 7 commits
-
-
Satyajith Chilappagari authored
Signed-off-by: Satyajith Chilappagari <satchill@amazon.com>
-
sunyicode0012 authored
Add files via uploadAdd fused MoE kernel tuning configs (fp8_w8a8) for DeepSeek V3/R1 on a single-node 8x NVIDIA H20 96GB setup (#18337)
-
Wenhua Cheng authored
Signed-off-by: wenhuach21 <wenhua.cheng@intel.com>
-
Jee Jee Li authored
Signed-off-by: Jee Jee Li <pandaleefree@gmail.com>
-
Shaoyu Yang authored
Signed-off-by: cascade812 <cascade812@outlook.com>
Signed-off-by: shaoyuyoung <shaoyuyoung@gmail.com>
Co-authored-by: cascade <cascade812@outlook.com>
-
CYJiang authored
-
wwl2755 authored
Signed-off-by: wwl2755 <wangwenlong2755@gmail.com>
-
- 18 May, 2025 2 commits
-
-
Lifu Huang authored
Signed-off-by: Lifu Huang <lifu.hlf@gmail.com>
-
22quinn authored
Signed-off-by: 22quinn <33176974+22quinn@users.noreply.github.com>
-
- 17 May, 2025 1 commit
-
-
rongfu.leng authored
-
- 16 May, 2025 5 commits
-
-
Bowen Wang authored
Signed-off-by: Bowen Wang <abmfy@icloud.com>
Co-authored-by: mgoin <mgoin64@gmail.com>
-
learner0810 authored
Signed-off-by: learner0810 <zhongjun.li@daocloud.io>
-
Lain authored
-
Vadim Gimpelson authored
Signed-off-by: Vadim Gimpelson <vadim.gimpelson@centml.ai>
-
Sky Lee authored
Signed-off-by: lisiqi23 <lisiqi23@xiaomi.com>
Signed-off-by: skylee-01 <497627264@qq.com>
Co-authored-by: lisiqi23 <lisiqi23@xiaomi.com>
-
- 15 May, 2025 6 commits
-
-
TJian authored
[Bugfix] [ROCm]: Remove assertion logic when using AITER fused moe in unquantizedMethod to reenable LLama4 BF16 (#18205)
Signed-off-by: tjtanaa <tunjian.tan@embeddedllm.com>
-
Harry Mellor authored
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
omahs authored
Signed-off-by: omahs <73983677+omahs@users.noreply.github.com>
-
Mengqing Cao authored
Signed-off-by: Mengqing Cao <cmq0113@163.com>
-
inkcherry authored
Signed-off-by: inkcherry <mingzhi.liu@intel.com>
-
Harry Mellor authored
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
- 14 May, 2025 14 commits
-
-
Jerry Zhang authored
Signed-off-by: Jerry Zhang <jerryzh168@gmail.com>
-
bnellnm authored
-
Ekagra Ranjan authored
[V1][Spec Decode] Share input embedding of target model with EAGLE draft model to free ~1GB for llama 3 model (#17326)
Co-authored-by: root <root@ekagra-8xh100.us-east5-a.c.serving-efficiency-poc.internal>
Co-authored-by: Woosuk Kwon <woosuk.kwon@berkeley.edu>
-
Cyrus Leung authored
Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
-
TJian authored
Signed-off-by: tjtanaa <tunjian.tan@embeddedllm.com>
Co-authored-by: Gregory Shtrasberg <Gregory.Shtrasberg@amd.com>
-
Andrzej Kotłowski authored
Signed-off-by: Andrzej Kotłowski <akotlowski@habana.ai>
-
Jee Jee Li authored
Signed-off-by: Jee Jee Li <pandaleefree@gmail.com>
-
wang.yuqi authored
-
qli88 authored
Signed-off-by: Qiang Li <qiang.li2@amd.com>
-
vllmellm authored
Signed-off-by: tjtanaa <tunjian.tan@embeddedllm.com>
Co-authored-by: tjtanaa <tunjian.tan@embeddedllm.com>
-
Michael Goin authored
Signed-off-by: mgoin <mgoin64@gmail.com>
-
youkaichao authored
Signed-off-by: youkaichao <youkaichao@gmail.com>
-
Pavani Majety authored
Signed-off-by: Pavani Majety <pmajety@nvidia.com>
-
vllmellm authored
Signed-off-by: vllmellm <vllm.ellm@embeddedllm.com>
Signed-off-by: tjtanaa <tunjian.tan@embeddedllm.com>
Co-authored-by: tjtanaa <tunjian.tan@embeddedllm.com>
-
- 13 May, 2025 2 commits
-
-
Harry Mellor authored
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
Michael Goin authored
Signed-off-by: mgoin <mgoin64@gmail.com>
-