- 22 Jan, 2025 16 commits
-
-
Cody Yu authored
-
Konrad Zawora authored
Signed-off-by:Konrad Zawora <kzawora@habana.ai>
-
Jee Jee Li authored
Signed-off-by:Jee Jee Li <pandaleefree@gmail.com>
-
Robin authored
Signed-off-by:wangerxiao <863579016@qq.com>
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
Roger Wang authored
Signed-off-by:Roger Wang <ywang@roblox.com>
-
zhou fan authored
Signed-off-by:xffxff <1247714429@qq.com>
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
youkaichao authored
Signed-off-by:youkaichao <youkaichao@gmail.com>
-
Mengqing Cao authored
Signed-off-by:Mengqing Cao <cmq0113@163.com>
-
youkaichao authored
Signed-off-by:youkaichao <youkaichao@gmail.com>
-
Nick Hill authored
Signed-off-by:Nick Hill <nhill@redhat.com>
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
Liangfu Chen authored
Signed-off-by:Liangfu Chen <liangfc@amazon.com>
-
Kevin H. Luu authored
Signed-off-by:kevin <kevin@anyscale.com>
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
- 21 Jan, 2025 24 commits
-
-
Hongxia Yang authored
[Documentation][AMD] Add information about prebuilt ROCm vLLM docker for perf validation purpose (#12281) Signed-off-by:Hongxia Yang <hongxyan@amd.com>
-
Aleksandr Malyshev authored
Signed-off-by:
maleksan85 <maleksan@amd.com> Co-authored-by:
maleksan85 <maleksan@amd.com>
-
Divakar Verma authored
Signed-off-by:Divakar Verma <divakar.verma@amd.com>
-
Jani Monoses authored
-
wangxiyuan authored
Signed-off-by:wangxiyuan <wangxiyuan1007@gmail.com>
-
Adrian Cole authored
Signed-off-by:Adrian Cole <adrian.cole@elastic.co>
-
Andy Lo authored
Signed-off-by:Andy Lo <andy@mistral.ai>
-
Ricky Xu authored
Signed-off-by:rickyx <rickyx@anyscale.com>
-
Jinzhen Lin authored
Signed-off-by:Jinzhen Lin <linjinzhen@hotmail.com>
-
Jannis Schönleber authored
Signed-off-by:Jannis Schönleber <joennlae@gmail.com>
-
youkaichao authored
Signed-off-by:youkaichao <youkaichao@gmail.com>
-
Mengqing Cao authored
Signed-off-by:Mengqing Cao <cmq0113@163.com>
-
Thomas Parnell authored
Signed-off-by:Thomas Parnell <tpa@zurich.ibm.com>
-
Roger Wang authored
Signed-off-by:Roger Wang <ywang@roblox.com>
-
youkaichao authored
Signed-off-by:youkaichao <youkaichao@gmail.com>
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
Jee Jee Li authored
Signed-off-by:Jee Jee Li <pandaleefree@gmail.com>
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
youkaichao authored
Signed-off-by:youkaichao <youkaichao@gmail.com>
-
Nicolò Lucchesi authored
Signed-off-by:NickLucche <nlucches@redhat.com>
-
Gregory Shtrasberg authored
Signed-off-by:Gregory Shtrasberg <Gregory.Shtrasberg@amd.com>
-
Michael Goin authored
Signed-off-by:mgoin <michael@neuralmagic.com>
-
Jinzhen Lin authored
[Kernel] optimize moe_align_block_size for cuda graph and large num_experts (e.g. DeepSeek-V3) (#12222) Signed-off-by:
Jinzhen Lin <linjinzhen@hotmail.com> Co-authored-by:
Michael Goin <mgoin@redhat.com> Co-authored-by:
Tyler Michael Smith <tyler@neuralmagic.com>
-