- 01 Feb, 2025 1 commit
-
-
Lucas Wilkinson authored
This PR implements the Deepseek V3 support by performing matrix absorption the fp8 weights --------- Signed-off-by:
Lucas Wilkinson <lwilkinson@neuralmagic.com> Co-authored-by:
Woosuk Kwon <woosuk.kwon@berkeley.edu> Co-authored-by:
simon-mo <simon.mo@hey.com> Co-authored-by:
Michael Goin <mgoin64@gmail.com> Co-authored-by:
Zhuohan Li <zhuohan123@gmail.com> Co-authored-by:
Tyler Michael Smith <tysmith@redhat.com> Co-authored-by:
Alexander Matveev <59768536+alexm-neuralmagic@users.noreply.github.com>
-
- 31 Jan, 2025 2 commits
-
-
Lucas Wilkinson authored
Signed-off-by:
Lucas Wilkinson <lwilkinson@neuralmagic.com> Signed-off-by:
simon-mo <xmo@berkeley.edu> Co-authored-by:
Woosuk Kwon <woosuk.kwon@berkeley.edu> Co-authored-by:
simon-mo <simon.mo@hey.com> Co-authored-by:
Michael Goin <mgoin64@gmail.com> Co-authored-by:
Zhuohan Li <zhuohan123@gmail.com> Co-authored-by:
Tyler Michael Smith <tysmith@redhat.com> Co-authored-by:
Alexander Matveev <59768536+alexm-neuralmagic@users.noreply.github.com> Co-authored-by:
simon-mo <xmo@berkeley.edu>
-
Aleksandr Malyshev authored
Signed-off-by:
Aleksandr Malyshev <maleksan@amd.com> Co-authored-by:
Aleksandr Malyshev <maleksan@amd.com>
-
- 28 Jan, 2025 1 commit
-
-
fenghuizhang authored
Signed-off-by:Fenghui Zhang <fhzhang@google.com>
-
- 27 Jan, 2025 2 commits
-
-
Bowen Wang authored
Signed-off-by:
Bowen Wang <abmfy@icloud.com> Signed-off-by:
youkaichao <youkaichao@gmail.com> Co-authored-by:
youkaichao <youkaichao@gmail.com>
-
Isotr0py authored
Signed-off-by:Isotr0py <2037008807@qq.com>
-
- 24 Jan, 2025 1 commit
-
-
Lucas Wilkinson authored
Signed-off-by:Lucas Wilkinson <lwilkinson@neuralmagic.com>
-
- 23 Jan, 2025 2 commits
-
-
Gregory Shtrasberg authored
Signed-off-by:
Gregory Shtrasberg <Gregory.Shtrasberg@amd.com> Co-authored-by:
Micah Williamson <micah.williamson@amd.com>
-
Lucas Wilkinson authored
Signed-off-by:Lucas Wilkinson <lwilkinson@neuralmagic.com>
-
- 22 Jan, 2025 1 commit
-
-
youkaichao authored
Signed-off-by:youkaichao <youkaichao@gmail.com>
-
- 20 Jan, 2025 1 commit
-
-
wangxiyuan authored
Signed-off-by:wangxiyuan <wangxiyuan1007@gmail.com>
-
- 15 Jan, 2025 1 commit
-
-
Elfie Guo authored
-
- 14 Jan, 2025 2 commits
-
-
Chen Zhang authored
Signed-off-by:Chen Zhang <zhangch99@outlook.com>
-
wangxiyuan authored
Signed-off-by:wangxiyuan <wangxiyuan1007@gmail.com>
-
- 09 Jan, 2025 1 commit
-
-
Cyrus Leung authored
Signed-off-by:
DarkLight1337 <tlleungac@connect.ust.hk> Signed-off-by:
Maxime Fournioux <55544262+mfournioux@users.noreply.github.com> Co-authored-by:
Maxime Fournioux <55544262+mfournioux@users.noreply.github.com>
-
- 06 Jan, 2025 2 commits
-
-
Chen Zhang authored
Signed-off-by:Chen Zhang <zhangch99@outlook.com>
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
- 02 Jan, 2025 1 commit
-
-
zhuwenwen authored
-
- 23 Dec, 2024 1 commit
-
-
Rafael Vasquez authored
Signed-off-by:Rafael Vasquez <rafvasq21@gmail.com>
-
- 17 Dec, 2024 1 commit
-
-
Isotr0py authored
Signed-off-by:Isotr0py <2037008807@qq.com>
-
- 13 Dec, 2024 2 commits
-
-
Jani Monoses authored
-
zhuwenwen authored
-
- 11 Dec, 2024 3 commits
-
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
王敏 authored
-
Tyler Michael Smith authored
Signed-off-by:Tyler Michael Smith <tyler@neuralmagic.com>
-
- 09 Dec, 2024 1 commit
-
-
Konrad Zawora authored
Signed-off-by:Konrad Zawora <kzawora@habana.ai>
-
- 05 Dec, 2024 1 commit
-
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
- 02 Dec, 2024 2 commits
-
-
youkaichao authored
Signed-off-by:youkaichao <youkaichao@gmail.com>
-
Maximilien de Bayser authored
Signed-off-by:Max de Bayser <mbayser@br.ibm.com>
-
- 27 Nov, 2024 2 commits
-
-
王敏 authored
2.更新medusa readme 3.解决benchmark_moe报错问题
-
Kunshang Ji authored
Signed-off-by:Kunshang Ji <kunshang.ji@intel.com>
-
- 23 Nov, 2024 1 commit
-
-
Isotr0py authored
Signed-off-by:Isotr0py <2037008807@qq.com>
-
- 22 Nov, 2024 1 commit
-
-
youkaichao authored
Signed-off-by:youkaichao <youkaichao@gmail.com>
-
- 21 Nov, 2024 1 commit
-
-
Pavani Majety authored
Signed-off-by:Pavani Majety <pmajety@nvidia.com>
-
- 20 Nov, 2024 2 commits
-
-
Woosuk Kwon authored
Signed-off-by:Woosuk Kwon <woosuk.kwon@berkeley.edu>
-
Li, Jiang authored
Signed-off-by:jiang1.li <jiang1.li@intel.com>
-
- 18 Nov, 2024 1 commit
-
-
王敏 authored
-
- 11 Nov, 2024 1 commit
-
-
Isotr0py authored
Signed-off-by:Isotr0py <2037008807@qq.com>
-
- 07 Nov, 2024 2 commits
-
-
Nicolò Lucchesi authored
Signed-off-by:NickLucche <nlucches@redhat.com>
-
Yan Ma authored
Signed-off-by:
Kunshang Ji <kunshang.ji@intel.com> Signed-off-by:
yan ma <yan.ma@intel.com> Co-authored-by:
Kunshang Ji <kunshang.ji@intel.com>
-