"vscode:/vscode.git/clone" did not exist on "37aadf623786a0fb22a29051b8084168f18db1c9"
- 25 Feb, 2025 3 commits
-
-
wangxiyuan authored
Signed-off-by:wangxiyuan <wangxiyuan1007@gmail.com>
-
cjackal authored
Signed-off-by:cjackal <44624812+cjackal@users.noreply.github.com>
-
Harry Mellor authored
-
- 23 Feb, 2025 1 commit
-
-
Andy Lo authored
Signed-off-by:Andy Lo <andy@mistral.ai>
-
- 22 Feb, 2025 2 commits
-
-
Sage Moore authored
[V1][Kernel] Refactor the prefix_prefill kernel so that the caller no longer has to pass in the context lengths (#13095)
-
Gordon Wong authored
-
- 21 Feb, 2025 2 commits
-
-
Lucas Wilkinson authored
Signed-off-by:
Lucas Wilkinson <lwilkinson@neuralmagic.com> Signed-off-by:
Lucas Wilkinson <lwilkins@redhat.com> Co-authored-by:
Patrick Horn <patrick.horn@gmail.com> Co-authored-by:
simon-mo <xmo@berkeley.edu> Co-authored-by:
Tyler Michael Smith <tyler@neuralmagic.com>
-
Lingfan Yu authored
Signed-off-by:Lingfan Yu <lingfany@amazon.com>
-
- 20 Feb, 2025 1 commit
-
-
Gregory Shtrasberg authored
-
- 19 Feb, 2025 1 commit
-
-
Yu-Zhou authored
Signed-off-by:
yuzhou <yuzhou@habana.ai> Signed-off-by:
zhouyu5 <yu.zhou@intel.com> Co-authored-by:
Cody Yu <hao.yu.cody@gmail.com>
-
- 15 Feb, 2025 1 commit
-
-
Tyler Michael Smith authored
-
- 14 Feb, 2025 2 commits
-
-
Sage Moore authored
-
Michael Goin authored
-
- 12 Feb, 2025 2 commits
-
-
Lingfan Yu authored
[Neuron][Kernel] Support Longer Sequences in NKI-based Flash PagedAttention and Improve Efficiency (#12921) Signed-off-by:Lingfan Yu <lingfany@amazon.com>
-
Christian Pinto authored
-
- 08 Feb, 2025 1 commit
-
-
youkaichao authored
Signed-off-by:youkaichao <youkaichao@gmail.com>
-
- 06 Feb, 2025 4 commits
-
-
Yu Chin Fabian Lim authored
Signed-off-by:
Yu Chin Fabian Lim <flim@sg.ibm.com> Signed-off-by:
Tyler Michael Smith <tyler@neuralmagic.com> Co-authored-by:
Tyler Michael Smith <tyler@neuralmagic.com>
-
Isotr0py authored
-
Isotr0py authored
-
Lucas Wilkinson authored
Signed-off-by:Lucas Wilkinson <lwilkinson@neuralmagic.com>
-
- 05 Feb, 2025 6 commits
-
-
Sanju C Sudhakaran authored
-
Chen Zhang authored
Merged via CLI script
-
Harry Mellor authored
-
Isotr0py authored
-
Aleksandr Malyshev authored
Signed-off-by:
Aleksandr Malyshev <maleksan@amd.com> Co-authored-by:
Aleksandr Malyshev <maleksan@amd.com>
-
Lucas Wilkinson authored
Signed-off-by:
simon-mo <xmo@berkeley.edu> Signed-off-by:
Lucas Wilkinson <lcwilkins@redhat.com> Signed-off-by:
Lucas Wilkinson <lwilkins@redhat.com> Signed-off-by:
Lucas Wilkinson <lwilkinson@neuralmagic.com> Co-authored-by:
simon-mo <xmo@berkeley.edu>
-
- 04 Feb, 2025 1 commit
-
-
Hongxia Yang authored
Signed-off-by:
Hongxia Yang <hongxia.yang@amd.com> Co-authored-by:
Matthew Wong <Matthew.Wong2@amd.com>
-
- 03 Feb, 2025 1 commit
-
-
Russell Bryant authored
Signed-off-by:Russell Bryant <rbryant@redhat.com>
-
- 02 Feb, 2025 1 commit
-
-
Russell Bryant authored
- **Add SPDX license headers to python source files** - **Check for SPDX headers using pre-commit** commit 9d7ef44c3cfb72ca4c32e1c677d99259d10d4745 Author: Russell Bryant <rbryant@redhat.com> Date: Fri Jan 31 14:18:24 2025 -0500 Add SPDX license headers to python source files This commit adds SPDX license headers to python source files as recommended to the project by the Linux Foundation. These headers provide a concise way that is both human and machine readable for communicating license information for each source file. It helps avoid any ambiguity about the license of the code and can also be easily used by tools to help manage license compliance. The Linux Foundation runs license scans against the codebase to help ensure we are in compliance with the licenses of the code we use, including dependencies. Having these headers in place helps that tool do its job. More information can be found on ...
-
- 01 Feb, 2025 1 commit
-
-
Lucas Wilkinson authored
This PR implements the Deepseek V3 support by performing matrix absorption the fp8 weights --------- Signed-off-by:
Lucas Wilkinson <lwilkinson@neuralmagic.com> Co-authored-by:
Woosuk Kwon <woosuk.kwon@berkeley.edu> Co-authored-by:
simon-mo <simon.mo@hey.com> Co-authored-by:
Michael Goin <mgoin64@gmail.com> Co-authored-by:
Zhuohan Li <zhuohan123@gmail.com> Co-authored-by:
Tyler Michael Smith <tysmith@redhat.com> Co-authored-by:
Alexander Matveev <59768536+alexm-neuralmagic@users.noreply.github.com>
-
- 31 Jan, 2025 2 commits
-
-
Lucas Wilkinson authored
Signed-off-by:
Lucas Wilkinson <lwilkinson@neuralmagic.com> Signed-off-by:
simon-mo <xmo@berkeley.edu> Co-authored-by:
Woosuk Kwon <woosuk.kwon@berkeley.edu> Co-authored-by:
simon-mo <simon.mo@hey.com> Co-authored-by:
Michael Goin <mgoin64@gmail.com> Co-authored-by:
Zhuohan Li <zhuohan123@gmail.com> Co-authored-by:
Tyler Michael Smith <tysmith@redhat.com> Co-authored-by:
Alexander Matveev <59768536+alexm-neuralmagic@users.noreply.github.com> Co-authored-by:
simon-mo <xmo@berkeley.edu>
-
Aleksandr Malyshev authored
Signed-off-by:
Aleksandr Malyshev <maleksan@amd.com> Co-authored-by:
Aleksandr Malyshev <maleksan@amd.com>
-
- 28 Jan, 2025 4 commits
-
-
fenghuizhang authored
Signed-off-by:Fenghui Zhang <fhzhang@google.com>
-
Mengqing Cao authored
Signed-off-by:Mengqing Cao <cmq0113@163.com>
-
Liangfu Chen authored
Signed-off-by:
Liangfu Chen <liangfc@amazon.com> Co-authored-by:
Jiangfei Duan <jfduan@outlook.com>
-
Harry Mellor authored
Signed-off-by:Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
- 27 Jan, 2025 2 commits
-
-
Bowen Wang authored
Signed-off-by:
Bowen Wang <abmfy@icloud.com> Signed-off-by:
youkaichao <youkaichao@gmail.com> Co-authored-by:
youkaichao <youkaichao@gmail.com>
-
Isotr0py authored
Signed-off-by:Isotr0py <2037008807@qq.com>
-
- 26 Jan, 2025 2 commits
-
-
Tyler Michael Smith authored
-
Roger Wang authored
-