- 13 May, 2025 1 commit
-
zhuwenwen authored
support telechat2 and glm4 nn layout; remove log of request_id
-
- 09 May, 2025 12 commits
- 08 May, 2025 4 commits
- 07 May, 2025 4 commits
- 06 May, 2025 2 commits
- 02 May, 2025 2 commits
-
Lucas Wilkinson authored
Signed-off-by: Lucas Wilkinson <lwilkinson@neuralmagic.com>
-
Robert Shaw authored
Signed-off-by: rshaw@neuralmagic.com <robertgshaw2@gmail.com>
-
- 30 Apr, 2025 3 commits
- 29 Apr, 2025 3 commits
- 28 Apr, 2025 9 commits
-
Michael Goin authored
Signed-off-by: mgoin <mgoin64@gmail.com>
-
Simon Mo authored
Signed-off-by: simon-mo <xmo@berkeley.edu>
-
Charlie Fu authored
Signed-off-by: charlifu <charlifu@amd.com>
-
Lucas Wilkinson authored
[BugFix] Fix cascade attention - RuntimeError: scheduler_metadata must have shape (metadata_size) (#17283)
Signed-off-by: Lucas Wilkinson <lwilkinson@neuralmagic.com>
-
Russell Bryant authored
Signed-off-by: Russell Bryant <rbryant@redhat.com>
-
Harry Mellor authored
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
Harry Mellor authored
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
Harry Mellor authored
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
Cyrus Leung authored
Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
-