- 13 May, 2025 1 commit
-
-
zhuwenwen authored
support telechat2 and glm4 nn layout; remove log of request_id
-
- 09 May, 2025 2 commits
- 28 Apr, 2025 3 commits
-
-
Harry Mellor authored
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
Cyrus Leung authored
Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
-
Lucas Wilkinson authored
Signed-off-by: Lucas Wilkinson <lwilkinson@neuralmagic.com>
Co-authored-by: Jee Jee Li <pandaleefree@gmail.com>
Co-authored-by: Aaron Pham <contact@aarnphm.xyz>
-
- 27 Apr, 2025 2 commits
-
-
Cyrus Leung authored
Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
-
Flex Wang authored
[Misc] Change buckets of histogram_iteration_tokens to [1, 8, 16, 32, 64, 128, 256, 512, 1024, 2048, 4096, 8096] to represent number of tokens (#17033)
Signed-off-by: sfc-gh-zhwang <flex.wang@snowflake.com>
-
- 26 Apr, 2025 2 commits
-
-
changjun.lee authored
[Bugfix] Fix error due to an uninitialized tokenizer when using `skip_tokenizer_init` with `num_scheduler_steps` (#9276)
Signed-off-by: changjun.lee <pord7457@gmail.com>
-
rasmith authored
Signed-off-by: Randall Smith <Randall.Smith@amd.com>
-
- 25 Apr, 2025 3 commits
-
-
Benjamin Chislett authored
Signed-off-by: Bryan Lu <yuzhelu@amazon.com>
Signed-off-by: Benjamin Chislett <benjamin.chislett@centml.ai>
Co-authored-by: Bryan Lu <yuzhelu@amazon.com>
-
rasmith authored
[Quantization][FP8] Add support for FP8 models with input_scale for output projection and QK quantization (#15734)
Signed-off-by: Randall Smith <Randall.Smith@amd.com>
Signed-off-by: Luka Govedič <lgovedic@redhat.com>
Co-authored-by: Luka Govedič <lgovedic@redhat.com>
-
Harry Mellor authored
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
- 24 Apr, 2025 4 commits
-
-
Yinghai Lu authored
Signed-off-by: Yinghai Lu <yinghai@thinkingmachines.ai>
-
Harry Mellor authored
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
Harry Mellor authored
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
Woosuk Kwon authored
Signed-off-by: Woosuk Kwon <woosuk.kwon@berkeley.edu>
-
- 23 Apr, 2025 1 commit
-
-
Travis Johnson authored
Signed-off-by: Travis Johnson <tsjohnso@us.ibm.com>
-
- 22 Apr, 2025 5 commits
-
-
Harry Mellor authored
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
Cyrus Leung authored
Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
-
王敏 authored
-
wangxiyuan authored
Signed-off-by: wangxiyuan <wangxiyuan1007@gmail.com>
-
Michael Goin authored
Signed-off-by: mgoin <mgoin64@gmail.com>
Co-authored-by: Aurick Qiao <qiao@aurick.net>
-
- 20 Apr, 2025 1 commit
-
-
Harry Mellor authored
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
- 18 Apr, 2025 3 commits
-
-
Harry Mellor authored
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
Harry Mellor authored
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
Cyrus Leung authored
Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
-
- 17 Apr, 2025 4 commits
-
-
Yihua Cheng authored
Signed-off-by: ApostaC <yihua98@uchicago.edu>
Signed-off-by: rshaw@neuralmagic.com <robertgshaw2@gmail.com>
Signed-off-by: remi <remi@mistral.ai>
Co-authored-by: rshaw@neuralmagic.com <robertgshaw2@gmail.com>
Co-authored-by: Robert Shaw <114415538+robertgshaw2-redhat@users.noreply.github.com>
Co-authored-by: Rémi Delacourt <54138269+Flechman@users.noreply.github.com>
Co-authored-by: Tyler Michael Smith <tysmith@redhat.com>
-
Harry Mellor authored
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
David Heineman authored
Signed-off-by: David Heineman <david@davidheineman.com>
-
zhuwenwen authored
-
- 15 Apr, 2025 1 commit
-
-
Xihui Cang authored
Signed-off-by: Xihui Cang <xihuicang@gmail.com>
-
- 14 Apr, 2025 1 commit
-
-
Harry Mellor authored
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
- 12 Apr, 2025 1 commit
-
-
Cyrus Leung authored
Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
-
- 11 Apr, 2025 2 commits
-
-
Harry Mellor authored
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
Michael Goin authored
Signed-off-by: mgoin <mgoin64@gmail.com>
-
- 10 Apr, 2025 3 commits
-
-
Russell Bryant authored
Signed-off-by: Russell Bryant <rbryant@redhat.com>
-
Harry Mellor authored
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
Cyrus Leung authored
Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
-
- 09 Apr, 2025 1 commit
-
-
yihong authored
Signed-off-by: yihong0618 <zouzou0208@gmail.com>
-