- 03 Mar, 2025 4 commits
-
-
Mark McLoughlin authored
[WIP][[V1][Metrics] Implement max_num_generation_tokens, request_params_n, and request_params_max_tokens metrics (#14055) Signed-off-by:Mark McLoughlin <markmc@redhat.com>
-
TJian authored
-
Cody Yu authored
Signed-off-by:Cody Yu <hao.yu.cody@gmail.com>
-
Harry Mellor authored
-
- 02 Mar, 2025 2 commits
-
-
Ce Gao authored
Signed-off-by:Ce Gao <cegao@tensorchord.ai>
-
Jee Jee Li authored
Signed-off-by:Jee Jee Li <pandaleefree@gmail.com>
-
- 01 Mar, 2025 1 commit
-
-
YajieWang authored
-
- 28 Feb, 2025 7 commits
-
-
Luka Govedič authored
[torch.compile] Fix RMSNorm + quant fusion in the non-cutlass-fp8 case, rename RedundantReshapesPass to NoopEliminationPass (#10902) Signed-off-by:luka <luka@neuralmagic.com>
-
Chen Zhang authored
Signed-off-by:
Chen Zhang <zhangch99@outlook.com> Co-authored-by:
Cody Yu <hao.yu.cody@gmail.com>
-
Chen Zhang authored
Signed-off-by:Chen Zhang <zhangch99@outlook.com>
-
Harry Mellor authored
-
Cyrus Leung authored
-
Harry Mellor authored
-
Travis Johnson authored
Signed-off-by:Travis Johnson <tsjohnso@us.ibm.com>
-
- 27 Feb, 2025 7 commits
-
-
Sage Moore authored
Signed-off-by:Sage Moore <sage@neuralmagic.com>
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
Isotr0py authored
-
Michael Goin authored
Signed-off-by:mgoin <mgoin64@gmail.com>
-
Mark McLoughlin authored
-
Rui Qiao authored
-
Lucas Wilkinson authored
Signed-off-by:
Lucas Wilkinson <lwilkinson@neuralmagic.com> Signed-off-by:
Lucas Wilkinson <lwilkins@redhat.com>
-
- 26 Feb, 2025 10 commits
-
-
Wallas Henrique authored
Signed-off-by:
Wallas Santos <wallashss@ibm.com> Signed-off-by:
Joe Runde <Joseph.Runde@ibm.com> Co-authored-by:
Joe Runde <Joseph.Runde@ibm.com>
-
Cyrus Leung authored
-
Joe Runde authored
Signed-off-by:Joe Runde <Joseph.Runde@ibm.com>
-
Florian Greinacher authored
-
Cyrus Leung authored
-
Roger Wang authored
-
Jee Jee Li authored
-
Harry Mellor authored
-
Lily Liu authored
-
Michael Goin authored
Signed-off-by:mgoin <mgoin64@gmail.com>
-
- 25 Feb, 2025 6 commits
-
-
Harry Mellor authored
Signed-off-by:Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
Liangfu Chen authored
Signed-off-by:
Liangfu Chen <liangfc@amazon.com> Co-authored-by:
George Novack <gnovack@amazon.com> Co-authored-by:
Aoyu Zhang <aoyuzhan@amazon.com>
-
Jee Jee Li authored
-
Gregory Shtrasberg authored
-
Varun Sundar Rabindranath authored
-
Harry Mellor authored
-
- 24 Feb, 2025 3 commits
-
-
afeldman-nm authored
Signed-off-by:
Andrew Feldman <afeldman@neuralmagic.com> Co-authored-by:
Nick Hill <nhill@redhat.com>
-
Jongseok Park authored
-
Kevin H. Luu authored
Signed-off-by: <> Co-authored-by:EC2 Default User <ec2-user@ip-172-31-63-253.us-west-2.compute.internal>
-