- 04 Mar, 2025 10 commits
-
-
Mark McLoughlin authored
Signed-off-by: Mark McLoughlin <markmc@redhat.com>
-
Nick Hill authored
Signed-off-by: Nick Hill <nhill@redhat.com>
-
Michael Goin authored
Signed-off-by: mgoin <mgoin64@gmail.com>
-
youkaichao authored
Signed-off-by: youkaichao <youkaichao@gmail.com>
-
youkaichao authored
Signed-off-by: youkaichao <youkaichao@gmail.com>
-
Zhanwen Chen authored
-
Rui Qiao authored
Signed-off-by: Rui Qiao <ruisearch42@gmail.com>
-
Cody Yu authored
Signed-off-by: Cody Yu <hao.yu.cody@gmail.com>
-
Divakar Verma authored
Signed-off-by: Divakar Verma <divakar.verma@amd.com>
-
Travis Johnson authored
Signed-off-by: Travis Johnson <tsjohnso@us.ibm.com>
Co-authored-by: Cyrus Leung <cyrus.tl.leung@gmail.com>
-
- 03 Mar, 2025 14 commits
-
-
iefgnoix authored
Signed-off-by: Xiongfei Wei <isaacwxf23@gmail.com>
Co-authored-by: Michael Goin <mgoin64@gmail.com>
-
Michael Goin authored
Signed-off-by: mgoin <mgoin64@gmail.com>
-
Michael Goin authored
Signed-off-by: mgoin <mgoin64@gmail.com>
-
Mark McLoughlin authored
[WIP][[V1][Metrics] Implement max_num_generation_tokens, request_params_n, and request_params_max_tokens metrics (#14055)
Signed-off-by: Mark McLoughlin <markmc@redhat.com>
-
Nick Hill authored
Signed-off-by: Nick Hill <nhill@redhat.com>
-
Mark McLoughlin authored
Signed-off-by: Mark McLoughlin <markmc@redhat.com>
-
Mark McLoughlin authored
Signed-off-by: Mark McLoughlin <markmc@redhat.com>
-
Harry Mellor authored
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
TJian authored
-
Mark McLoughlin authored
Signed-off-by: Mark McLoughlin <markmc@redhat.com>
-
Cody Yu authored
Signed-off-by: Cody Yu <hao.yu.cody@gmail.com>
-
Mengqing Cao authored
Signed-off-by: Mengqing Cao <cmq0113@163.com>
-
wang.yuqi authored
-
Harry Mellor authored
-
- 02 Mar, 2025 3 commits
-
-
Ce Gao authored
Signed-off-by: Ce Gao <cegao@tensorchord.ai>
-
Jun Duan authored
Signed-off-by: Jun Duan <jun.duan.phd@outlook.com>
-
Jee Jee Li authored
Signed-off-by: Jee Jee Li <pandaleefree@gmail.com>
-
- 01 Mar, 2025 8 commits
-
-
Chen Zhang authored
-
Chen Zhang authored
-
Sage Moore authored
Signed-off-by: Sage Moore <sage@neuralmagic.com>
-
Woosuk Kwon authored
Signed-off-by: Woosuk Kwon <woosuk.kwon@berkeley.edu>
-
Isotr0py authored
-
Li, Jiang authored
-
YajieWang authored
-
Jee Jee Li authored
Signed-off-by: Jee Jee Li <pandaleefree@gmail.com>
-
- 28 Feb, 2025 5 commits
-
-
Luka Govedič authored
[torch.compile] Fix RMSNorm + quant fusion in the non-cutlass-fp8 case, rename RedundantReshapesPass to NoopEliminationPass (#10902)
Signed-off-by: luka <luka@neuralmagic.com>
-
Rui Qiao authored
Signed-off-by: Rui Qiao <ruisearch42@gmail.com>
-
Chen Zhang authored
Signed-off-by: Chen Zhang <zhangch99@outlook.com>
Co-authored-by: Cody Yu <hao.yu.cody@gmail.com>
-
Chen Zhang authored
Signed-off-by: Chen Zhang <zhangch99@outlook.com>
-
iefgnoix authored
Signed-off-by: Xiongfei Wei <isaacwxf23@gmail.com>
Signed-off-by: mgoin <mgoin64@gmail.com>
Co-authored-by: mgoin <mgoin64@gmail.com>
-