- 05 Mar, 2025 3 commits
-
-
Tyler Michael Smith authored
-
Cody Yu authored
-
Michael Goin authored
Signed-off-by:Michael Goin <mgoin64@gmail.com>
-
- 04 Mar, 2025 2 commits
-
-
Siyuan Liu authored
Signed-off-by:
Siyuan Liu <lsiyuan@google.com> Co-authored-by:
mgoin <mgoin64@gmail.com>
-
Nick Hill authored
Signed-off-by:Nick Hill <nhill@redhat.com>
-
- 03 Mar, 2025 5 commits
-
-
iefgnoix authored
Signed-off-by:
Xiongfei Wei <isaacwxf23@gmail.com> Co-authored-by:
Michael Goin <mgoin64@gmail.com>
-
Mark McLoughlin authored
[WIP][[V1][Metrics] Implement max_num_generation_tokens, request_params_n, and request_params_max_tokens metrics (#14055) Signed-off-by:Mark McLoughlin <markmc@redhat.com>
-
Nick Hill authored
Signed-off-by:Nick Hill <nhill@redhat.com>
-
Mark McLoughlin authored
Signed-off-by:Mark McLoughlin <markmc@redhat.com>
-
Harry Mellor authored
-
- 02 Mar, 2025 1 commit
-
-
Jun Duan authored
Signed-off-by:Jun Duan <jun.duan.phd@outlook.com>
-
- 01 Mar, 2025 4 commits
-
-
Chen Zhang authored
-
Chen Zhang authored
-
Sage Moore authored
Signed-off-by:Sage Moore <sage@neuralmagic.com>
-
Li, Jiang authored
-
- 28 Feb, 2025 3 commits
-
-
Chen Zhang authored
Signed-off-by:
Chen Zhang <zhangch99@outlook.com> Co-authored-by:
Cody Yu <hao.yu.cody@gmail.com>
-
Chen Zhang authored
Signed-off-by:Chen Zhang <zhangch99@outlook.com>
-
iefgnoix authored
Signed-off-by:
Xiongfei Wei <isaacwxf23@gmail.com> Signed-off-by:
mgoin <mgoin64@gmail.com> Co-authored-by:
mgoin <mgoin64@gmail.com>
-
- 27 Feb, 2025 4 commits
-
-
Lucas Wilkinson authored
Signed-off-by:
Yang Chen <yangche@fb.com> Signed-off-by:
Lucas Wilkinson <lwilkinson@neuralmagic.com> Signed-off-by:
Lucas Wilkinson <lwilkins@redhat.com> Co-authored-by:
Yang Chen <yangche@fb.com>
-
Woosuk Kwon authored
Signed-off-by:Woosuk Kwon <woosuk.kwon@berkeley.edu>
-
Yang Chen authored
Signed-off-by:Yang Chen <yangche@fb.com>
-
Mark McLoughlin authored
-
- 26 Feb, 2025 1 commit
-
-
Lily Liu authored
-
- 25 Feb, 2025 4 commits
-
-
Varun Sundar Rabindranath authored
-
Mark McLoughlin authored
-
cjackal authored
Signed-off-by:cjackal <44624812+cjackal@users.noreply.github.com>
-
Harry Mellor authored
-
- 24 Feb, 2025 3 commits
-
-
Roger Wang authored
-
afeldman-nm authored
Signed-off-by:
Andrew Feldman <afeldman@neuralmagic.com> Co-authored-by:
Nick Hill <nhill@redhat.com>
-
Roger Wang authored
-
- 23 Feb, 2025 2 commits
-
-
Nick Hill authored
Even though ZMQ context.destroy() is meant to close open sockets before terminating the context, it appears to be necessary to do this explicitly or else it can hang in the context.term() method. Close zmq sockets explicitly before terminating context, make shutdown of client resource more robust, shut down engine core process prior to terminating zmq context. Signed-off-by:Nick Hill <nhill@redhat.com>
-
youkaichao authored
Signed-off-by:youkaichao <youkaichao@gmail.com>
-
- 22 Feb, 2025 8 commits
-
-
Sage Moore authored
[V1][Kernel] Refactor the prefix_prefill kernel so that the caller no longer has to pass in the context lengths (#13095)
-
youkaichao authored
Signed-off-by:youkaichao <youkaichao@gmail.com>
-
youkaichao authored
Signed-off-by:youkaichao <youkaichao@gmail.com>
-
Cyrus Leung authored
-
Mark McLoughlin authored
-
Mark McLoughlin authored
-
Jennifer Zhao authored
Signed-off-by:
Jennifer Zhao <7443418+JenZhao@users.noreply.github.com> Co-authored-by:
Roger Wang <ywang@roblox.com>
-
Lu Fang authored
Signed-off-by:Lu Fang <lufang@fb.com>
-