- 25 Feb, 2025 1 commit
-
-
Robert Shaw authored
-
- 24 Feb, 2025 10 commits
-
-
Robert Shaw authored
Signed-off-by:rshaw@neuralmagic.com <rshaw@neuralmagic.com>
-
Michael Goin authored
Signed-off-by:mgoin <mgoin64@gmail.com>
-
Roger Wang authored
-
afeldman-nm authored
Signed-off-by:
Andrew Feldman <afeldman@neuralmagic.com> Co-authored-by:
Nick Hill <nhill@redhat.com>
-
Nicolò Lucchesi authored
[Misc][Docs] Raise error when flashinfer is not installed and `VLLM_ATTENTION_BACKEND` is set (#12513) Signed-off-by:NickLucche <nlucches@redhat.com>
-
Zhonghua Deng authored
-
Jongseok Park authored
-
Roger Meier authored
-
Mengqing Cao authored
-
Roger Wang authored
-
- 23 Feb, 2025 7 commits
-
-
Nick Hill authored
Even though ZMQ context.destroy() is meant to close open sockets before terminating the context, it appears to be necessary to do this explicitly or else it can hang in the context.term() method. Close zmq sockets explicitly before terminating context, make shutdown of client resource more robust, shut down engine core process prior to terminating zmq context. Signed-off-by:Nick Hill <nhill@redhat.com>
-
youkaichao authored
Signed-off-by:youkaichao <youkaichao@gmail.com>
-
Nick Hill authored
-
Isotr0py authored
-
Kyle Sayers authored
-
Kevin H. Luu authored
-
Andy Lo authored
Signed-off-by:Andy Lo <andy@mistral.ai>
-
- 22 Feb, 2025 19 commits
-
-
Helena Kloosterman authored
-
Gregory Shtrasberg authored
-
Sage Moore authored
[V1][Kernel] Refactor the prefix_prefill kernel so that the caller no longer has to pass in the context lengths (#13095)
-
Kaixi Hou authored
-
Keyun Tong authored
-
youkaichao authored
Signed-off-by:youkaichao <youkaichao@gmail.com>
-
youkaichao authored
Signed-off-by:youkaichao <youkaichao@gmail.com>
-
Cyrus Leung authored
-
Jee Jee Li authored
-
Mark McLoughlin authored
-
Mark McLoughlin authored
-
Yu Chin Fabian Lim authored
-
Jennifer Zhao authored
Signed-off-by:
Jennifer Zhao <7443418+JenZhao@users.noreply.github.com> Co-authored-by:
Roger Wang <ywang@roblox.com>
-
Lu Fang authored
Signed-off-by:Lu Fang <lufang@fb.com>
-
Shane A authored
-
Gordon Wong authored
-
Jun Duan authored
-
Robin authored
-
Keyun Tong authored
-
- 21 Feb, 2025 3 commits
-
-
Lucas Wilkinson authored
Signed-off-by:
Lucas Wilkinson <lwilkinson@neuralmagic.com> Signed-off-by:
Lucas Wilkinson <lwilkins@redhat.com> Co-authored-by:
Patrick Horn <patrick.horn@gmail.com> Co-authored-by:
simon-mo <xmo@berkeley.edu> Co-authored-by:
Tyler Michael Smith <tyler@neuralmagic.com>
-
Isotr0py authored
-
Gabriel Marinho authored
-