- 18 Oct, 2024 1 commit
-
-
Cody Yu authored
-
- 17 Oct, 2024 1 commit
-
-
Kuntai Du authored
Removing the block manager v1. This is the initial piece of prefix-caching-centric design. In order to achieve prefix-caching-centric design, we need to simplify the code path so that we only use v2 block manager (which has much higher performance on prefix caching).
-
- 11 Oct, 2024 1 commit
-
-
youkaichao authored
Co-authored-by:Brendan Wong <bjwpokemon@gmail.com>
-
- 23 Jul, 2024 1 commit
-
-
Thomas Parnell authored
Signed-off-by:Thomas Parnell <tpa@zurich.ibm.com>
-
- 17 Jul, 2024 1 commit
-
-
Antoni Baum authored
-
- 16 Jul, 2024 1 commit
-
-
Cody Yu authored
-
- 29 Jun, 2024 1 commit
-
-
William Lin authored
Co-authored-by:Antoni Baum <antoni.baum@protonmail.com>
-
- 08 Jun, 2024 1 commit
-
-
youkaichao authored
[CI/Test] improve robustness of test by replacing del with context manager (vllm_runner) (#5357)
-
- 13 May, 2024 1 commit
-
-
Cyrus Leung authored
Since #4335 was merged, I've noticed that the definition of ServerRunner in the tests is the same as in the test for OpenAI API. I have moved the class to the test utilities to avoid code duplication. (Although it only has been repeated twice so far, I will add another similar test suite in #4200 which would duplicate the code a third time) Also, I have moved the test utilities file (test_utils.py) to under the test directory (tests/utils.py), since none of its code is actually used in the main package. Note that I have added __init__.py to each test subpackage and updated the ray.init() call in the test utilities file in order to relative import tests/utils.py.
-
- 04 May, 2024 1 commit
-
-
DearPlanet authored
-
- 02 May, 2024 1 commit
-
-
Ronen Schaffer authored
-
- 11 Mar, 2024 1 commit
-
-
Zhuohan Li authored
-
- 25 Feb, 2024 1 commit
-
-
Harry Mellor authored
-
- 22 Feb, 2024 1 commit
-
-
Ronen Schaffer authored
-
- 19 Feb, 2024 1 commit
-
-
Ronen Schaffer authored
-