- 13 May, 2024 2 commits
-
-
Cyrus Leung authored
Since #4335 was merged, I've noticed that the definition of ServerRunner in the tests is the same as in the test for the OpenAI API. I have moved the class to the test utilities to avoid code duplication. (Although it has only been repeated twice so far, I will add another similar test suite in #4200, which would duplicate the code a third time.) Also, I have moved the test utilities file (test_utils.py) under the test directory (tests/utils.py), since none of its code is actually used in the main package. Note that I have added __init__.py to each test subpackage and updated the ray.init() call in the test utilities file so that tests/utils.py can be imported with relative imports.
-
youkaichao authored
-
- 08 May, 2024 1 commit
-
-
youkaichao authored
[Core][Distributed] support both cpu and device tensor in broadcast tensor dict (#4660)
-
- 10 Apr, 2024 1 commit
-
-
youkaichao authored
[WIP][Core][Refactor] move vllm/model_executor/parallel_utils into vllm/distributed and vllm/device_communicators (#3950)
-
- 29 Mar, 2024 2 commits
-
-
youkaichao authored
[Core][Test] move local_rank to the last arg with default value to keep api compatible (#3711)
-
SangBin Cho authored
Co-authored-by: Simon Mo <simon.mo@hey.com>
-
- 28 Mar, 2024 1 commit
-
-
Roy authored
-
- 27 Mar, 2024 1 commit
-
-
youkaichao authored
-
- 25 Mar, 2024 1 commit
-
-
SangBin Cho authored
-
- 27 Jan, 2024 1 commit
-
-
Hanzhi Zhou authored
-
- 19 Jan, 2024 1 commit
-
-
Zhuohan Li authored
-
- 14 Jan, 2024 1 commit
-
-
Simon Mo authored
-
- 27 Dec, 2023 1 commit
-
-
Zhuohan Li authored
-
- 17 Nov, 2023 1 commit
-
-
Zhuohan Li authored
-
- 02 Oct, 2023 1 commit
-
-
Zhuohan Li authored
-