- 07 Oct, 2024 1 commit
-
-
youkaichao authored
-
- 01 Oct, 2024 1 commit
-
-
Lily Liu authored
-
- 26 Sep, 2024 1 commit
-
-
Nick Hill authored
-
- 24 Sep, 2024 1 commit
-
-
youkaichao authored
Co-authored-by:Brendan Wong <bjwpokemon@gmail.com>
-
- 23 Sep, 2024 1 commit
-
-
jiqing-feng authored
-
- 22 Sep, 2024 1 commit
-
-
Lily Liu authored
-
- 21 Sep, 2024 2 commits
-
-
youkaichao authored
-
Cyrus Leung authored
-
- 02 Sep, 2024 1 commit
-
-
Lily Liu authored
-
- 28 Aug, 2024 1 commit
-
-
Cyrus Leung authored
-
- 19 Aug, 2024 2 commits
-
-
Peng Guanwen authored
Co-authored-by:Michael Goin <michael@neuralmagic.com>
-
SangBin Cho authored
-
- 16 Aug, 2024 1 commit
-
-
jon-chuang authored
-
- 09 Aug, 2024 1 commit
-
-
William Lin authored
-
- 06 Aug, 2024 1 commit
-
-
Lily Liu authored
-
- 30 Jul, 2024 1 commit
-
-
Nick Hill authored
-
- 29 Jul, 2024 1 commit
-
-
Peng Guanwen authored
-
- 19 Jul, 2024 1 commit
-
-
Thomas Parnell authored
Signed-off-by:
Thomas Parnell <tpa@zurich.ibm.com> Co-authored-by:
Nick Hill <nickhill@us.ibm.com>
-
- 15 Jul, 2024 1 commit
-
-
Thomas Parnell authored
Signed-off-by:Thomas Parnell <tpa@zurich.ibm.com>
-
- 01 Jul, 2024 1 commit
-
-
sroy745 authored
-
- 27 Jun, 2024 1 commit
-
-
Roger Wang authored
-
- 19 Jun, 2024 1 commit
-
-
Thomas Parnell authored
Signed-off-by:Thomas Parnell <tpa@zurich.ibm.com>
-
- 18 Jun, 2024 1 commit
-
-
sroy745 authored
[Speculative Decoding 1/2 ] Add typical acceptance sampling as one of the sampling techniques in the verifier (#5131)
-
- 15 Jun, 2024 1 commit
-
-
Cyrus Leung authored
-
- 08 Jun, 2024 2 commits
-
-
youkaichao authored
[CI/Test] improve robustness of test by replacing del with context manager (vllm_runner) (#5357)
-
youkaichao authored
[CI/Test] improve robustness of test by replacing del with context manager (hf_runner) (#5347)
-
- 05 Jun, 2024 1 commit
-
-
zifeitong authored
-
- 02 Jun, 2024 1 commit
-
-
Simon Mo authored
-
- 28 May, 2024 1 commit
-
-
Cyrus Leung authored
Co-authored-by:Roger Wang <ywang@roblox.com>
-
- 13 May, 2024 1 commit
-
-
Cyrus Leung authored
Since #4335 was merged, I've noticed that the definition of ServerRunner in the tests is the same as in the test for OpenAI API. I have moved the class to the test utilities to avoid code duplication. (Although it only has been repeated twice so far, I will add another similar test suite in #4200 which would duplicate the code a third time) Also, I have moved the test utilities file (test_utils.py) to under the test directory (tests/utils.py), since none of its code is actually used in the main package. Note that I have added __init__.py to each test subpackage and updated the ray.init() call in the test utilities file in order to relative import tests/utils.py.
-
- 11 May, 2024 1 commit
-
-
Chang Su authored
-
- 09 May, 2024 1 commit
-
-
Woosuk Kwon authored
-
- 08 May, 2024 1 commit
-
-
Cody Yu authored
Co-authored-by:Cade Daniel <edacih@gmail.com>
-
- 03 May, 2024 1 commit
-
-
SangBin Cho authored
-
- 01 May, 2024 1 commit
-
-
SangBin Cho authored
-
- 27 Apr, 2024 1 commit
-
-
Nick Hill authored
Co-authored-by:DefTruth <31974251+deftruth@users.noreply.github.com>
-
- 26 Apr, 2024 1 commit
-
-
SangBin Cho authored
-
- 23 Apr, 2024 1 commit
-
-
Cade Daniel authored
-
- 16 Apr, 2024 1 commit
-
-
Antoni Baum authored
-
- 11 Apr, 2024 1 commit
-
-
Nick Hill authored
-