- 05 Dec, 2024 1 commit
zhuwenwen authored
- 27 Nov, 2024 1 commit
zhuwenwen authored
add VLLM_OPTEST_MODELS_PATH/OPTEST_MODELS_PATH to load models from local path instead of Hugging Face Hub
- 13 Sep, 2024 1 commit
Cyrus Leung authored
- 21 Aug, 2024 1 commit
Peter Salas authored
- 09 Aug, 2024 1 commit
Cyrus Leung authored
- 06 Aug, 2024 1 commit
afeldman-nm authored
[Core] Subclass ModelRunner to support cross-attention & encoder sequences (towards eventual encoder/decoder model support) (#4942)
Co-authored-by: Andrew Feldman <afeld2012@gmail.com>
Co-authored-by: Nick Hill <nickhill@us.ibm.com>