"tests/entrypoints/pooling/basic/test_truncation.py" did not exist on "105d3d62efde2d750b985020bbcea888f94a139b"
- 08 Jul, 2024 1 commit
-
-
afeldman-nm authored
[Kernel] Correctly invoke prefill & decode kernels for cross-attention (towards eventual encoder/decoder model support) (#4888) Co-authored-by:Woosuk Kwon <woosuk.kwon@berkeley.edu>
-
- 04 Jun, 2024 1 commit
-
-
afeldman-nm authored
[Bugfix]: During testing, use pytest monkeypatch for safely overriding the env var that indicates the vLLM backend (#5210)
-