[Bugfix] Add fully sharded layer for QKVParallelLinearWithLora (#5665)
Co-authored-by: Antoni Baum <antoni.baum@protonmail.com>