"llm/llama.cpp/.devops/llama-server-cuda.Dockerfile" did not exist on "ecd2f176277db4f074e25a2c3646b04b51cec119"
Commit 71221692 authored by Ying Sheng, committed by GitHub

[Feature] Initial support for multi-LoRA serving (#1307)

parent c33d82a2
@@ -7,6 +7,7 @@ suites = {
"minimal": [
"models/test_embedding_models.py",
"models/test_generation_models.py",
"models/test_lora.py",
"sampling/penaltylib",
"test_chunked_prefill.py",
"test_embedding_openai_server.py",