"tests/vscode:/vscode.git/clone" did not exist on "60508ffda91c22e4cde3b18f149d222211db8886"
Unverified Commit 07b8fae2 authored by Kyle Yu's avatar Kyle Yu Committed by GitHub
Browse files

[Doc] correct LoRA capitalization (#20135)


Signed-off-by: default avatarkyolebu <kyu@redhat.com>
parent 56230881
...@@ -40,7 +40,7 @@ vLLM is flexible and easy to use with: ...@@ -40,7 +40,7 @@ vLLM is flexible and easy to use with:
- OpenAI-compatible API server - OpenAI-compatible API server
- Support NVIDIA GPUs, AMD CPUs and GPUs, Intel CPUs, Gaudi® accelerators and GPUs, IBM Power CPUs, TPU, and AWS Trainium and Inferentia Accelerators. - Support NVIDIA GPUs, AMD CPUs and GPUs, Intel CPUs, Gaudi® accelerators and GPUs, IBM Power CPUs, TPU, and AWS Trainium and Inferentia Accelerators.
- Prefix caching support - Prefix caching support
- Multi-lora support - Multi-LoRA support
For more information, check out the following: For more information, check out the following:
......
...@@ -427,7 +427,7 @@ Specified using `--task embed`. ...@@ -427,7 +427,7 @@ Specified using `--task embed`.
See [relevant issue on HF Transformers](https://github.com/huggingface/transformers/issues/34882). See [relevant issue on HF Transformers](https://github.com/huggingface/transformers/issues/34882).
!!! note !!! note
`jinaai/jina-embeddings-v3` supports multiple tasks through lora, while vllm temporarily only supports text-matching tasks by merging lora weights. `jinaai/jina-embeddings-v3` supports multiple tasks through LoRA, while vllm temporarily only supports text-matching tasks by merging LoRA weights.
!!! note !!! note
The second-generation GTE model (mGTE-TRM) is named `NewModel`. The name `NewModel` is too generic, you should set `--hf-overrides '{"architectures": ["GteNewModel"]}'` to specify the use of the `GteNewModel` architecture. The second-generation GTE model (mGTE-TRM) is named `NewModel`. The name `NewModel` is too generic, you should set `--hf-overrides '{"architectures": ["GteNewModel"]}'` to specify the use of the `GteNewModel` architecture.
......
Markdown is supported
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment