Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in
Toggle navigation
Menu
Open sidebar
OpenDAS
vllm_cscc
Commits
a30a605d
Unverified
Commit
a30a605d
authored
Nov 25, 2024
by
Cyrus Leung
Committed by
GitHub
Nov 25, 2024
Browse files
[Doc] Add encoder-based models to Supported Models page (#10616)
Signed-off-by:
DarkLight1337
<
tlleungac@connect.ust.hk
>
parent
571841b7
Changes
1
Show whitespace changes
Inline
Side-by-side
Showing
1 changed file
with
45 additions
and
0 deletions
+45
-0
docs/source/models/supported_models.rst
docs/source/models/supported_models.rst
+45
-0
No files found.
docs/source/models/supported_models.rst
View file @
a30a605d
...
...
@@ -325,6 +325,11 @@ Text Embedding
- Example HF Models
- :ref:`LoRA <lora>`
- :ref:`PP <distributed_serving>`
* - :code:`BertModel`
- BERT-based
- :code:`BAAI/bge-base-en-v1.5`, etc.
-
-
* - :code:`Gemma2Model`
- Gemma2-based
- :code:`BAAI/bge-multilingual-gemma2`, etc.
...
...
@@ -340,6 +345,16 @@ Text Embedding
- :code:`ssmits/Qwen2-7B-Instruct-embed-base`, :code:`Alibaba-NLP/gte-Qwen2-1.5B-instruct`, etc.
- ✅︎
- ✅︎
* - :code:`RobertaModel`, :code:`RobertaForMaskedLM`
- RoBERTa-based
- :code:`sentence-transformers/all-roberta-large-v1`, :code:`sentence-transformers/all-roberta-large-v1`, etc.
-
-
* - :code:`XLMRobertaModel`
- XLM-RoBERTa-based
- :code:`intfloat/multilingual-e5-large`, etc.
-
-
.. important::
Some model architectures support both generation and embedding tasks.
...
...
@@ -390,6 +405,36 @@ Classification
.. note::
As an interim measure, these models are supported in both offline and online inference via Embeddings API.
Sentence Pair Scoring
---------------------
.. list-table::
:widths: 25 25 50 5 5
:header-rows: 1
* - Architecture
- Models
- Example HF Models
- :ref:`LoRA <lora>`
- :ref:`PP <distributed_serving>`
* - :code:`BertForSequenceClassification`
- BERT-based
- :code:`cross-encoder/ms-marco-MiniLM-L-6-v2`, etc.
-
-
* - :code:`RobertaForSequenceClassification`
- RoBERTa-based
- :code:`cross-encoder/quora-roberta-base`, etc.
-
-
* - :code:`XLMRobertaForSequenceClassification`
- XLM-RoBERTa-based
- :code:`BAAI/bge-reranker-v2-m3`, etc.
-
-
.. note::
These models are supported in both offline and online inference via Score API.
Multimodal Language Models
^^^^^^^^^^^^^^^^^^^^^^^^^^
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment