OpenDAS / vllm_cscc
Commit a30a605d
Authored Nov 25, 2024 by Cyrus Leung; committed by GitHub on Nov 25, 2024
[Doc] Add encoder-based models to Supported Models page (#10616)
Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
Parent: 571841b7
Showing 1 changed file with 45 additions and 0 deletions

docs/source/models/supported_models.rst
@@ -325,6 +325,11 @@ Text Embedding
    - Example HF Models
    - :ref:`LoRA <lora>`
    - :ref:`PP <distributed_serving>`
  * - :code:`BertModel`
    - BERT-based
    - :code:`BAAI/bge-base-en-v1.5`, etc.
    -
    -
  * - :code:`Gemma2Model`
    - Gemma2-based
    - :code:`BAAI/bge-multilingual-gemma2`, etc.
@@ -340,6 +345,16 @@ Text Embedding
    - :code:`ssmits/Qwen2-7B-Instruct-embed-base`, :code:`Alibaba-NLP/gte-Qwen2-1.5B-instruct`, etc.
    - ✅︎
    - ✅︎
  * - :code:`RobertaModel`, :code:`RobertaForMaskedLM`
    - RoBERTa-based
    - :code:`sentence-transformers/all-roberta-large-v1`, etc.
    -
    -
  * - :code:`XLMRobertaModel`
    - XLM-RoBERTa-based
    - :code:`intfloat/multilingual-e5-large`, etc.
    -
    -

.. important::
    Some model architectures support both generation and embedding tasks.
@@ -390,6 +405,36 @@ Classification
.. note::
    As an interim measure, these models are supported in both offline and online inference via the Embeddings API.

Sentence Pair Scoring
---------------------

.. list-table::
  :widths: 25 25 50 5 5
  :header-rows: 1

  * - Architecture
    - Models
    - Example HF Models
    - :ref:`LoRA <lora>`
    - :ref:`PP <distributed_serving>`
  * - :code:`BertForSequenceClassification`
    - BERT-based
    - :code:`cross-encoder/ms-marco-MiniLM-L-6-v2`, etc.
    -
    -
  * - :code:`RobertaForSequenceClassification`
    - RoBERTa-based
    - :code:`cross-encoder/quora-roberta-base`, etc.
    -
    -
  * - :code:`XLMRobertaForSequenceClassification`
    - XLM-RoBERTa-based
    - :code:`BAAI/bge-reranker-v2-m3`, etc.
    -
    -

.. note::
    These models are supported in both offline and online inference via the Score API.
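
To illustrate what online sentence-pair scoring might look like from a client, here is a minimal sketch using only the Python standard library. It is not taken from the commit. The endpoint path ``/v1/score`` and the request fields ``text_1`` and ``text_2`` are assumptions about the Score API that should be checked against the vLLM version in use, and the helper names ``build_score_request`` and ``post_score`` are hypothetical.

```python
import json
from urllib import request


def build_score_request(model: str, text_1: str, text_2: str) -> dict:
    """Assemble a JSON-serializable body for a sentence-pair scoring request.

    The field names are assumed, not confirmed by the commit shown here.
    """
    return {"model": model, "text_1": text_1, "text_2": text_2}


def post_score(base_url: str, payload: dict, path: str = "/v1/score") -> dict:
    """POST the payload to a running server and return the decoded JSON reply.

    The default path is an assumption; adjust it to match your server.
    """
    req = request.Request(
        base_url.rstrip("/") + path,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )
    with request.urlopen(req) as resp:
        return json.load(resp)


# Build (but do not send) a request for one of the example models in the table.
payload = build_score_request(
    "BAAI/bge-reranker-v2-m3",
    "What is the capital of France?",
    "Paris is the capital of France.",
)
print(json.dumps(payload, indent=2))
```

With a server already running and serving the same model, calling ``post_score("http://localhost:8000", payload)`` would send the request; the host and port here are placeholders.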

Multimodal Language Models
^^^^^^^^^^^^^^^^^^^^^^^^^^
...