Unverified Commit a30a605d authored by Cyrus Leung's avatar Cyrus Leung Committed by GitHub
Browse files

[Doc] Add encoder-based models to Supported Models page (#10616)


Signed-off-by: default avatarDarkLight1337 <tlleungac@connect.ust.hk>
parent 571841b7
...@@ -325,6 +325,11 @@ Text Embedding ...@@ -325,6 +325,11 @@ Text Embedding
- Example HF Models - Example HF Models
- :ref:`LoRA <lora>` - :ref:`LoRA <lora>`
- :ref:`PP <distributed_serving>` - :ref:`PP <distributed_serving>`
* - :code:`BertModel`
- BERT-based
- :code:`BAAI/bge-base-en-v1.5`, etc.
-
-
* - :code:`Gemma2Model` * - :code:`Gemma2Model`
- Gemma2-based - Gemma2-based
- :code:`BAAI/bge-multilingual-gemma2`, etc. - :code:`BAAI/bge-multilingual-gemma2`, etc.
...@@ -340,6 +345,16 @@ Text Embedding ...@@ -340,6 +345,16 @@ Text Embedding
- :code:`ssmits/Qwen2-7B-Instruct-embed-base`, :code:`Alibaba-NLP/gte-Qwen2-1.5B-instruct`, etc. - :code:`ssmits/Qwen2-7B-Instruct-embed-base`, :code:`Alibaba-NLP/gte-Qwen2-1.5B-instruct`, etc.
- ✅︎ - ✅︎
- ✅︎ - ✅︎
* - :code:`RobertaModel`, :code:`RobertaForMaskedLM`
- RoBERTa-based
- :code:`sentence-transformers/all-roberta-large-v1`, :code:`sentence-transformers/all-roberta-large-v1`, etc.
-
-
* - :code:`XLMRobertaModel`
- XLM-RoBERTa-based
- :code:`intfloat/multilingual-e5-large`, etc.
-
-
.. important:: .. important::
Some model architectures support both generation and embedding tasks. Some model architectures support both generation and embedding tasks.
...@@ -390,6 +405,36 @@ Classification ...@@ -390,6 +405,36 @@ Classification
.. note:: .. note::
As an interim measure, these models are supported in both offline and online inference via Embeddings API. As an interim measure, these models are supported in both offline and online inference via Embeddings API.
Sentence Pair Scoring
---------------------
.. list-table::
:widths: 25 25 50 5 5
:header-rows: 1
* - Architecture
- Models
- Example HF Models
- :ref:`LoRA <lora>`
- :ref:`PP <distributed_serving>`
* - :code:`BertForSequenceClassification`
- BERT-based
- :code:`cross-encoder/ms-marco-MiniLM-L-6-v2`, etc.
-
-
* - :code:`RobertaForSequenceClassification`
- RoBERTa-based
- :code:`cross-encoder/quora-roberta-base`, etc.
-
-
* - :code:`XLMRobertaForSequenceClassification`
- XLM-RoBERTa-based
- :code:`BAAI/bge-reranker-v2-m3`, etc.
-
-
.. note::
These models are supported in both offline and online inference via Score API.
Multimodal Language Models Multimodal Language Models
^^^^^^^^^^^^^^^^^^^^^^^^^^ ^^^^^^^^^^^^^^^^^^^^^^^^^^
......
Markdown is supported
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment