"tests/compile/fullgraph/test_full_graph.py" did not exist on "50b8d08dbd4493327e344bc627a0613947deba8f"
README.md 2.13 KB
Newer Older
1
2
3
4
5
# Pooling models

## Cohere rerank usage

```bash
6
# vllm serve BAAI/bge-reranker-base
7
8
9
python examples/online_serving/pooling/cohere_rerank_client.py
```

10
## Embedding requests base64 encoding_format usage
11
12

```bash
13
# vllm serve intfloat/e5-small
14
15
16
17
18
19
python examples/online_serving/pooling/embedding_requests_base64_client.py
```

## Embedding requests bytes encoding_format usage

```bash
20
# vllm serve intfloat/e5-small
21
python examples/online_serving/pooling/embedding_requests_bytes_client.py
22
23
```

24
25
26
## Jinaai rerank usage

```bash
27
# vllm serve BAAI/bge-reranker-base
28
29
30
python examples/online_serving/pooling/jinaai_rerank_client.py
```

31
32
33
## Multi vector retrieval usage

```bash
34
# vllm serve BAAI/bge-m3
35
36
37
python examples/online_serving/pooling/multi_vector_retrieval_client.py
```

38
39
40
## Named Entity Recognition (NER) usage

```bash
41
# vllm serve boltuix/NeuroBERT-NER
42
python examples/online_serving/pooling/ner_client.py
43
44
```

45
## OpenAI chat embedding for multimodal usage
46
47
48
49
50

```bash
python examples/online_serving/pooling/openai_chat_embedding_client_for_multimodal.py
```

51
## OpenAI classification usage
52
53

```bash
54
# vllm serve jason9693/Qwen2.5-1.5B-apeach
55
56
57
python examples/online_serving/pooling/openai_classification_client.py
```

58
## OpenAI cross_encoder score usage
59
60

```bash
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
# vllm serve BAAI/bge-reranker-v2-m3
python examples/online_serving/pooling/openai_cross_encoder_score.py
```

## OpenAI cross_encoder score for multimodal usage

```bash
# vllm serve jinaai/jina-reranker-m0
python examples/online_serving/pooling/openai_cross_encoder_score_for_multimodal.py
```

## OpenAI embedding usage

```bash
# vllm serve intfloat/e5-small
76
77
78
python examples/online_serving/pooling/openai_embedding_client.py
```

79
## OpenAI embedding matryoshka dimensions usage
80
81

```bash
82
# vllm serve jinaai/jina-embeddings-v3 --trust-remote-code
83
84
85
python examples/online_serving/pooling/openai_embedding_matryoshka_fy.py
```

86
## OpenAI pooling usage
87
88

```bash
89
# vllm serve internlm/internlm2-1_8b-reward --trust-remote-code
90
91
python examples/online_serving/pooling/openai_pooling_client.py
```
92
93
94
95
96
97

## Online Prithvi Geospatial MAE usage

```bash
python examples/online_serving/pooling/prithvi_geospatial_mae.py
```