"vllm/v1/executor/ray_utils.py" did not exist on "2394962d7083f1c1001dba9efefadb674321e688"
Unverified Commit 4984a291 authored by Wentao Ye's avatar Wentao Ye Committed by GitHub
Browse files

[Doc] Fix Markdown Pre-commit Error (#24670)


Signed-off-by: default avataryewentao256 <zhyanwentao@126.com>
parent 404c85ca
......@@ -65,7 +65,7 @@ It is assumed you have already implemented your model in vLLM according to the b
- Implement the prompt construction via [get_generation_prompt][vllm.model_executor.models.interfaces.SupportsTranscription.get_generation_prompt]. The server passes you the resampled waveform and task parameters; you return a valid [PromptType][vllm.inputs.data.PromptType]. There are two common patterns:
#### A. Multimodal LLM with audio embeddings (e.g., Voxtral, Gemma3n)
### A. Multimodal LLM with audio embeddings (e.g., Voxtral, Gemma3n)
Return a dict containing `multi_modal_data` with the audio, and either a `prompt` string or `prompt_token_ids`:
......@@ -102,7 +102,7 @@ It is assumed you have already implemented your model in vLLM according to the b
For further clarification on multi modal inputs, please refer to [Multi-Modal Inputs](../../features/multimodal_inputs.md).
#### B. Encoder–decoder audio-only (e.g., Whisper)
### B. Encoder–decoder audio-only (e.g., Whisper)
Return a dict with separate `encoder_prompt` and `decoder_prompt` entries:
......@@ -142,7 +142,6 @@ It is assumed you have already implemented your model in vLLM according to the b
return cast(PromptType, prompt)
```
- (Optional) Language validation via [validate_language][vllm.model_executor.models.interfaces.SupportsTranscription.validate_language]
If your model requires a language and you want a default, override this method (see Whisper):
......@@ -177,7 +176,6 @@ It is assumed you have already implemented your model in vLLM according to the b
return int(audio_duration_s * stt_config.sample_rate // 320) # example
```
## 2. Audio preprocessing and chunking
The API server takes care of basic audio I/O and optional chunking before building prompts:
......@@ -264,6 +262,7 @@ Once your model implements `SupportsTranscription`, you can test the endpoints (
-F "model=$MODEL_ID" \
http://localhost:8000/v1/audio/translations
```
Or check out more examples in <gh-file:examples/online_serving>.
!!! note
......
Markdown is supported
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment