Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in
Toggle navigation
Menu
Open sidebar
OpenDAS
vllm_cscc
Commits
c0efe92d
Unverified
Commit
c0efe92d
authored
Jan 07, 2025
by
Cyrus Leung
Committed by
GitHub
Jan 07, 2025
Browse files
[Doc] Add note to `gte-Qwen2` models (#11808)
Signed-off-by:
DarkLight1337
<
tlleungac@connect.ust.hk
>
parent
d9fa1c05
Changes
1
Show whitespace changes
Inline
Side-by-side
Showing
1 changed file
with
3 additions
and
0 deletions
+3
-0
docs/source/models/supported_models.md
docs/source/models/supported_models.md
+3
-0
No files found.
docs/source/models/supported_models.md
View file @
c0efe92d
...
...
@@ -430,6 +430,9 @@ You can set `--hf-overrides '{"is_causal": false}'` to change the attention mask
On the other hand, its 1.5B variant (`Alibaba-NLP/gte-Qwen2-1.5B-instruct`) uses causal attention
despite being described otherwise on its model card.
Regardless of the variant, you need to enable `--trust-remote-code` for the correct tokenizer to be
loaded. See [relevant issue on HF Transformers](https://github.com/huggingface/transformers/issues/34882).
```
If your model is not in the above list, we will try to automatically convert the model using
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment