OpenDAS / vllm_cscc
Commit f2b20fe4 (Unverified)
Authored Feb 14, 2025 by Harry Mellor; committed by GitHub Feb 13, 2025
Consolidate Llama model usage in tests (#13094)
Parent: 40932d7a
Changes: 22
Showing 2 changed files with 2 additions and 2 deletions (+2 -2)
tests/v1/sample/test_logprobs.py (+1 -1)
tests/v1/sample/test_logprobs_e2e.py (+1 -1)
tests/v1/sample/test_logprobs.py
...
@@ -14,7 +14,7 @@ from vllm import SamplingParams
 from ...conftest import VllmRunner
-MODEL = "meta-llama/Llama-3.2-1B"
+MODEL = "meta-llama/Llama-3.2-1B-Instruct"
 DTYPE = "half"
...
tests/v1/sample/test_logprobs_e2e.py
...
@@ -11,7 +11,7 @@ RTOL = 0.03
 EXPECTED_VALUE = 0.62
 # FIXME(rob): enable prefix caching once supported.
-MODEL = "meta-llama/Llama-3.2-1B"
+MODEL = "meta-llama/Llama-3.2-1B-Instruct"
 MODEL_ARGS = f"pretrained={MODEL},enforce_eager=True,enable_prefix_caching=False"  # noqa: E501
 SERVER_ARGS = [
     "--enforce_eager", "--no_enable_prefix_caching", "--disable-log-requests"
...
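Because MODEL_ARGS in tests/v1/sample/test_logprobs_e2e.py is built from MODEL with an f-string, the one-line change to MODEL also changes the "pretrained=" entry. The sketch below shows the resulting string. The parse_model_args helper is hypothetical, written only for illustration and not part of the commit or of vLLM; it assumes the string is a plain comma-separated list of key=value pairs.

```python
# Value of MODEL after this commit (taken from the diff above).
MODEL = "meta-llama/Llama-3.2-1B-Instruct"
MODEL_ARGS = f"pretrained={MODEL},enforce_eager=True,enable_prefix_caching=False"  # noqa: E501


def parse_model_args(model_args: str) -> dict[str, str]:
    """Hypothetical helper (not in the commit): split 'k=v,k=v' into a dict.

    Values are kept as strings; no type conversion is attempted.
    """
    return dict(pair.split("=", 1) for pair in model_args.split(","))


parsed = parse_model_args(MODEL_ARGS)
print(parsed["pretrained"])
```

The point of the sketch is only that the model name is defined once and reused, so the diff does not need to touch the MODEL_ARGS line itself.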