OpenDAS / vllm_cscc
Commit 3a6ae1d3 (Unverified)
Authored Jun 05, 2024 by Simon Mo; committed by GitHub on Jun 05, 2024
[CI] Disable flash_attn backend for spec decode (#5286)
Parent: 8f1729b8
Showing 1 changed file with 5 additions and 2 deletions
.buildkite/test-pipeline.yaml (+5 -2)
@@ -45,7 +45,7 @@ steps:
   - TEST_DIST_MODEL=meta-llama/Llama-2-7b-hf DISTRIBUTED_EXECUTOR_BACKEND=mp pytest -v -s distributed/test_basic_distributed_correctness.py
   - TEST_DIST_MODEL=facebook/opt-125m DISTRIBUTED_EXECUTOR_BACKEND=mp pytest -v -s distributed/test_chunked_prefill_distributed.py
   - TEST_DIST_MODEL=meta-llama/Llama-2-7b-hf DISTRIBUTED_EXECUTOR_BACKEND=mp pytest -v -s distributed/test_chunked_prefill_distributed.py
   - pytest -v -s spec_decode/e2e/test_integration_dist.py
 - label: Distributed Tests (Multiple Groups)
   #mirror_hardwares: [amd]
@@ -124,7 +124,10 @@ steps:
 - label: Speculative decoding tests
   #mirror_hardwares: [amd]
-  command: pytest -v -s spec_decode
+  commands:
+    # See https://github.com/vllm-project/vllm/issues/5152
+    - export VLLM_ATTENTION_BACKEND=XFORMERS
+    - pytest -v -s spec_decode
 - label: LoRA Test %N
   #mirror_hardwares: [amd]
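The speculative decoding step above exports VLLM_ATTENTION_BACKEND=XFORMERS before invoking pytest. For running the same step outside Buildkite, a minimal sketch follows. It assumes a Python wrapper is acceptable; `run_with_backend` is a hypothetical helper for illustration and is not part of vLLM or of this commit. It sets the variable for the child process only, so the parent shell environment stays unchanged.

```python
import os
import subprocess
import sys


def run_with_backend(cmd, backend="XFORMERS"):
    """Run cmd with VLLM_ATTENTION_BACKEND set in the child's environment.

    Hypothetical helper: mirrors `export VLLM_ATTENTION_BACKEND=...`
    followed by the command, without modifying os.environ in the parent.
    """
    env = dict(os.environ)  # copy, so the parent environment is untouched
    env["VLLM_ATTENTION_BACKEND"] = backend
    # Output is captured so the caller can inspect stdout/stderr afterwards.
    return subprocess.run(cmd, env=env, capture_output=True, text=True)
```

Calling `run_with_backend([sys.executable, "-m", "pytest", "-v", "-s", "spec_decode"])` from the tests directory would approximate the CI step, assuming pytest and vLLM are installed in that environment.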