Repository: OpenDAS / vllm_cscc
Commit f421f3ce
Authored Sep 10, 2024 by Alexey Kondratiev(AMD)
Committed by GitHub, Sep 10, 2024
[CI/Build] Enabling kernels tests for AMD, ignoring some of them that fail (#8130)
Parent: 8c054b7a
Showing 2 changed files with 24 additions and 1 deletion:
.buildkite/run-amd-test.sh (+23, -1)
.buildkite/test-pipeline.yaml (+1, -0)
.buildkite/run-amd-test.sh
...
@@ -71,13 +71,35 @@ mkdir -p ${HF_CACHE}
 HF_MOUNT="/root/.cache/huggingface"
+commands=$@
+echo "Commands:$commands"
+#ignore certain kernels tests
+if [[ $commands == *" kernels "* ]]; then
+  commands="${commands} \
+  --ignore=kernels/test_attention.py \
+  --ignore=kernels/test_attention_selector.py \
+  --ignore=kernels/test_blocksparse_attention.py \
+  --ignore=kernels/test_causal_conv1d.py \
+  --ignore=kernels/test_cutlass.py \
+  --ignore=kernels/test_encoder_decoder_attn.py \
+  --ignore=kernels/test_flash_attn.py \
+  --ignore=kernels/test_flashinfer.py \
+  --ignore=kernels/test_int8_quant.py \
+  --ignore=kernels/test_machete_gemm.py \
+  --ignore=kernels/test_mamba_ssm.py \
+  --ignore=kernels/test_marlin_gemm.py \
+  --ignore=kernels/test_prefix_prefill.py \
+  --ignore=kernels/test_rand.py \
+  --ignore=kernels/test_sampler.py"
+fi
 PARALLEL_JOB_COUNT=8
 # check if the command contains shard flag, we will run all shards in parallel because the host have 8 GPUs.
 if [[ $commands == *"--shard-id="* ]]; then
   for GPU in $(seq 0 $(($PARALLEL_JOB_COUNT-1))); do
     #replace shard arguments
-    commands=${@//"--shard-id= "/"--shard-id=${GPU} "}
+    commands=${commands//"--shard-id= "/"--shard-id=${GPU} "}
     commands=${commands//"--num-shards= "/"--num-shards=${PARALLEL_JOB_COUNT} "}
     echo "Shard ${GPU} commands:$commands"
     docker run \
         --device /dev/kfd --device /dev/dri \
         --network host \
...
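The two string operations this hunk relies on, a substring match with `[[ ... == *" kernels "* ]]` and placeholder filling with `${var//pattern/replacement}`, can be tried in isolation. The following is a minimal sketch, not the script itself: the function names `add_kernel_ignores` and `fill_shard_args`, the sample command string, and the shortened ignore list are all hypothetical and chosen only for illustration.

```shell
#!/usr/bin/env bash
# Illustrative sketch only; helper names are hypothetical and do not appear
# in .buildkite/run-amd-test.sh.

# Append an --ignore flag only when the command mentions the kernels
# directory as a separate, space-delimited word.
add_kernel_ignores() {
  local cmd="$1"
  if [[ $cmd == *" kernels "* ]]; then
    cmd="${cmd} --ignore=kernels/test_attention.py"
  fi
  printf '%s' "$cmd"
}

# Fill the empty "--shard-id= " and "--num-shards= " placeholders for one GPU.
# The trailing space inside each pattern is significant: only empty
# placeholders followed by a space are rewritten.
fill_shard_args() {
  local cmd="$1" gpu="$2" total="$3"
  cmd=${cmd//"--shard-id= "/"--shard-id=${gpu} "}
  cmd=${cmd//"--num-shards= "/"--num-shards=${total} "}
  printf '%s' "$cmd"
}

# Usage: build one command line per shard from a single template.
template="pytest -v -s kernels --shard-id= --num-shards= "
template=$(add_kernel_ignores "$template")
total=2
for ((gpu = 0; gpu < total; gpu++)); do
  echo "Shard ${gpu}: $(fill_shard_args "$template" "$gpu" "$total")"
done
```

Because the match pattern is `" kernels "` with a space on each side, a command that ends in `kernels` with no trailing space would not pick up the ignore flags in this sketch.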
.buildkite/test-pipeline.yaml
...
@@ -228,6 +228,7 @@ steps:
   parallelism: 4
 - label: Kernels Test %N # 30min each
+  mirror_hardwares: [amd]
   source_file_dependencies:
   - csrc/
   - vllm/attention
...