Unverified Commit 1994de99 authored by Huamin Li's avatar Huamin Li Committed by GitHub
Browse files

[CI Failure] Fix test_kv_cache_model_load_and_run (#27717)


Signed-off-by: default avatarHuamin Li <3ericli@gmail.com>
parent 4464723f
...@@ -49,7 +49,18 @@ def test_model_load_and_run( ...@@ -49,7 +49,18 @@ def test_model_load_and_run(
KV_CACHE_MODELS = [ KV_CACHE_MODELS = [
# AutoFP8 format using separate .k_scale and .v_scale # AutoFP8 format using separate .k_scale and .v_scale
"nm-testing/Qwen2-1.5B-Instruct-FP8-K-V", # The original checkpoint below was removed from the Hub. To unblock CI and
# until a small replacement with split K/V scales is found, skip this case.
# See PR #27717 for context.
pytest.param(
"nm-testing/Qwen2-1.5B-Instruct-FP8-K-V",
marks=pytest.mark.skip(
reason=(
"Checkpoint removed from HF; temporarily disabling this "
"AutoFP8 split K/V case (PR #27717)."
)
),
),
] ]
......
Markdown is supported
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment