Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in
Toggle navigation
Menu
Open sidebar
OpenDAS
vllm_cscc
Commits
d54a18a4
Unverified
Commit
d54a18a4
authored
Nov 14, 2025
by
Michael Goin
Committed by
GitHub
Nov 14, 2025
Browse files
[CI][CPU] Smoke test for Apple Silicon using GHA MacOS runner (#28688)
Signed-off-by:
mgoin
<
mgoin64@gmail.com
>
parent
5f3cd7f7
Changes
1
Hide whitespace changes
Inline
Side-by-side
Showing
1 changed file
with
73 additions
and
0 deletions
+73
-0
.github/workflows/macos-smoke-test.yml
.github/workflows/macos-smoke-test.yml
+73
-0
No files found.
.github/workflows/macos-smoke-test.yml
0 → 100644
View file @
d54a18a4
name
:
macOS Apple Silicon Smoke Test
on
:
workflow_dispatch
:
# Manual trigger
jobs
:
macos-m1-smoke-test
:
runs-on
:
macos-latest
timeout-minutes
:
20
steps
:
-
uses
:
actions/checkout@v4
-
uses
:
astral-sh/setup-uv@v4
with
:
enable-cache
:
true
python-version
:
'
3.12'
-
name
:
Install dependencies
run
:
|
uv pip install -r requirements/cpu-build.txt
uv pip install -r requirements/cpu.txt
-
name
:
Build vLLM
run
:
uv pip install -v -e .
env
:
CMAKE_BUILD_PARALLEL_LEVEL
:
4
-
name
:
Verify installation
run
:
|
python -c "import vllm; print(f'vLLM version: {vllm.__version__}')"
python -c "import torch; print(f'PyTorch: {torch.__version__}')"
-
name
:
Smoke test vllm serve
timeout-minutes
:
10
run
:
|
# Start server in background
vllm serve Qwen/Qwen3-0.6B \
--max-model-len=2048 \
--load-format=dummy \
--enforce-eager \
--port 8000 &
SERVER_PID=$!
# Wait for server to start
for i in {1..30}; do
if curl -s http://localhost:8000/health > /dev/null; then
echo "Server started successfully"
break
fi
if [ "$i" -eq 30 ]; then
echo "Server failed to start"
kill "$SERVER_PID"
exit 1
fi
sleep 2
done
# Test health endpoint
curl -f http://localhost:8000/health
# Test completion
curl -f http://localhost:8000/v1/completions \
-H "Content-Type: application/json" \
-d '{
"model": "Qwen/Qwen3-0.6B",
"prompt": "Hello",
"max_tokens": 5
}'
# Cleanup
kill "$SERVER_PID"
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment