ModelZoo / Qwen3.5_vllm / Issues / #1

Closed
Open
Created Apr 13, 2026 by nqz15634117639@nqz15634117639

Error when starting Qwen3.5-122B-A10B-GPTQ-Int4

Launch command:

vllm serve /root/Qwen3.5-122B-A10B-GPTQ-Int4 --gpu-memory-utilization 0.95 --served-model-name qwen3.5-122b --host 0.0.0.0 --port 8001 --tensor-parallel-size 4 --max-model-len 32768 --dtype float16 --quantization gptq --enable-auto-tool-choice --tool-call-parser qwen3_coder --reasoning-parser qwen3 --default-chat-template-kwargs '{"enable_thinking": false}'

The error is as follows:

ERROR 04-11 18:54:40 [multiproc_executor.py:246] Worker proc VllmWorker-2 died unexpectedly, shutting down executor.
Process EngineCore_DP0:
Traceback (most recent call last):
  File "/usr/lib/python3.10/multiprocessing/process.py", line 314, in _bootstrap
    self.run()
  File "/usr/lib/python3.10/multiprocessing/process.py", line 108, in run
    self._target(*self._args, **self._kwargs)
  File "/usr/local/lib/python3.10/dist-packages/vllm/v1/engine/core.py", line 950, in run_engine_core
    raise e
  File "/usr/local/lib/python3.10/dist-packages/vllm/v1/engine/core.py", line 937, in run_engine_core
    engine_core = EngineCoreProc(*args, engine_index=dp_rank, **kwargs)
  File "/usr/local/lib/python3.10/dist-packages/vllm/v1/engine/core.py", line 691, in __init__
    super().__init__(
  File "/usr/local/lib/python3.10/dist-packages/vllm/v1/engine/core.py", line 112, in __init__
    num_gpu_blocks, num_cpu_blocks, kv_cache_config = self._initialize_kv_caches(
  File "/usr/local/lib/python3.10/dist-packages/vllm/v1/engine/core.py", line 242, in _initialize_kv_caches
    available_gpu_memory = self.model_executor.determine_available_memory()
  File "/usr/local/lib/python3.10/dist-packages/vllm/v1/executor/abstract.py", line 126, in determine_available_memory
    return self.collective_rpc("determine_available_memory")
  File "/usr/local/lib/python3.10/dist-packages/vllm/v1/executor/multiproc_executor.py", line 374, in collective_rpc
    return aggregate(get_response())
  File "/usr/local/lib/python3.10/dist-packages/vllm/v1/executor/multiproc_executor.py", line 357, in get_response
    raise RuntimeError(
RuntimeError: Worker failed with error 'name 'get_moe_triton_config_w4a16' is not defined', please check the stack trace above for the root cause
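
The final message is a Python NameError raised inside a worker: some module uses get_moe_triton_config_w4a16 without that name being bound in its scope. One way to narrow this down is to search the installed vllm sources for where the name is referenced and whether any def for it exists. The following is a minimal sketch using only the standard library; the helper names find_symbol and installed_package_dir are hypothetical and are not part of vLLM. It only detects plain def statements, so a name that should arrive through an import or an assignment will show up under references, not definitions.

```python
import importlib.util
import re
from pathlib import Path


def find_symbol(package_dir, symbol):
    """Scan .py files under package_dir for `symbol`.

    Returns (definitions, references); each entry is a
    (path, line_number, stripped_line) tuple.
    """
    def_pattern = re.compile(rf"^\s*(?:async\s+)?def\s+{re.escape(symbol)}\b")
    ref_pattern = re.compile(rf"\b{re.escape(symbol)}\b")
    definitions, references = [], []
    for path in sorted(Path(package_dir).rglob("*.py")):
        try:
            text = path.read_text(encoding="utf-8", errors="replace")
        except OSError:
            continue  # unreadable file: skip it
        for lineno, line in enumerate(text.splitlines(), start=1):
            if def_pattern.search(line):
                definitions.append((str(path), lineno, line.strip()))
            elif ref_pattern.search(line):
                references.append((str(path), lineno, line.strip()))
    return definitions, references


def installed_package_dir(name):
    """Locate a top-level package directory without importing it; None if absent."""
    spec = importlib.util.find_spec(name)
    if spec is None or not spec.submodule_search_locations:
        return None
    return Path(list(spec.submodule_search_locations)[0])


if __name__ == "__main__":
    pkg = installed_package_dir("vllm")
    if pkg is None:
        print("vllm is not installed in this environment")
    else:
        defs, refs = find_symbol(pkg, "get_moe_triton_config_w4a16")
        print(f"definitions found: {len(defs)}")
        for entry in defs:
            print("  def:", *entry)
        print(f"references found: {len(refs)}")
        for entry in refs:
            print("  ref:", *entry)
```

If the script reports references but no definition, the installed build most likely lacks the helper or the import that should provide it, which usually points to a version mismatch between the vllm package and the code path for this quantized MoE model. The worker's own stack trace, printed earlier in the log than the excerpt above, should name the exact file and line.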
