[Bugfix][Platform][CPU] Fix cuda platform detection on CPU backend edge case (#13358)

Signed-off-by: Isotr0py <2037008807@qq.com>
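
For reference, the edge case named in the title can be sketched in isolation. This is a minimal illustration, not the code in vllm/platforms/__init__.py: the helper names `should_activate_cuda` and `installed_version` are hypothetical, the version strings in the usage note are made up, and it assumes a CPU build of vllm reports a version string containing "cpu" (which is what the patched condition checks for).

```python
from importlib.metadata import PackageNotFoundError, version


def should_activate_cuda(device_count: int, vllm_version: str) -> bool:
    """Hypothetical helper mirroring the patched condition.

    The CUDA plugin is selected only when NVML reports at least one
    device AND the installed vllm version string does not contain "cpu".
    """
    return device_count > 0 and "cpu" not in vllm_version


def installed_version(dist_name: str = "vllm") -> str:
    """Return the installed version of dist_name, or "" if not installed.

    The empty-string fallback is an assumption of this sketch; the patch
    itself relies on the surrounding try/except instead.
    """
    try:
        return version(dist_name)
    except PackageNotFoundError:
        return ""
```

With these helpers, `should_activate_cuda(2, "0.7.2+cpu")` is False on a GPU machine with a CPU build, while `should_activate_cuda(2, "0.7.2")` is True.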

Commit d67cc21b, authored Feb 17, 2025 by Isotr0py, committed by GitHub Feb 16, 2025

Showing with 9 additions and 2 deletions

vllm/platforms/__init__.py +9 -2

--- a/vllm/platforms/__init__.py
+++ b/vllm/platforms/__init__.py
@@ -33,12 +33,19 @@ def cuda_platform_plugin() -> Optional[str]:
    is_cuda = False
    try:
+        from importlib.metadata import version
        from vllm.utils import import_pynvml
        pynvml = import_pynvml()
        pynvml.nvmlInit()
        try:
-            if pynvml.nvmlDeviceGetCount() > 0:
-                is_cuda = True
+            # NOTE: Edge case: vllm cpu build on a GPU machine.
+            # Third-party pynvml can be imported in cpu build,
+            # we need to check if vllm is built with cpu too.
+            # Otherwise, vllm will always activate cuda plugin
+            # on a GPU machine, even if in a cpu build.
+            is_cuda = (pynvml.nvmlDeviceGetCount() > 0
+                       and "cpu" not in version("vllm"))
        finally:
            pynvml.nvmlShutdown()
    except Exception as e: