• Daniel Hiltgen's avatar
    Verify permissions for AMD GPU (#6736) · 9246e6dd
    Daniel Hiltgen authored
    This adds back a check which was lost many releases back to verify /dev/kfd permissions
    which when lacking, can lead to confusing failure modes of:
      "rocBLAS error: Could not initialize Tensile host: No devices found"
    
    This implementation does not hard fail the serve command but instead will fall back to CPU
    with an error log.  In the future we can include this in the GPU discovery UX to show
    detected but unsupported devices we discovered.
    9246e6dd
amd_linux.go 15.4 KB