• Daniel Hiltgen's avatar
    Disable concurrency for AMD + Windows · 9929751c
    Daniel Hiltgen authored
    Until ROCm v6.2 ships, we wont be able to get accurate free memory
    reporting on windows, which makes automatic concurrency too risky.
    Users can still opt-in but will need to pay attention to model sizes otherwise they may thrash/page VRAM or cause OOM crashes.
    All other platforms and GPUs have accurate VRAM reporting wired
    up now, so we can turn on concurrency by default.
    9929751c
types.go 3.61 KB