- 19 Nov, 2025 1 commit
Michael Yang authored
CUDA panics on batches larger than 1024, so skip those and fall back to CPU
-
- 12 Nov, 2025 1 commit
Daniel Hiltgen authored
This should be reverted once we update ggml past b6897