"llm/git@developer.sourcefind.cn:OpenDAS/ollama.git" did not exist on "b85982eb9138a36d6b17f7fa2b555dfd92da8738"
perf(inference): adjust batch ratio for GPU memory sizes
- Simplify batch ratio logic for GPU memory >= 16GB - Remove unnecessary conditions for 20GB and 40GB memory
Showing
Please register or sign in to comment