perf(inference): adjust batch ratio for GPU memory sizes
- Remove separate condition for GPU memory >= 24GB - Simplify logic to use a single threshold of 16GB
Showing
Please register or sign in to comment
- Remove separate condition for GPU memory >= 24GB - Simplify logic to use a single threshold of 16GB