perf(inference): adjust batch ratio for high GPU memory
- Increase batch ratio to 8 for GPU memory >=16GB - Improve inference performance on systems with higher GPU memory
Showing
Please register or sign in to comment
- Increase batch ratio to 8 for GPU memory >=16GB - Improve inference performance on systems with higher GPU memory