perf(inference): optimize batch processing for different GPU memory sizes
- Set NPUDTCompile to false for better performance on NPU - Adjust batch ratio
Showing
Please register or sign in to comment
- Set NPUDTCompile to false for better performance on NPU - Adjust batch ratio