Commit 0e6e6b31 authored by zhouxiang's avatar zhouxiang
Browse files

优化int8生成速度,优化int8精度下大token数输入时性能不佳的问题,优化int8多batch效果

parent 4536fa79
Markdown is supported
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment