Report first-token-latency and token-latency percentiles (#736)
* update profile scripts
* add top_p, top_k and temperature as input arguments
* fix input_ids
* update profile_throughput
* update profile_restful_api
* update profile_serving
* update
* update
* add progress bar
* remove TODO comments
* update
* remove useless profile_* argument
* remove log level
* change concurrency default value to 64
* update restful_api.md
* update according to review comments
* fix docstring
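
For readers unfamiliar with the metrics named in the title, the following is a minimal sketch of how first-token latency and per-token latency percentiles could be derived from recorded timestamps. It is not taken from the profile scripts in this commit: the function names `percentile` and `latency_report`, the input layout, and the choice of linear interpolation are all assumptions made for illustration.

```python
import math


def percentile(values, q):
    """Return the q-th percentile (q in [0, 100]) using linear interpolation.

    Hypothetical helper for illustration, not part of the profile scripts.
    """
    if not values:
        raise ValueError("values must be non-empty")
    ordered = sorted(values)
    # Fractional index of the requested percentile in the sorted sample.
    pos = (len(ordered) - 1) * q / 100.0
    lo = math.floor(pos)
    hi = math.ceil(pos)
    return ordered[lo] + (ordered[hi] - ordered[lo]) * (pos - lo)


def latency_report(requests, percentiles=(50, 75, 95, 99)):
    """Summarise latencies for a list of (start_time, token_arrival_times).

    Assumed input layout: each request is a tuple of the send time and the
    list of times at which each generated token arrived, all in seconds.
    """
    first_token = []  # time from request start to the first token
    per_token = []    # gaps between consecutive tokens
    for start, arrivals in requests:
        if not arrivals:
            continue  # request produced no tokens, nothing to measure
        first_token.append(arrivals[0] - start)
        per_token.extend(b - a for a, b in zip(arrivals, arrivals[1:]))

    def summarise(samples):
        # An empty sample set yields an empty summary instead of an error.
        return {p: percentile(samples, p) for p in percentiles} if samples else {}

    return {
        "first_token_latency": summarise(first_token),
        "token_latency": summarise(per_token),
    }


if __name__ == "__main__":
    demo = [(0.0, [0.1, 0.2, 0.3]), (1.0, [1.3, 1.4])]
    for name, stats in latency_report(demo).items():
        print(name, stats)
```

Separating first-token latency from the gaps between later tokens keeps the cost of prompt processing from being averaged into the steady-state decoding speed, which is presumably why the two are reported separately.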