"server/text_generation_server/cli.py" did not exist on "0fbc69194694b60badae3bf643bc76985f69c0f4"
Report first-token-latency and token-latency percentiles (#736)
* update profile scripts * add top_p, top_k and temperature as input arguments * fix input_ids * update profile_throughput * update profile_restful_api * update profile_serving * update * update * add progress bar * remove TODO comments * update * remove useless profile_* argument * remove log level * change concurrency default value to 64 * update restful_api.md * update according to review comments * fix docstring
Showing
Please register or sign in to comment