fix(server): use mem_get_info to get kv cache size (#664)
Close https://github.com/huggingface/text-generation-inference/issues/649 Close https://github.com/huggingface/text-generation-inference/issues/651 Close https://github.com/huggingface/text-generation-inference/issues/653 Close #636
Showing
Please register or sign in to comment