feat: add KV cache transfer estimated latency metric for disaggregated serving (#7590)
Signed-off-by: Jont828 <jt572@cornell.edu>
Co-authored-by: Hongkuan Zhou <tedzhouhk@gmail.com>