fix(kv-cache): increase hybrid attention grouping threshold from 1.25 to 1.5 (#36684)
Signed-off-by:
Jaime Campos Salas <jaime.campos.salas@gmail.com>
Showing
Please register or sign in to comment
Signed-off-by:
Jaime Campos Salas <jaime.campos.salas@gmail.com>