-
rasmith authored
[AMD] [Quantization] Add override flag for attention dtype instead of using kv_cache_dtype trigger (#17331) Signed-off-by:Randall Smith <Randall.Smith@amd.com>
c7ea0b56
[AMD] [Quantization] Add override flag for attention dtype instead of using kv_cache_dtype trigger (#17331)
Signed-off-by:
Randall Smith <Randall.Smith@amd.com>