[AMD] [Quantization] Add override flag for attention dtype instead of using...
[AMD] [Quantization] Add override flag for attention dtype instead of using kv_cache_dtype trigger (#17331)
Signed-off-by:
Randall Smith <Randall.Smith@amd.com>
Showing
Please register or sign in to comment