Support FP8 Quantization and Inference Run on Intel Gaudi (HPU) using INC...
Support FP8 Quantization and Inference Run on Intel Gaudi (HPU) using INC (Intel Neural Compressor) (#12010) Signed-off-by:Nir David <ndavid@habana.ai> Signed-off-by:
Uri Livne <ulivne@habana.ai> Co-authored-by:
Uri Livne <ulivne@habana.ai>
Showing
Please register or sign in to comment