[W8A8 Block Linear Refactor][1/N] Keep all quantization types into `QuantFP8` class. (#33047)
Signed-off-by:maral <maralbahari.98@gmail.com> Co-authored-by:
Cyrus Leung <tlleungac@connect.ust.hk>
Showing
Please register or sign in to comment