feat: Add FP4 (E2M1) KV Cache Support with Quantization Utilities for MLA (#10078)
Signed-off-by: Ho-Ren (Jack) Chuang <horenchuang@bytedance.com>
Co-authored-by: Yichen Wang <yichen.wang@bytedance.com>