Support Llama4 fp8 inference (#5194)
Co-authored-by:laixinn <xielx@shanghaitech.edu.cn> Co-authored-by:
sleepcoo <sleepcoo@gmail.com> Co-authored-by:
zhyncs <me@zhyncs.com>
Showing
Please register or sign in to comment
Co-authored-by:laixinn <xielx@shanghaitech.edu.cn> Co-authored-by:
sleepcoo <sleepcoo@gmail.com> Co-authored-by:
zhyncs <me@zhyncs.com>