Hybrid kv cache for LLaMA4 (#6563)
Co-authored-by: Cheng Wan <54331508+ch-wan@users.noreply.github.com>
Co-authored-by: tarinkk <rt572@physics.rutger.edu>
Co-authored-by: tarinkk <rt572@rutgers.physics.edu>
Co-authored-by: Hanming Lu <69857889+hanming-lu@users.noreply.github.com>