- 20 Jul, 2024 1 commit
-
-
Cyrus Leung authored
-
- 17 Jul, 2024 1 commit
-
-
Cody Yu authored
-
- 08 Jul, 2024 1 commit
-
-
afeldman-nm authored
[Kernel] Correctly invoke prefill & decode kernels for cross-attention (towards eventual encoder/decoder model support) (#4888) Co-authored-by:Woosuk Kwon <woosuk.kwon@berkeley.edu>
-