- 20 May, 2024 1 commit
-
-
Longjie Zheng authored
* first version * fix sliding window * fix style * add sliding window cache * fix style * address comments * fix test * fix style * move sliding window check inside cache init * revert changes on irrelevant files & add comment on SlidingWindowCache * address comments & fix style fix style * update causal mask * [run-slow] mistral * [run-slow] mistral * [run-slow] mistral * [run-slow] mistral * [run-slow] mistral * [run-slow] llama * [run-slow] mistral * [run-slow] mistral * [run-slow] mistral * revert CI from a10 to t4 * wrap up
-
- 30 Apr, 2024 1 commit
-
-
Joao Gante authored
-
- 22 Apr, 2024 1 commit
-
-
Steven Liu authored
* first draft * feedback * static cache snippet * feedback * feedback
-