Merge pull request #866 from InfiniTensor/issue/848_new
issue/848: add paged attention prefill for nvidia gpu with test pass
Showing
Please register or sign in to comment
issue/848: add paged attention prefill for nvidia gpu with test pass