-
Lingfan Yu authored
[Neuron][Kernel] Support Longer Sequences in NKI-based Flash PagedAttention and Improve Efficiency (#12921) Signed-off-by:Lingfan Yu <lingfany@amazon.com>
e92694b6
[Neuron][Kernel] Support Longer Sequences in NKI-based Flash PagedAttention and Improve Efficiency (#12921)
Signed-off-by:
Lingfan Yu <lingfany@amazon.com>