[CK_TILE] naive attn (#1708)
* add reference attention fwd * refactor addresser * update * paged, and i8 reflect-quant * lets call it forward-quant * fix error in decode variation * update naive-attn * fix page table * fix build err
Showing
This diff is collapsed.
Please register or sign in to comment