[Dev] Update linear attention examples to enhance performance on Hopper GPUs (#621)
* Tune linear attention examples on H100 * Add retnet fwd kernel * fix lint
Showing
Please register or sign in to comment
* Tune linear attention examples on H100 * Add retnet fwd kernel * fix lint