[Model Bash] DeepSeek R1 BF16 Min Latency QKV A GEMM (0.5% E2E Speedup) (#34758)
Signed-off-by: Robert Shaw <robshaw@redhat.com>
Co-authored-by: Robert Shaw <robshaw@redhat.com>
csrc/dsv3_fused_a_gemm.cu
0 → 100644