"tests/vscode:/vscode.git/clone" did not exist on "ade806b4601f520704bbdeab0c0545cb73e9e3c4"
feat: add triton kernels to decrease latency of large batches (#2687)
* feat: add triton kernels to decrease latency of large batches * cast to int32 * fix kernel * fix kernel * disable triton on rocm * fix speculation * add slots filtering kernel
Showing
Please register or sign in to comment