- 21 Nov, 2024 1 commit
OlivierDehaene authored
* feat: add payload limit
* update launcher
- 04 Nov, 2024 1 commit
Nicolas Patry authored
- 25 Oct, 2024 1 commit
OlivierDehaene authored
* feat: add triton kernels to decrease latency of large batches
* cast to int32
* fix kernel
* fix kernel
* disable triton on rocm
* fix speculation
* add slots filtering kernel