[V1][Spec Decode] Optimize Rejection Sampler with Triton Kernels (#14930)
Signed-off-by:
Woosuk Kwon <woosuk.kwon@berkeley.edu>
Showing
vllm/v1/sample/ops/utils.py
0 → 100644
Please register or sign in to comment
Signed-off-by:
Woosuk Kwon <woosuk.kwon@berkeley.edu>