[Feature] decode-only forward pass (#153)
* decode only forward pass * fix lint * batch embedding
Showing
lmdeploy/turbomind/decode.py
0 → 100644
Please register or sign in to comment
* decode only forward pass * fix lint * batch embedding