Support faster JSON decoding for llava (#137)
When sending fast-forwarded requests to model_rpc, recalculate `pad_input_ids`.