"test/srt/test_wave_attention_backend.py" did not exist on "77a3954bf7313ed23eda2a857c8840d3b88608a2"
Support penalty in overlap mode; return logprob with chunked prefill; improve...
Support penalty in overlap mode; return logprob with chunked prefill; improve benchmark scripts (#3988) Co-authored-by:SangBin Cho <rkooo567@gmail.com> Co-authored-by:
dhou-xai <dhou@x.ai> Co-authored-by:
Hanming Lu <hanming_lu@berkeley.edu>
Showing
Please register or sign in to comment