Lianmin Zheng authored
Support penalty in overlap mode; return logprob with chunked prefill; improve benchmark scripts (#3988)

Co-authored-by: SangBin Cho <rkooo567@gmail.com>
Co-authored-by: dhou-xai <dhou@x.ai>
Co-authored-by: Hanming Lu <hanming_lu@berkeley.edu>
ac238727