Fix LLaMa beam search when using parallelize (#24224)
* Fix LLaMa beam search when using parallelize same issue as T5 #11717 * fix code format in modeling_llama.py * fix format of _reorder_cache in modeling_llama.py
Showing
Please register or sign in to comment