HF <-> megatron checkpoint reshaping and conversion for GPT (#19317)
* HF <-> megatron checkpoint conversion handling reshaping from different tensor and parallel sizes
* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* addressing comments
* add doc strings and 🐛 fixes
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>