"tests/visual_bert/__init__.py" did not exist on "2fd28d43630e1bd6af978c23707f59363fde7e27"
HF <-> megatron checkpoint reshaping and conversion for GPT (#19317)
* HF <-> megatron checkpoint conversion handling reshaping from different tensor and parallel sizes * Apply suggestions from code review Co-authored-by:Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * addressing comments * add doc strings and
🐛 fixes Co-authored-by:Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Showing
This diff is collapsed.
Please register or sign in to comment