Commit 56af8df3 authored by Sourab Mangrulkar, committed by GitHub

HF <-> megatron checkpoint reshaping and conversion for GPT (#19317)



* HF <-> megatron checkpoint conversion handling reshaping from different tensor and parallel sizes

* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* addressing comments

* add doc strings and 🐛 fixes
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
parent 41ec5d0c