Commit 56af8df3 authored by Sourab Mangrulkar, committed by GitHub

HF <-> megatron checkpoint reshaping and conversion for GPT (#19317)



* HF <-> megatron checkpoint conversion handling reshaping from different tensor and pipeline parallel sizes

* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* addressing comments

* add doc strings and 🐛 fixes
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
parent 41ec5d0c