[fix] Fix mxfp4 weight loading bug with TP sharding in GPT-OSS (#9433)
Signed-off-by:Hao Lu <14827759+hlu1@users.noreply.github.com> Signed-off-by:
Xinyuan Tong <xinyuantong.cs@gmail.com> Co-authored-by:
Xinyuan Tong <xinyuantong.cs@gmail.com>
Showing
Please register or sign in to comment