1. remove the weight broadcast in the constructor 2. disable unnecessary allreduces for clip-after-ar
Attach a file by drag & drop or click to upload