Add `DenseMoELayer` and wire it up in Mixtral/Deepseek V2 (#2537)
This replaces the custom layers in both models.