"tests/models/vscode:/vscode.git/clone" did not exist on "a3345c1f1333fb6d751826477395fc922cde43e5"
MixtralSparseMoeBlock: add gate jitter (#29865)
This commit adds gate jitter to MixtralSparseMoeBlock's input data before passing it through the MoE layer, if turned on.
Showing
Please register or sign in to comment