[WIP] Add support for Mistral-Nemo by supporting head_dim through config (#2254)
* Support passing head_dim through config
* Using `head_dim` as a fallback is necessary since it's a non-standard
key in `MistralConfig` (as defined in transformers).
* Shorter diff.
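
A minimal sketch of the lookup described above, not the actual diff: read `head_dim` from the config when the key is present, otherwise derive it from `hidden_size // num_attention_heads`. The helper name `resolve_head_dim` and the `SimpleNamespace` stand-ins for a config object are hypothetical, and the numeric values are illustrative only.

```python
from types import SimpleNamespace


def resolve_head_dim(config):
    """Return the per-head dimension for an attention layer (hypothetical helper)."""
    # `head_dim` is not guaranteed to exist on the config, so read it defensively.
    head_dim = getattr(config, "head_dim", None)
    if head_dim is None:
        # No explicit value: derive it the conventional way.
        head_dim = config.hidden_size // config.num_attention_heads
    return head_dim


# Illustrative config with an explicit head_dim that differs from
# hidden_size // num_attention_heads (5120 // 32 == 160).
explicit = SimpleNamespace(hidden_size=5120, num_attention_heads=32, head_dim=128)

# Illustrative config without the key: the derived value is used.
derived = SimpleNamespace(hidden_size=4096, num_attention_heads=32)

print(resolve_head_dim(explicit))  # 128
print(resolve_head_dim(derived))   # 128
```

The point of the explicit key is the first case: deriving the value from `hidden_size` alone would give 160 there, so attention projections sized from the derived value would not match the checkpoint.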
---------
Co-authored-by: Nicolas Patry <patry.nicolas@protonmail.com>