Softcapping for gemma2. (#2273)
* Softcapping for gemma2.
* Less clutter.
* No access to transformers config, only config_dict here.
* 0.0 is the null value in the C++ API.
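
For readers unfamiliar with the term, the following is a minimal sketch of tanh-based logit softcapping, written under assumptions and not taken from this commit's diff. The function names `softcap` and `softcap_from_config` are hypothetical, and the key `attn_logit_softcapping` is assumed to be the entry found in a Gemma 2 style config dict. The only detail borrowed from the commit message is the convention that `0.0` means "no softcapping".

```python
import math


def softcap(logits, cap=0.0):
    """Squash each logit into (-cap, cap) using cap * tanh(x / cap).

    Hypothetical helper: cap == 0.0 is treated as "disabled", mirroring
    the null-value convention mentioned in the commit message.
    """
    if cap == 0.0:
        return list(logits)
    return [cap * math.tanh(x / cap) for x in logits]


def softcap_from_config(config_dict, key="attn_logit_softcapping"):
    """Read the cap from a plain dict (assumed key name).

    Returns 0.0 when the key is missing or set to None, so callers can
    pass the result straight through without a separate "enabled" flag.
    """
    value = config_dict.get(key)
    return 0.0 if value is None else float(value)


if __name__ == "__main__":
    cap = softcap_from_config({"attn_logit_softcapping": 50.0})
    print(softcap([0.0, 10.0, 1000.0], cap))
```

Using a single float with `0.0` as the sentinel keeps the call signature uniform for models that do not use softcapping, at the cost of not being able to express a genuine cap of zero (which would be meaningless anyway).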