"src/transformers/models/persimmon/modeling_persimmon.py" did not exist on "015f8e110d270a0ad42de4ae5b98198d69eb1964"
Matt authored
* hidden layers, huh, what are they good for (absolutely nothing)
* Some tests break with 1 hidden layer, use 2
* Use 1 hidden layer in a few slow models
* Use num_hidden_layers=2 everywhere
* Slightly higher tol for groupvit
* Slightly higher tol for groupvit
134caef3