Improve support for num_recycle=0.
Previously, setting num_recycle=0 in a pretrained recycling model skipped creating some Linears and LayerNorms. As a result, the offsets of these modules were not applied, which degraded performance in that case.

PiperOrigin-RevId: 429075328
Change-Id: I9257f859521799f45e2deef3803c249311051225
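
A minimal sketch of the failure mode described above, in plain Python rather than the model's actual code. All names here (`embed`, `always_apply_recycle_modules`, `params`) are hypothetical and chosen only for illustration. It assumes the recycling modules receive an all-zero "previous output" on the first pass, so a LayerNorm on that input reduces to its learned offset.

```python
import math


def layer_norm(x, scale, offset, eps=1e-5):
    """Plain LayerNorm over a list of floats with learned scale and offset."""
    mean = sum(x) / len(x)
    var = sum((v - mean) ** 2 for v in x) / len(x)
    return [s * (v - mean) / math.sqrt(var + eps) + o
            for v, s, o in zip(x, scale, offset)]


def embed(features, params, num_recycle, always_apply_recycle_modules):
    """Hypothetical embedding step; not the real model code.

    With always_apply_recycle_modules=False, the recycling LayerNorm is
    skipped when num_recycle == 0, so its pretrained offset is never added.
    """
    act = list(features)
    if num_recycle > 0 or always_apply_recycle_modules:
        # Assumption: on the first pass there is no previous output, so zeros.
        prev = [0.0] * len(act)
        update = layer_norm(prev, params["scale"], params["offset"])
        act = [a + u for a, u in zip(act, update)]
    return act


# Hypothetical "pretrained" parameters with a non-zero offset.
params = {"scale": [1.0, 1.0], "offset": [0.5, -0.25]}
features = [1.0, 2.0]

old = embed(features, params, num_recycle=0, always_apply_recycle_modules=False)
new = embed(features, params, num_recycle=0, always_apply_recycle_modules=True)
print(old)  # offset missing
print(new)  # offset applied, matching what the pretrained weights expect
```

In this sketch the two calls differ only by the pretrained offset, which is the kind of mismatch the commit message attributes to the skipped modules.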