model: load non-repeated tensors into multiple backends
some tensors are expected to be used in repeating layers but are not themselves repeated. this change copies these tensors into the same backends as their repeating counterparts to minimize copying tensors between backends
Showing
Please register or sign in to comment