-
Nicolas Patry authored
Should be more robust to shared tensors (ok when using `from_pretrained). But forcing us to add new checks in our loading code (since the chosen key to keep might be different from `transformers`). --------- Co-authored-by:Ubuntu <ubuntu@ip-172-31-41-161.ec2.internal>
49b4b33e