"...text-generation-inference.git" did not exist on "d0e30771c2995360fcfcdc5099d4a45284499e2f"
Fix mutable proj_out weight in the Attention layer (#73)
* Catch unused params in DDP * Fix proj_out, add test
Showing
Please register or sign in to comment