Merge pull request #867 from jph00/patch-2
Avoid double-quantizing when calling `cuda()`