"googlemock/include/gmock/vscode:/vscode.git/clone" did not exist on "a3c0dd0f4d58e6c01a1432fdc69a9aff937309a9"
[JAX] Use 1x quantization + jax transpose for performance for tensor-scaling (#1830)
* Use 1x quantization + jax transpose on BW for performance Signed-off-by:Jeremy Berchtold <jberchtold@nvidia.com> * Use 1x quantization on Hopper as well as it is also faster Signed-off-by:
Jeremy Berchtold <jberchtold@nvidia.com> * Undo architecture check helper function Signed-off-by:
Jeremy Berchtold <jberchtold@nvidia.com> * Lint Signed-off-by:
Jeremy Berchtold <jberchtold@nvidia.com> --------- Signed-off-by:
Jeremy Berchtold <jberchtold@nvidia.com>
Showing
Please register or sign in to comment