"...git@developer.sourcefind.cn:kecinstone/2024-pra-vllm.git" did not exist on "928de46888b9b257dfa491047a7d9cd199ca585b"
Support arbitrary output dtypes in PyT GEMM functions (#75)
* Deprecate fp32_output option for PyT linear layers Automatically detect dtype for user-provided output tensors. Signed-off-by:Tim Moon <tmoon@nvidia.com> * Remove deprecated options Signed-off-by:
Tim Moon <tmoon@nvidia.com> --------- Signed-off-by:
Tim Moon <tmoon@nvidia.com> Co-authored-by:
Kirthi Shankar Sivamani <ksivamani@nvidia.com>
Showing
Please register or sign in to comment