"git@developer.sourcefind.cn:kecinstone/2024-pra-vllm.git" did not exist on "d7afab6d3af84c18ecb9cbc478842e3bf62af906"
[PyTorch] Do not allocate FP8 workspace buffers when params are FP8 (#647)
Do not allocate FP8 workspace buffers when params are FP8
Signed-off-by:
Tim Moon <tmoon@nvidia.com>
Showing
Please register or sign in to comment