tests/cpp/operator/test_cast_float8blockwise.cu · 3e38a2eae276c7ffe4c4f35ded651de3a08f140e · OpenDAS / TransformerEngine

[Workaround] Use bf16 lds to save fp32 input · 3e38a2ea

wenjh authored Jun 06, 2025



quantize_transpose_vector_blockwise function use lds exceeding 64kb when
input type is fp32. But max size of lds in dcu is 64kb, thus we use lds
as bfp16 for workaround.
Signed-off-by: wenjh <wenjh@sugon.com>

3e38a2ea

test_cast_float8blockwise.cu 26.6 KB

Replace test_cast_float8blockwise.cu