Commit 504a12b8 authored by zhuwenwen's avatar zhuwenwen
Browse files

add kvcache fp8

parent d074a953
This diff is collapsed.
This diff is collapsed.
......@@ -172,7 +172,7 @@ STR_DTYPE_TO_TORCH_DTYPE = {
"half": torch.half,
"bfloat16": torch.bfloat16,
"float": torch.float,
# "fp8": torch.uint8,
"fp8": torch.uint8,
# "fp8_e4m3": torch.uint8,
# "fp8_e5m2": torch.uint8,
"int8": torch.int8,
......
Markdown is supported
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment