bench_per_token_quant_fp8.py 8 KB