@@ -51,7 +51,7 @@ Test Data: The number of input tokens is 1, and the number of generated tokens i
The throughput of TurboMind exceeds 2000 tokens/s, which is about 5% - 15% higher than DeepSpeed overall and outperforms huggingface transformers by up to 2.3x