"vllm/tool_parsers/hermes_tool_parser.py" did not exist on "f256ebe4df6757d76f1f1642d7e110268a2f8190"
-
Wentao Ye authored
[Perf] Optimize cutlass moe problem size calculation, 5.3% E2E Throughput improvement, 2.2% TTFT improvement (#31830) Signed-off-by:
yewentao256 <zhyanwentao@126.com> Signed-off-by:
Wentao Ye <44945378+yewentao256@users.noreply.github.com> Co-authored-by:
Luka Govedič <ProExpertProg@users.noreply.github.com>
308feab3