Commit 26294b2f authored by Lianmin Zheng, committed by GitHub

Update README.md

parent 75b31a2a
@@ -396,7 +396,8 @@ Instructions for supporting a new model are [here](https://github.com/sgl-projec ...
 - Mixtral-8x7B on NVIDIA A10G, FP16, Tensor Parallelism=8
 ![mixtral_8x7b](assets/mixtral_8x7b.jpg)
-Learn more [here](docs/benchmark_results.md).
+- Learn more about the above [results](docs/benchmark_results.md).
+- Synthetic latency and throughput benchmark [scripts](https://github.com/sgl-project/sglang/tree/main/benchmark/latency_throughput).
 ## Roadmap
 https://github.com/sgl-project/sglang/issues/157