Unverified Commit 09f45747 authored by binmakeswell's avatar binmakeswell Committed by GitHub
Browse files

[doc] update OPT serving (#2804)

* [doc] update OPT serving

* [doc] update OPT serving
parent 56ddc9ca
...@@ -195,10 +195,6 @@ Colossal-AI 为您提供了一系列并行组件。我们的目标是让您的 ...@@ -195,10 +195,6 @@ Colossal-AI 为您提供了一系列并行组件。我们的目标是让您的
- [Energon-AI](https://github.com/hpcaitech/EnergonAI) :用相同的硬件推理加速50% - [Energon-AI](https://github.com/hpcaitech/EnergonAI) :用相同的硬件推理加速50%
<p id="OPT-Serving" align="center">
<img src="https://raw.githubusercontent.com/hpcaitech/public_assets/main/colossalai/img/OPT_serving.png" width=800/>
</p>
- [OPT推理服务](https://colossalai.org/docs/advanced_tutorials/opt_service): 体验1750亿参数OPT在线推理服务 - [OPT推理服务](https://colossalai.org/docs/advanced_tutorials/opt_service): 体验1750亿参数OPT在线推理服务
<p id="BLOOM-Inference" align="center"> <p id="BLOOM-Inference" align="center">
......
...@@ -197,10 +197,6 @@ Please visit our [documentation](https://www.colossalai.org/) and [examples](htt ...@@ -197,10 +197,6 @@ Please visit our [documentation](https://www.colossalai.org/) and [examples](htt
- [Energon-AI](https://github.com/hpcaitech/EnergonAI): 50% inference acceleration on the same hardware - [Energon-AI](https://github.com/hpcaitech/EnergonAI): 50% inference acceleration on the same hardware
<p id="OPT-Serving" align="center">
<img src="https://raw.githubusercontent.com/hpcaitech/public_assets/main/colossalai/img/OPT_serving.png" width=800/>
</p>
- [OPT Serving](https://colossalai.org/docs/advanced_tutorials/opt_service): Try 175-billion-parameter OPT online services - [OPT Serving](https://colossalai.org/docs/advanced_tutorials/opt_service): Try 175-billion-parameter OPT online services
<p id="BLOOM-Inference" align="center"> <p id="BLOOM-Inference" align="center">
......
Markdown is supported
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment