Unverified Commit 61916fef authored by regisss's avatar regisss Committed by GitHub
Browse files

Update Habana Gaudi doc (#3863)

* Update Habana Gaudi doc

* Fix typo
parent fc6acb6b
...@@ -16,8 +16,8 @@ specific language governing permissions and limitations under the License. ...@@ -16,8 +16,8 @@ specific language governing permissions and limitations under the License.
## Requirements ## Requirements
- Optimum Habana 1.5 or later, [here](https://huggingface.co/docs/optimum/habana/installation) is how to install it. - Optimum Habana 1.6 or later, [here](https://huggingface.co/docs/optimum/habana/installation) is how to install it.
- SynapseAI 1.9. - SynapseAI 1.10.
## Inference Pipeline ## Inference Pipeline
...@@ -41,7 +41,7 @@ pipeline = GaudiStableDiffusionPipeline.from_pretrained( ...@@ -41,7 +41,7 @@ pipeline = GaudiStableDiffusionPipeline.from_pretrained(
scheduler=scheduler, scheduler=scheduler,
use_habana=True, use_habana=True,
use_hpu_graphs=True, use_hpu_graphs=True,
gaudi_config="Habana/stable-diffusion", gaudi_config="Habana/stable-diffusion-2",
) )
``` ```
...@@ -62,18 +62,18 @@ For more information, check out Optimum Habana's [documentation](https://hugging ...@@ -62,18 +62,18 @@ For more information, check out Optimum Habana's [documentation](https://hugging
## Benchmark ## Benchmark
Here are the latencies for Habana first-generation Gaudi and Gaudi2 with the [Habana/stable-diffusion](https://huggingface.co/Habana/stable-diffusion) Gaudi configuration (mixed precision bf16/fp32): Here are the latencies for Habana first-generation Gaudi and Gaudi2 with the [Habana/stable-diffusion](https://huggingface.co/Habana/stable-diffusion) and [Habana/stable-diffusion-2](https://huggingface.co/Habana/stable-diffusion-2) Gaudi configurations (mixed precision bf16/fp32):
- [Stable Diffusion v1.5](https://huggingface.co/runwayml/stable-diffusion-v1-5) (512x512 resolution): - [Stable Diffusion v1.5](https://huggingface.co/runwayml/stable-diffusion-v1-5) (512x512 resolution):
| | Latency (batch size = 1) | Throughput (batch size = 8) | | | Latency (batch size = 1) | Throughput (batch size = 8) |
| ---------------------- |:------------------------:|:---------------------------:| | ---------------------- |:------------------------:|:---------------------------:|
| first-generation Gaudi | 4.22s | 0.29 images/s | | first-generation Gaudi | 3.80s | 0.308 images/s |
| Gaudi2 | 1.70s | 0.925 images/s | | Gaudi2 | 1.33s | 1.081 images/s |
- [Stable Diffusion v2.1](https://huggingface.co/stabilityai/stable-diffusion-2-1) (768x768 resolution): - [Stable Diffusion v2.1](https://huggingface.co/stabilityai/stable-diffusion-2-1) (768x768 resolution):
| | Latency (batch size = 1) | Throughput | | | Latency (batch size = 1) | Throughput |
| ---------------------- |:------------------------:|:-------------------------------:| | ---------------------- |:------------------------:|:-------------------------------:|
| first-generation Gaudi | 23.3s | 0.045 images/s (batch size = 2) | | first-generation Gaudi | 10.2s | 0.108 images/s (batch size = 4) |
| Gaudi2 | 7.75s | 0.14 images/s (batch size = 5) | | Gaudi2 | 3.17s | 0.379 images/s (batch size = 8) |
Markdown is supported
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment