bfloat16 reduces the latency from 7.36 seconds to 4.63 seconds.
_(We later ran the experiments in float16 and found that recent versions of torchao do not run into numerical problems with float16.)_
**Why bfloat16?**
* Running inference in a reduced numerical precision (such as float16 or bfloat16) doesn’t affect generation quality but significantly improves latency (see the sketch below).
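
To make the idea concrete, here is a minimal sketch of loading a Diffusers pipeline in bfloat16; the checkpoint id, prompt, step count, and guidance scale are placeholders rather than the exact configuration we benchmarked:

```python
import torch
from diffusers import DiffusionPipeline

# Placeholder checkpoint id; substitute the pipeline you are benchmarking.
ckpt_id = "black-forest-labs/FLUX.1-dev"

# torch_dtype=torch.bfloat16 loads the weights in reduced precision, halving
# the memory footprint versus float32 and letting the GPU run the matmuls
# in bfloat16.
pipe = DiffusionPipeline.from_pretrained(
    ckpt_id, torch_dtype=torch.bfloat16
).to("cuda")

prompt = "a photo of an astronaut riding a horse"
image = pipe(prompt, num_inference_steps=28, guidance_scale=3.5).images[0]
image.save("bf16_sample.png")
```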