"...text-generation-inference.git" did not exist on "f91e9d282d73e09cdb876924412f2ed66212d736"
Unverified Commit bdefabd1 authored by Sayak Paul's avatar Sayak Paul Committed by GitHub
Browse files

[Docs] update the PT 2.0 optimization doc with latest findings (#3370)



* add: benchmarking stats for A100 and V100.

* Apply suggestions from code review
Co-authored-by: default avatarPatrick von Platen <patrick.v.platen@gmail.com>

* address patrick's comments.

* add: rtx 4090 stats

* 

 benchmark reports done

* Apply suggestions from code review
Co-authored-by: default avatarPedro Cuenca <pedro@huggingface.co>

* 3313 pr link.

* add: plots.
Co-authored-by: default avatarPedro <pedro@huggingface.co>

* fix formattimg

* update number percent.

---------
Co-authored-by: default avatarPatrick von Platen <patrick.v.platen@gmail.com>
Co-authored-by: default avatarPedro Cuenca <pedro@huggingface.co>
parent 909742db
This diff is collapsed.
Markdown is supported
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment