[Docs] update the PT 2.0 optimization doc with latest findings (#3370)
* add: benchmarking stats for A100 and V100.
* Apply suggestions from code review
  Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
* address patrick's comments.
* add: rtx 4090 stats
* ⚔ benchmark reports done
* Apply suggestions from code review
  Co-authored-by: Pedro Cuenca <pedro@huggingface.co>
* 3313 pr link.
* add: plots.
  Co-authored-by: Pedro <pedro@huggingface.co>
* fix formatting
* update number percent.

---------

Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
Co-authored-by: Pedro Cuenca <pedro@huggingface.co>