"src/turbomind/kernels/unfused_attention_kernels.cu" did not exist on "9efcac38af58b7247e205c47efe090b4c6ec7574"
- 07 Sep, 2023 1 commit
-
-
Merve Noyan authored
IDK what else to add in this guide, I looked for relevant code in TGI codebase and saw that it's used in quantization as well (maybe I could add that?)
-