- 01 Aug, 2024 1 commit
Daniël de Kok authored
* Fix cache block size for flash decoding
  This seems to have been accidentally dropped during the TRT-LLM PR rebase.
* Also run CI on changes to `backends`
- 05 Jul, 2024 1 commit
Daniël de Kok authored
* Add more representative Llama GPTQ test
  The Llama GPTQ test is updated to use a model with the commonly-used quantizer config format and activation sorting. The old test is kept around (but renamed) since it tests the format produced by `text-generation-server quantize`.
* Add support for manually triggering a release build
- 24 Jun, 2024 1 commit
Nicolas Patry authored
* New runner. Manual squash.
* Network host.
* Put back trufflehog with proper extension.
* No network host ?
* Moving buildx install after tailscale ?
* 1.79