"git@developer.sourcefind.cn:OpenDAS/torch-harmonics.git" did not exist on "ba7a499698a8d7266cc3dbf9711ad5732b0f3ec6"
feat(src): add kv cache int8 quantization (#22)
* feat(src): add int8 and compile passed * feat(kernels): fix * feat(llama): update kernel * feat(src): add debug * fix(kernel): k_cache use int8_t pointer * style(llama): clean code * feat(deploy.py): revert to enable fmha * style(LlamaV2): clean code * feat(deploy.py): add default quant policy
Showing
Please register or sign in to comment