"llm/llama.cpp/ggml/src/ggml-cuda/scale.cuh" did not exist on "ecd2f176277db4f074e25a2c3646b04b51cec119"
Ck tile/layernorm: implement naive reduce, opt performance (#1784)
* add no welford * enable output raw * raw of int8 * fix build * fix smoke test err * [ck_tile]layernorm: fix welford ok, set int8 and bf16 small N as default and others open by generate * [cktile]layernorm, fix err commit files and remove uselss * fix quant 8192 err & change norm_reduce class and file name --------- Co-authored-by:coderfeli <coderfeli@163.com> Co-authored-by:
carlushuang <carlus.huang@amd.com>
Showing
Please register or sign in to comment