-
Penut Chen authored
* Fix the incorrect permutation of gguf * rename num_kv_heads Co-authored-by:
Marc Sun <57196510+SunMarc@users.noreply.github.com> * add typing to num_kv_heads Co-authored-by:
Marc Sun <57196510+SunMarc@users.noreply.github.com> * rename variables * refactor permute function name * update the expected text of the llama3 q4 test --------- Co-authored-by:
Marc Sun <57196510+SunMarc@users.noreply.github.com>
ac946aac