* Fix llama MQA * Fix permute shape * Update llama.py
Attach a file by drag & drop or click to upload