- 22 May, 2025 1 commit
Michael Yang authored
* fix mllama convert
  - transform attn_gate and ffn_gate
  - swap attention heads for vision models
* fix mllama: the mlp gate was applied in the wrong place
-
- 14 May, 2025 1 commit
Michael Yang authored
-