• Michael Yang's avatar
    fix: mllama quality (#10807) · adff143b
    Michael Yang authored
    * fix mllama convert
    
    - transform attn_gate and ffn_gate
    - swap attention heads for vision models
    
    * fix mllama
    
    the mlp gate which was applied in the wrong place
    adff143b
convert_mllama.go 5.24 KB