    support prefix cache on kme · 6f1db287
    zhuwenwen authored
    fix the test_moe error caused by moe align not supporting 511
    switch multi-modal to the torch implementation on z100l & k100
vision.py 5.65 KB