• Haocong WANG's avatar
    AIT Attention API refactor (#8) · efee4541
    Haocong WANG authored
    * sanity pass
    
    * sanity pass 2
    
    * confirm significant performance regression.
    
    * turn on all instances
    
    * turn off instance format
    
    * Fix bug & tunning & format
    
    * DML meta, self_attn+cross_attn
    
    * sanity pass
    
    * remove useless flag
    
    * update tile and problem size used in AIT attention
    
    * bug fix in grouped conv supporting check
    efee4541
unet_mha.sh 2.85 KB