1. 13 Jun, 2023 1 commit
    • AIT Attention API refactor (#8) · efee4541
      Haocong WANG authored
      * sanity pass
      
      * sanity pass 2
      
      * confirm significant performance regression.
      
      * turn on all instances
      
      * turn off instance format
      
      * Fix bug & tuning & format
      
      * DML meta, self_attn+cross_attn
      
      * sanity pass
      
      * remove useless flag
      
      * update tile and problem size used in AIT attention
      
      * bug fix in grouped conv support check