"configs/datasets/SuperGLUE_WiC/SuperGLUE_WiC_gen_d06864.py" did not exist on "36f111100f1bce17aecfa37af0cf748d5f93a702"
  • Li Zhang's avatar
    [Feature] Blazing fast W4A16 inference (#202) · c3290cad
    Li Zhang authored
    * add w4a16
    
    * fix `deploy.py`
    
    * add doc
    
    * add w4a16 kernels
    
    * fuse w1/w3 & bugfixes
    
    * fix typo
    
    * python
    
    * guard sm75/80 features
    
    * add missing header
    
    * refactor
    
    * qkvo bias
    
    * update cost model
    
    * fix lint
    
    * update `deploy.py`
    c3290cad
LlamaTritonModel.cc 17.4 KB