    [Kernels] added Triton implementation of self-attention for colossal-ai (#4241) · 4b977541
    Cuiqing Li authored
    * added softmax kernel
    
    * added qkv_kernel
    
    * added ops
    
    * adding tests
    
    * upload tests
    
    * fix tests
    
    * debugging
    
    * debugging tests
    
    * debugging
    
    * added
    
    * fixed errors
    
    * added softmax kernel
    
    * clean codes
    
    * added tests
    
    * update tests
    
    * update tests
    
    * added attention
    
    * add
    
    * fixed pytest checking
    
    * add cuda check
    
    * fix cuda version
    
    * fix typo
test_softmax.py 1.02 KB