  1. 06 Nov, 2022 1 commit
    • [Feature] Add bfloat16 (bf16) support (#4648) · 96297fb8
      Xin Yao authored
      * add bf16 specializations
      
      * remove SWITCH_BITS
      
      * enable amp for bf16
      
      * remove SWITCH_BITS for cpu kernels
      
      * enable bf16 based on CUDART
      
      * fix compiling for sm<80
      
      * fix cpu build
      
      * enable unit tests
      
      * update doc
      
      * disable test for CUDA < 11.0
      
      * address comments
      
      * address comments
  2. 21 Sep, 2022 1 commit
  3. 28 Jan, 2021 1 commit
  4. 22 Nov, 2020 1 commit
  5. 14 Aug, 2020 1 commit
  6. 13 Jul, 2020 1 commit
    • [Feature] Autograd of gspmm and gsddmm on PyTorch/MXNet/TensorFlow (#1680) · f25b1a06
      Zihao Ye authored
      * init
      
      * reverse (by minjie)
      
      * upd
      
      * upd
      
      * upd
      
      * upd
      
      * upd
      
      * upd
      
      * upd
      
      * gpu
      
      * upd
      
      * upd
      
      * upd
      
      * upd
      
      * upd
      
      * upd
      
      * upd
      
      * imidiot
      
      * fix
      
      * upd
      
      * upd
      
      * upd
      
      * upd
      
      * upd
      
      * upd
      
      * fix
      
      * upd
      
      * upd
      
      * upd
      
      * upd
      
      * upd
      
      * upd
      
      * fix
      
      * remove redundancy
      
      * upd
      
      * upd
      
      * upd
      
      * cache
      
      * upd
      
      * upd
      
      * upd
      
      * upd
      
      * upd
      
      * upd
      
      * upd
      
      * trigger
      
      * upd
      
      * fix
      
      * upd
      
      * unused import
      
      * upd
      
      * upd
  7. 28 Jun, 2020 1 commit
  8. 22 Jun, 2020 1 commit
    • [kernel] New SpMM & SDDMM kernel on CPU and CUDA (#1644) · 071cba1f
      Zihao Ye authored
      * upd
      
      * simplify
      
      * sddmm dot cpu
      
      * upd
      
      * format
      
      * upd
      
      * compatible with MJ's PR
      
      * lint
      
      * upd
      
      * upd
      
      * upd
      
      * python end
      
      * upd
      
      * upd
      
      * lint
      
      * lint
      
      * upd
      
      * upd
      
      * upd
      
      * upd
      
      * upd
      
      * lint
      
      * fix mxnet
      
      * upd
      
      * lint
      
      * use minjie's ptr
      
      * macro
      
      * upd
      
      * reorg
      
      * lint
      
      * fix corner cases
      
      * upd
      
      * enrich cpu docs
      
      * upd
      
      * upd
      
      * lint
      
      * lint
      
      * pylint
      
      * sx review
      
      * improve docstring
      
      * python doc
      
      * upd
      
      * restructure
      
      * lint
      
      * upd test
      
      * upd
      
      * pylint
      
      * fix corner cases and test