  • Add Grouped Conv Fwd Large Tensor kernel (#1432) · 4ec5c52a
    Bartłomiej Kocot authored
    * Support 64-bit indexing
    
    * Add new grouped conv fwd kernel for large tensors
    
    * Add large tensor instances
    
    * Fixes for transform conv to gemm
    
    * Fixes
    
    * fixes
    
    * Remove unneeded instances
    
    * Fix examples
    
    * Remove unneeded ds arrays
    
    * Fix tests
    
    * Add 2GB check in gridwise dl
    
    * Fixes
conv_util.cpp 6.55 KB