• Bartłomiej Kocot's avatar
    Add Grouped Conv Fwd Large Tensor kernel (#1432) · 4ec5c52a
    Bartłomiej Kocot authored
    * Support 64 bit indexing
    
    * Add new grouped conv fwd kernel for large tensors
    
    * Add instances large tensor
    
    * Fixes for transform conv to gemm
    
    * Fixes
    
    * fixes
    
    * Remove not needed instances
    
    * examples fixes
    
    * Remove not need ds arrays
    
    * Fix tests
    
    * Add 2GB check in gridwise dl
    
    * Fixes
    4ec5c52a
convolution_parameter.cpp 9.3 KB