    Add Grouped Conv Fwd Large Tensor kernel (#1432) · 4ec5c52a
    Bartłomiej Kocot authored
    * Support 64-bit indexing
    
    * Add new grouped conv fwd kernel for large tensors
    
    * Add large tensor instances
    
    * Fixes for transform conv to gemm
    
    * Fixes
    
    * fixes
    
    * Remove unneeded instances
    
    * Fix examples
    
    * Remove unneeded ds arrays
    
    * Fix tests
    
    * Add 2GB check in gridwise dl
    
    * Fixes
conv_util.cpp 6.55 KB