• Paul Fultz II's avatar
    Improve contiguous and concat performance (#368) · 9b55685c
    Paul Fultz II authored
    * Add env to trace nary device functions
    
    * Formatting
    
    * Improve contiguous and concat performance
    
    * Formatting
    
    * Remove unused variable
    
    * Formatting
    
    * Fix gpu tests
    
    * Formatting
    
    * Add more test for transposed concat
    
    * Formatting
    
    * Compute offset and not index
    
    * Compute multi-index once
    
    * Formatting
    
    * Fix transposed inputs
    
    * Formatting
    
    * Use product order for comparisons of hip_array
    
    * Formatting
    
    * Add missing s parameter
    
    * Formatting
    
    * Dont invert permutation
    
    * Fix tidy warnings
    
    * Formatting
    
    * Remove incorrect license
    
    * Use a single integer for stride
    
    * Formatting
    
    * Fix tidy issue
    9b55685c
simplify_reshapes.cpp 5.86 KB