    Gridwise elementwise 2d (#466) · 0e5c264c
    arai713 authored
    
    
    * added 2d gridwise elementwise
    
    * added 2d version of device elementwise
    
    * added example file with updated device elementwise call
    
    * added CMake file
    
    * changed NumDim into 2D
    
    * fixed compiler issues
    
    * fixed indexing for loop step
    
    * fixed NumDim dimension error
    
    * changed blockID to 2D
    
    * updated Grid Desc
    
    * updated kernel call
    
    * fixed 2d thread indexing
    
    * added dimensions for example file
    
    * commented out unused code
    
    * changed vector load
    
    * removed extra code
    
    * temporarily removing vector load on 2nd dim
    
    * changed vector load back, still causing errors
    
    * altered indexing
    
    * changed isSupportedArgument for 2D
    
    * changed indexing + do/while
    
    * fixed isSupportedArgument
    
    * changed dimension for debugging
    
    * fixed
    
    * added testing printouts
    
    * testing change
    
    * added variables to distribute threads through both dimensions
    
    * testing changes
    
    * integrated variable for thread distribution into device elementwise and added as parameter for gridwise elementwise
    
    * removed most of the extraneous code, testing with different dimensions
    
    * testing
    
    * removed debugging print statements
    
    * moved 2d elementwise permute into elementwise permute directory
    
    * fixed formatting
    
    * removed debugging comments from threadwise transfer
    Co-authored-by: Jing Zhang <jizhan@amd.com>
    Co-authored-by: Po Yen Chen <PoYen.Chen@amd.com>
CMakeLists.txt 190 Bytes