1. 31 May, 2023 2 commits
  2. 29 May, 2023 6 commits
  3. 24 May, 2023 8 commits
    • Illia Silin's avatar
      Clean-up the headers (#713) · ac9e01e2
      Illia Silin authored
      
      
      * fix headers for gpu instances
      
      * remove unused headers
      
      ---------
      Co-authored-by: default avatarzjing14 <zhangjing14@gmail.com>
      ac9e01e2
    • Po-Yen, Chen's avatar
      Remove no-longer used data members · 8c1450f6
      Po-Yen, Chen authored
      8c1450f6
    • Po-Yen, Chen's avatar
      Fix wrong debug message · ae360af6
      Po-Yen, Chen authored
      ae360af6
    • Po-Yen, Chen's avatar
      Create descriptor on device side · 690b0ec9
      Po-Yen, Chen authored
      690b0ec9
    • Po-Yen, Chen's avatar
      Fix wrong error message (for type name) · d73041a6
      Po-Yen, Chen authored
      d73041a6
    • rocking's avatar
      Pool3d fwd (#697) · 76ec0089
      rocking authored
      * Expand the base class of pool2d, prepare to share base class with pool3d
      
      * Add pool3d device op
      
      * Add pool3d f16 example
      
      * Refactor the base class. implement generic pooling in the future
      
      * clang format
      
      * get original index in max pooling
      
      * Add outputindex to base class
      
      * Fix dimension
      
      * Add pooling instance
      
      * Use indexType instead
      
      * Remove useless header
      
      * Extract IndexDataType to template
      
      * Extract pooling reference code
      
      * clang format
      
      * clang format
      
      * Fix typo
      
      * Add tensor stride
      
      * Add missing header
      
      * Add index stride and output stride
      
      * Refine naming
      
      * Add type to base class
      
      * Rename file
      
      * Use proper size
      
      * Fix typo
      
      * Refine naming
      
      * Modify the argument into vector.
      
      * Add max pool profiler
      
      * Refine naming
      
      * Support f32 pool
      
      * Fix typo
      
      * Add avg pool2d fwd in profiler
      
      * clang format
      
      * Rename AccDatatype to ComputeDatatype
      
      * Fix init
      
      * test pool
      
      * Extract variable
      
      * Add client example
      
      * Check the pooling dim
      
      * clang format
      
      * Connect argv and arg_parser
      
      * Add found check
      
      * Remove useless header
      
      * Refine naming
      
      * Adjust the order of device_pool_fwd
      76ec0089
    • Po-Yen, Chen's avatar
      Remove unnecessary parameters · c69f237d
      Po-Yen, Chen authored
      c69f237d
    • Po-Yen, Chen's avatar
      77d0cf78
  4. 23 May, 2023 1 commit
    • Illia Silin's avatar
      Enable gemm_dl and other kernels on Navi3x. (#714) · d821d1e5
      Illia Silin authored
      * enable dl kernels on navi3
      
      * do not build xdl tests and examples on Navi
      
      * run tests before building everything on jenkins
      
      * disable gemm_bilinear on gfx1030
      
      * add gpu targets to installer on Navi
      
      * put tests in the same order as before
      
      * reduce the number of navi targets in CI
      
      * build CI installed for gfx940 as well
      
      * only build for MI300 during QA runs
      d821d1e5
  5. 18 May, 2023 1 commit
  6. 17 May, 2023 2 commits
  7. 16 May, 2023 6 commits
  8. 11 May, 2023 1 commit
  9. 10 May, 2023 1 commit
  10. 09 May, 2023 4 commits
  11. 06 May, 2023 8 commits