1. 21 Nov, 2024 2 commits
  2. 15 Nov, 2024 1 commit
  3. 08 Nov, 2024 1 commit
  4. 07 Nov, 2024 1 commit
  5. 06 Nov, 2024 3 commits
  6. 05 Nov, 2024 1 commit
  7. 04 Nov, 2024 1 commit
  8. 30 Oct, 2024 3 commits
  9. 29 Oct, 2024 2 commits
  10. 26 Oct, 2024 2 commits
  11. 25 Oct, 2024 1 commit
    • aledudek's avatar
      Generic threshold calculation (#1546) · 9385caa3
      aledudek authored
      * Calculate generic relative threshold pool3dfwd
      
      * Calculate absolute error threshold pool3d fwd
      
      * Generic threshold calculation take max input for relative error pool3dfwd
      
      * Remove max possible value for error calculation at runtime
      
      * Remove debug print in pool3dfwd
      
      * Pool3d fwd adjusted types in generic threshold calculation
      
      * Generic threshold calculation take into account number of accumulations and accdatatype
      
      * Generic threshold fix final error formula
      
      * Generic threshold calculation - num of accs fix
      
      * Generic threshold calculation - adjust absolute error
      
      * Generic threshold calculation - OutDataType in absolute error
      9385caa3
  12. 22 Oct, 2024 1 commit
  13. 18 Oct, 2024 1 commit
  14. 14 Oct, 2024 3 commits
  15. 12 Oct, 2024 1 commit
  16. 09 Oct, 2024 1 commit
  17. 07 Oct, 2024 1 commit
  18. 04 Oct, 2024 1 commit
  19. 02 Oct, 2024 1 commit
  20. 25 Sep, 2024 1 commit
  21. 20 Sep, 2024 2 commits
  22. 13 Sep, 2024 1 commit
    • Jun Liu's avatar
      Customize filesystem in CK for legacy systems (#1509) · 81bc1496
      Jun Liu authored
      
      
      * Legacy support: customized filesystem
      
      * Update cmakefile for python alternative path
      
      * fix build issues
      
      * CK has no boost dependency
      
      * More fixes to issues found on legay systems
      
      * fix clang format issue
      
      * Check if blob is correctly generated in cmake
      
      * fix the python issues
      
      * add a compiler flag for codegen when using alternative python
      
      * use target_link_options instead of target_compile_options
      
      ---------
      Co-authored-by: default avatarillsilin <Illia.Silin@amd.com>
      81bc1496
  23. 12 Sep, 2024 1 commit
  24. 11 Sep, 2024 2 commits
  25. 05 Sep, 2024 2 commits
  26. 03 Sep, 2024 1 commit
  27. 02 Sep, 2024 1 commit
  28. 21 Aug, 2024 1 commit
    • Andriy Roshchenko's avatar
      Adding Instances and Examples for FP8-based Scaled Convolution and AMAX Reduction. (#1473) · c3515f27
      Andriy Roshchenko authored
      * Enable CMakePresets build
      
      * Verify Convolution, Scaling and ReLU algorithms.
      
      * Add tensor element-wise scale and type cast operation.
      
      * Reduction implemented but does not work.
      
      * Exploration of Reduction functionality.
      
      * Completed example for Convolution scaled with ReLu activation and AMAX reduction.
      
      * WIP: Add required instances for convolution.
      
      * WIP: Create client example. Implement convolution stage.
      
      * Add elementwise instances.
      
      * Add elementwise scale + convert example.
      
      * Add reduction instances.
      
      * WIP: Client example for AMAX reduction.
      
      * WIP: Add instances for multistage reduction.
      
      * WIP: Implementation of multistage reduction.
      
      * Refactoring.
      
      * Clean up.
      
      * Add CMakePresets.json
      
      * Guard off FP8 instances when the data type is not available.
      
      * Add example for Scaled FP8 Convolution with AMAX reduction.
      
      * Refactor CombConvScaleRelu instances.
      
      * Add CombConvScale instances.
      
      * Add client example for Scaled FP8 Convolution with AMAX reduction.
      
      * Cleanup.
      c3515f27