1. 07 Mar, 2024 2 commits
  2. 06 Mar, 2024 2 commits
  3. 05 Mar, 2024 1 commit
  4. 02 Mar, 2024 1 commit
  5. 01 Mar, 2024 1 commit
  6. 29 Feb, 2024 1 commit
  7. 27 Feb, 2024 1 commit
    • Illia Silin's avatar
      Clip fp8 to +/-240 on all targets. (#1172) · d0c7b451
      Illia Silin authored
      * clip fp8 to +/-240 on all targets
      
      * if inputs to fp8 conversion are +/-inf, they remain unaltered
      
      * increase tolerance for test_elementwise_layernorm to prevent false errors
      
      * change the input values for gemm examples to floats
      
      * reduce gemm example float input values to prevent errors
      
      * increase the tolerance for gemm examples
      d0c7b451
  8. 26 Feb, 2024 1 commit
  9. 23 Feb, 2024 1 commit
  10. 21 Feb, 2024 1 commit
  11. 20 Feb, 2024 1 commit
  12. 16 Feb, 2024 1 commit
  13. 15 Feb, 2024 1 commit
  14. 13 Feb, 2024 3 commits
  15. 12 Feb, 2024 1 commit
  16. 09 Feb, 2024 2 commits
  17. 08 Feb, 2024 1 commit
  18. 07 Feb, 2024 4 commits
  19. 06 Feb, 2024 1 commit
  20. 05 Feb, 2024 1 commit
  21. 02 Feb, 2024 2 commits
  22. 31 Jan, 2024 2 commits
  23. 30 Jan, 2024 2 commits
  24. 29 Jan, 2024 1 commit
  25. 26 Jan, 2024 1 commit
  26. 25 Jan, 2024 1 commit
    • rocking's avatar
      layernorm & groupnorm bwd gamma beta (#1133) · 28f68a5a
      rocking authored
      * Add layernorm bwd gamma beta external api
      
      * Add groupnorm external api
      
      * Add layernorm bwd gamma beta profiler
      
      * Add groupnorm bwd gamma beta ckProfiler
      
      * Add layernorm & groupnorm bwd gamma beta test
      
      * Fix groupnorm bwd gamma beta profiler bug
      
      * Layernorm bwd weight client example
      
      * Groupnorm bwd weight client example
      
      * clang format
      
      * Remove useless header
      
      * Let inv_std be positive
      
      * Rename to num_bytes and move this calculation outside the loop
      28f68a5a
  27. 24 Jan, 2024 2 commits
    • Illia Silin's avatar
      Fixing most of the cppcheck errors. (#1142) · 180e5720
      Illia Silin authored
      * fix cppcheck errors, first pass
      
      * fix format
      
      * fix returned value in examples
      
      * add macro definitions for cppcheck
      
      * fix the profile_gemm logic
      
      * update the gemm profiler logic
      
      * add more difinitions to cppcheck, fix couple more errors
      
      * replace runtime error with message in device function
      
      * fix a couple of int4 issues
      
      * no return for fill function
      
      * fix errors in data_types.hpp
      
      * fix format
      
      * fix few remaining errors
      
      * fix errors in data_types.hpp
      
      * fix last couple of errors in datat_types.hpp
      180e5720
    • Bartłomiej Kocot's avatar
      Fix possible linting errors in changelog (#1141) · 6169fbbd
      Bartłomiej Kocot authored
      * Fix possible linting errors in changelog
      
      * Update CHANGELOG.md
      
      * Update CHANGELOG.md
      
      * Update CHANGELOG.md
      6169fbbd
  28. 22 Jan, 2024 1 commit