1. 23 Jul, 2024 2 commits
  2. 22 Jul, 2024 1 commit
    • Phil Wang's avatar
      backwards for softcapping (#1033) · 5f1ae4a3
      Phil Wang authored
      * check in the two ways of approaching backwards for softcapping, both functional
      
      * prepare the softcap switch for backwards
      
      * temporary
      
      * cleanup to the way Tri prefers
      
      * calculate dtanh when copying from scores -> dtanh Tensor
      
      * no ternary operators allowed for constexpr, so just use some hack found online
      
      * fix maybe_dtanh, restore some files
      
      * restore another file
      
      * move calculate_dtanh to utils and colocate with apply_softcap
      
      * cleanup
      
      * maybe last cleanup
      
      * save for another pr
      
      * remove a stray line
      
      * fix spacing
      
      * fix an issue, and make test_flash_attn.py ready to test softcapping backwards
      5f1ae4a3
  3. 11 Jul, 2024 1 commit
  4. 10 Jul, 2024 4 commits
  5. 08 Jul, 2024 1 commit
  6. 03 Jul, 2024 1 commit
  7. 01 Jul, 2024 1 commit
  8. 27 Jun, 2024 1 commit
  9. 15 Mar, 2024 1 commit
  10. 21 Feb, 2024 1 commit
  11. 23 Jan, 2024 1 commit
  12. 14 Jan, 2024 1 commit
  13. 13 Jan, 2024 1 commit
  14. 24 Dec, 2023 1 commit
  15. 22 Dec, 2023 1 commit
  16. 03 Oct, 2023 1 commit
  17. 26 Sep, 2023 1 commit
  18. 24 Sep, 2023 1 commit
  19. 21 Sep, 2023 1 commit
  20. 18 Sep, 2023 1 commit
  21. 16 Sep, 2023 1 commit
  22. 13 Sep, 2023 1 commit
  23. 04 Sep, 2023 2 commits
  24. 29 Aug, 2023 1 commit
  25. 25 Aug, 2023 1 commit
  26. 19 Aug, 2023 1 commit
  27. 16 Aug, 2023 1 commit
  28. 13 Aug, 2023 1 commit
  29. 01 Aug, 2023 2 commits
  30. 17 Jul, 2023 1 commit
  31. 13 Nov, 2022 2 commits
  32. 09 Nov, 2022 1 commit
  33. 06 Nov, 2022 1 commit