1. 23 Jul, 2024 2 commits
  2. 22 Jul, 2024 1 commit
  3. 11 Jul, 2024 1 commit
  4. 10 Jul, 2024 3 commits
  5. 08 Jul, 2024 1 commit
  6. 01 Jul, 2024 1 commit
  7. 27 Jun, 2024 1 commit
  8. 08 Apr, 2024 1 commit
  9. 15 Mar, 2024 2 commits
  10. 21 Feb, 2024 1 commit
  11. 10 Feb, 2024 2 commits
  12. 08 Feb, 2024 1 commit
  13. 30 Jan, 2024 1 commit
  14. 23 Jan, 2024 1 commit
  15. 21 Jan, 2024 2 commits
  16. 24 Dec, 2023 2 commits
  17. 22 Dec, 2023 1 commit
  18. 20 Dec, 2023 1 commit
    • Sanghun Cho's avatar
      Support alibi, by Sanghun Cho from Kakao Brain · e4f726fc
      Sanghun Cho authored
      
      
      * hard-code alibi in fwd
      
      * use params.h as hun_heads
      
      * hard-code alibi in bwd
      
      * add alibi on/off option
      
      * compute alibi_start, ratio outside of kernels
      
      * fix minor merge conflict
      
      * add test_alibi.py
      
      * change apply_alibi() location before masking
      
      * add alibi in splitkv kernel
      
      * fix backward func # of returns
      
      * add out-of-bound check in apply_alibi()
      
      * update test_alibi.py
      
      * update test_alibi.py for kvcache
      
      * simplify alibi parameter interface
      
      * fix performance issue
      by computing alibi outside of branch
      
      * update test_flash_attn_varlen_func() for left padding
      
      * implement alibi_slopes (b, nh) loading
      
      * optimize apply_alibi() a bit
      
      * update test cases for alibi_slopes loading
      
      * reflect stylistic comments
      
      * disable "seqlenq_ngroups_swapped" when using alibi
      
      ---------
      Co-authored-by: default avatarmonk.detective <monk.detective@kakaobrain.com>
      e4f726fc
  19. 27 Nov, 2023 1 commit
  20. 20 Nov, 2023 1 commit
  21. 03 Oct, 2023 1 commit
  22. 26 Sep, 2023 1 commit
  23. 24 Sep, 2023 1 commit
  24. 21 Sep, 2023 1 commit
  25. 18 Sep, 2023 1 commit
  26. 16 Sep, 2023 1 commit
  27. 12 Sep, 2023 1 commit
  28. 11 Sep, 2023 1 commit
  29. 04 Sep, 2023 1 commit
  30. 29 Aug, 2023 1 commit
  31. 27 Jul, 2023 1 commit
  32. 17 Jul, 2023 1 commit