"examples/trials/sklearn/vscode:/vscode.git/clone" did not exist on "c4d449c5a1be0cd069c79b2f2f2754a945089411"
Support alibi, by Sanghun Cho from Kakao Brain
* hard-code alibi in fwd
* use params.h as hun_heads
* hard-code alibi in bwd
* add alibi on/off option
* compute alibi_start, ratio outside of kernels
* fix minor merge conflict
* add test_alibi.py
* change apply_alibi() location before masking
* add alibi in splitkv kernel
* fix backward func # of returns
* add out-of-bound check in apply_alibi()
* update test_alibi.py
* update test_alibi.py for kvcache
* simplify alibi parameter interface
* fix performance issue
by computing alibi outside of branch
* update test_flash_attn_varlen_func() for left padding
* implement alibi_slopes (b, nh) loading
* optimize apply_alibi() a bit
* update test cases for alibi_slopes loading
* reflect stylistic comments
* disable "seqlenq_ngroups_swapped" when using alibi
---------
Co-authored-by:
monk.detective <monk.detective@kakaobrain.com>
Showing
csrc/flash_attn/src/alibi.h
0 → 100644
tests/test_alibi.py
0 → 100644
This diff is collapsed.
Please register or sign in to comment