- 02 May, 2025 1 commit
-
-
Caleb_Du authored
Signed-off-by:Caleb_Du <Caleb_Du@zju.edu.cn>
-
- 15 Apr, 2025 1 commit
-
-
Jinzhen Lin authored
Signed-off-by:
Jinzhen Lin <linjinzhen@hotmail.com> Co-authored-by:
Michael Goin <michael@neuralmagic.com> Co-authored-by:
mgoin <mgoin64@gmail.com>
-
- 12 Mar, 2025 2 commits
-
-
TJian authored
[FEAT] [ROCm] [Embedding] Add encoder-only model support into ROCm Flash Attention to enable embedding models. (#14664) Signed-off-by:tjtanaa <tunjian.tan@embeddedllm.com>
-
Sage Moore authored
Signed-off-by:Sage Moore <sage@neuralmagic.com>
-
- 11 Mar, 2025 1 commit
-
-
Jinzhen Lin authored
Signed-off-by:
Jinzhen Lin <linjinzhen@hotmail.com> Co-authored-by:
mgoin <mgoin64@gmail.com>
-
- 03 Feb, 2025 1 commit
-
-
Yang Chen authored
sgl_moe_align_block_size is based on: https://github.com/sgl-project/sglang/commit/ded9fcd09a43d5e7d5bb31a2bc3e9fc21bf65d2a moe_align_block_size is based on: https://github.com/sgl-project/sglang/commit/ba5112ff691d791a9e38c6c71f59324a5fcb49d0 Signed-off-by:
Yang Chen <yangche@fb.com>
-
- 24 Oct, 2024 1 commit
-
-
Charlie Fu authored
Signed-off-by:charlifu <charlifu@amd.com>
-
- 17 Oct, 2024 1 commit
-
-
bnellnm authored
-
- 04 Oct, 2024 2 commits
-
-
ElizaWszola authored
Co-authored-by:
Dipika <dipikasikka1@gmail.com> Co-authored-by:
Dipika Sikka <ds3822@columbia.edu>
-
Lucas Wilkinson authored
-
- 16 Sep, 2024 1 commit
-
-
ElizaWszola authored
Co-authored-by:Dipika <dipikasikka1@gmail.com>
-
- 10 Sep, 2024 1 commit
-
-
Dipika Sikka authored
-
- 27 Aug, 2024 1 commit
-
-
Dipika Sikka authored
Co-authored-by:ElizaWszola <eliza@neuralmagic.com>
-
- 22 Aug, 2024 1 commit
-
-
Michael Goin authored
-
- 21 Aug, 2024 1 commit
-
-
Dipika Sikka authored
Co-authored-by:ElizaWszola <eliza@neuralmagic.com>
-
- 02 Aug, 2024 1 commit
-
-
Lucas Wilkinson authored
-
- 09 Jun, 2024 1 commit
-
-
bnellnm authored
-