- 21 Mar, 2025 2 commits
-
-
Nicolò Lucchesi authored
Signed-off-by: NickLucche <nlucches@redhat.com>
-
Travis Johnson authored
Signed-off-by: Travis Johnson <tsjohnso@us.ibm.com>
-
- 20 Mar, 2025 1 commit
-
-
Mickaël Seznec authored
Signed-off-by: Mickael Seznec <mickael@mistral.ai>
-
- 15 Mar, 2025 1 commit
-
-
Li, Jiang authored
Signed-off-by: jiang1.li <jiang1.li@intel.com>
-
- 14 Mar, 2025 2 commits
-
-
Lucas Wilkinson authored
Signed-off-by: Lucas Wilkinson <lwilkins@redhat.com>
-
Thomas Parnell authored
Signed-off-by: Thomas Parnell <tpa@zurich.ibm.com>
Co-authored-by: Jan van Lunteren <jvl@zurich.ibm.com>
Co-authored-by: Burkhard Ringlein <ngl@zurich.ibm.com>
Co-authored-by: Chih-Chieh Yang <chih.chieh.yang@ibm.com>
-
- 12 Mar, 2025 3 commits
-
-
TJian authored
[FEAT] [ROCm] [Embedding] Add encoder-only model support into ROCm Flash Attention to enable embedding models. (#14664)
Signed-off-by: tjtanaa <tunjian.tan@embeddedllm.com>
-
Sage Moore authored
Signed-off-by: Sage Moore <sage@neuralmagic.com>
-
Li, Jiang authored
Signed-off-by: jiang1.li <jiang1.li@intel.com>
Co-authored-by: Isotr0py <2037008807@qq.com>
-
- 11 Mar, 2025 3 commits
-
-
Yang.Tao authored
-
Jeff Daily authored
Signed-off-by: Jeff Daily <jeff.daily@amd.com>
-
Liangfu Chen authored
-
- 10 Mar, 2025 1 commit
-
-
Harry Mellor authored
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
- 07 Mar, 2025 1 commit
-
-
Luka Govedič authored
Signed-off-by: luka <luka@neuralmagic.com>
-
- 06 Mar, 2025 8 commits
-
-
Michael Goin authored
Signed-off-by: mgoin <mgoin64@gmail.com>
-
Michael Goin authored
Signed-off-by: mgoin <mgoin64@gmail.com>
-
Ying Zhong authored
Signed-off-by: ZhongYingMatrix <zhongyingmatrix@gmail.com>
-
Thomas Parnell authored
Signed-off-by: Thomas Parnell <tpa@zurich.ibm.com>
Co-authored-by: Burkhard Ringlein <ngl@zurich.ibm.com>
Co-authored-by: Jan van Lunteren <jvl@zurich.ibm.com>
-
Thomas Parnell authored
-
Pavani Majety authored
-
Nicolò Lucchesi authored
-
Lucas Wilkinson authored
Signed-off-by: Lucas Wilkinson <lwilkinson@neuralmagic.com>
Co-authored-by: Ying Zhong <zhongyingmatrix@gmail.com>
-
- 05 Mar, 2025 1 commit
-
-
Tyler Michael Smith authored
-
- 03 Mar, 2025 1 commit
-
-
TJian authored
-
- 27 Feb, 2025 3 commits
-
-
qli88 authored
Signed-off-by: qli88 <qiang.li2@amd.com>
-
Yang Chen authored
Signed-off-by: Yang Chen <yangche@fb.com>
-
Lucas Wilkinson authored
Signed-off-by: Lucas Wilkinson <lwilkinson@neuralmagic.com>
Signed-off-by: Lucas Wilkinson <lwilkins@redhat.com>
-
- 26 Feb, 2025 2 commits
-
-
Harry Mellor authored
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
Sage Moore authored
Signed-off-by: Sage Moore <sage@neuralmagic.com>
-
- 25 Feb, 2025 5 commits
-
-
Junlin Zhou authored
-
Michael Goin authored
Signed-off-by: mgoin <mgoin64@gmail.com>
-
wangxiyuan authored
Signed-off-by: wangxiyuan <wangxiyuan1007@gmail.com>
-
cjackal authored
Signed-off-by: cjackal <44624812+cjackal@users.noreply.github.com>
-
Harry Mellor authored
-
- 23 Feb, 2025 1 commit
-
-
Andy Lo authored
Signed-off-by: Andy Lo <andy@mistral.ai>
-
- 22 Feb, 2025 2 commits
-
-
Sage Moore authored
[V1][Kernel] Refactor the prefix_prefill kernel so that the caller no longer has to pass in the context lengths (#13095)
-
Gordon Wong authored
-
- 21 Feb, 2025 2 commits
-
-
Lucas Wilkinson authored
Signed-off-by: Lucas Wilkinson <lwilkinson@neuralmagic.com>
Signed-off-by: Lucas Wilkinson <lwilkins@redhat.com>
Co-authored-by: Patrick Horn <patrick.horn@gmail.com>
Co-authored-by: simon-mo <xmo@berkeley.edu>
Co-authored-by: Tyler Michael Smith <tyler@neuralmagic.com>
-
Lingfan Yu authored
Signed-off-by: Lingfan Yu <lingfany@amazon.com>
-
- 20 Feb, 2025 1 commit
-
-
Gregory Shtrasberg authored
-