- 08 Aug, 2025 1 commit
-
-
Lucas Wilkinson authored
Signed-off-by:
Lucas Wilkinson <lwilkins@redhat.com> Co-authored-by:
yewentao256 <zhyanwentao@126.com> Co-authored-by:
Wentao Ye <44945378+yewentao256@users.noreply.github.com>
-
- 06 Aug, 2025 2 commits
-
-
Lucas Wilkinson authored
Signed-off-by:LucasWilkinson <lwilkinson@neuralmagic.com>
-
Woosuk Kwon authored
Signed-off-by:Woosuk Kwon <woosuk.kwon@berkeley.edu>
-
- 01 Jul, 2025 1 commit
-
-
Woosuk Kwon authored
Signed-off-by:Woosuk Kwon <woosuk.kwon@berkeley.edu>
-
- 24 Jun, 2025 2 commits
-
-
Eli Uriegas authored
Signed-off-by:Eli Uriegas <eliuriegas@meta.com>
-
Lucas Wilkinson authored
Signed-off-by:Lucas Wilkinson <lwilkinson@neuralmagic.com>
-
- 17 Jun, 2025 1 commit
-
-
Lucas Wilkinson authored
-
- 28 May, 2025 1 commit
-
-
Luka Govedič authored
-
- 25 Apr, 2025 2 commits
-
-
yexin(叶鑫) authored
[Perf]Optimize rotary_emb implementation to use Triton operator for improved inference performance (#16457) Signed-off-by:
cynthieye <yexin93@qq.com> Co-authored-by:
MagnetoWang <magnetowang@outlook.com>
-
Lucas Wilkinson authored
Signed-off-by:Lucas Wilkinson <lwilkinson@neuralmagic.com>
-
- 17 Apr, 2025 1 commit
-
-
Lucas Wilkinson authored
Signed-off-by:Lucas Wilkinson <lwilkinson@neuralmagic.com>
-
- 20 Mar, 2025 1 commit
-
-
Mickaël Seznec authored
Signed-off-by:Mickael Seznec <mickael@mistral.ai>
-
- 06 Mar, 2025 1 commit
-
-
Pavani Majety authored
-
- 27 Feb, 2025 1 commit
-
-
Lucas Wilkinson authored
Signed-off-by:
Lucas Wilkinson <lwilkinson@neuralmagic.com> Signed-off-by:
Lucas Wilkinson <lwilkins@redhat.com>
-