- 17 Apr, 2025 1 commit
-
-
wang jiahao authored
Add bsz_tensors param to torch linear
-
- 16 Apr, 2025 2 commits
-
-
root authored
-
Chengyu Qiu authored
Feat: Add Function call support
-
- 15 Apr, 2025 2 commits
-
-
ZiWei Yuan authored
feat(build): display limited tail of subprocesses in real time
-
jizhilong authored
this is a followup on #1108
-
- 14 Apr, 2025 2 commits
-
-
ZiWei Yuan authored
chore: show cmake output in real time during build_ext
-
Creeper-MZ authored
-
- 13 Apr, 2025 4 commits
-
-
wang jiahao authored
使用长prompt时,避免rpc进程挂掉
-
wangkuigang-yewu-cmss authored
当prompt超过cache_len的时候,rpc进程会crash掉,导致整体不可用。 这里增加一个检查,让过长的prompt在请求早期就被提前过滤掉
-
ErvinXie authored
xxHash: fix link error due to non-position-independent code
-
ZiWei Yuan authored
Add missing references to ik_llama.cpp
-
- 11 Apr, 2025 2 commits
-
-
Iwan Kawrakow authored
-
Iwan Kawrakow authored
-
- 10 Apr, 2025 2 commits
-
-
Aubrey Li authored
Add PROPERTIES POSITION_INDEPENDENT_CODE option to fix the following error: /usr/bin/ld: ../../third_party/xxHash/libxxhash.a(xxhash.c.o): relocation R_X86_64_32S against `.rodata' can not be used when making a shared object; recompile with -fPIC Trying to link a non-PIC static library libxxhash.a into a .so shared library, which is not allowed. The object file xxhash.c.o must be recompiled with explicit -fPIC support.
-
jizhilong authored
otherwise cmake error messages may be suppressed, making debugging difficult
-
- 09 Apr, 2025 20 commits
-
-
Atream authored
Create SECURITY.md
-
Atream authored
-
Atream authored
Update llama4.md
-
Atream authored
-
Atream authored
Update llama4.md
-
Atream authored
-
Atream authored
Update llama4.md
-
Atream authored
-
Atream authored
Update llama4.md
-
Atream authored
-
Jianwei Dong authored
update llama4 tutorial
-
djw authored
-
Jianwei Dong authored
update llama4 tutorial
-
djw authored
-
Jianwei Dong authored
update llama4 tutorial
-
djw authored
-
Azure authored
balance_serve: Add g++ to compiler list
-
Aubrey Li authored
In some OS distributions, g++ exists in the following form: # ls -l /usr/bin/g++* -rwxr-xr-x 4 root root 985784 Dec 9 12:51 /usr/bin/g++ So make sure to add g++ to the compiler list as well.
-
Atream authored
Update attention.py
-
Atream authored
-
- 08 Apr, 2025 5 commits
-
-
wang jiahao authored
Update balance-serve.md
-
wang jiahao authored
-
wang jiahao authored
format kvc2, delete quant_configs, move model_configs to ~/.ktransfor…
-
qiyuxinlin authored
-
Atream authored
fix compile, add abi check to setup.py
-