- 16 Apr, 2025 2 commits
-
-
kevin authored
-
Chengyu Qiu authored
Feat: Add Function call support
-
- 15 Apr, 2025 2 commits
-
-
ZiWei Yuan authored
feat(build): display limited tail of subprocesses in real time
-
jizhilong authored
this is a followup on #1108
-
- 14 Apr, 2025 3 commits
-
-
ZiWei Yuan authored
chore: show cmake output in real time during build_ext
-
sean.su authored
Defined new data structures in chat.py to replace OpenAI's original implementation, adding support for tool calling. Implemented logic for extracting and processing tool calls, enabling dynamic function invocation during conversations. Added methods in balance_serve.py to retrieve sampling parameters, handling default values and edge cases. Updated ktransformers.py and transformers.py to support the passing of tool parameters. Modified the default value of top_p in config.py to 1.0 to increase generation diversity. Extended the message model in chat.py to support the transmission of tool call information. These changes enhance the system's flexibility and functionality, enabling more complex interaction patterns.
-
Creeper-MZ authored
-
- 13 Apr, 2025 4 commits
-
-
wang jiahao authored
使用长prompt时,避免rpc进程挂掉
-
wangkuigang-yewu-cmss authored
当prompt超过cache_len的时候,rpc进程会crash掉,导致整体不可用。 这里增加一个检查,让过长的prompt在请求早期就被提前过滤掉
-
ErvinXie authored
xxHash: fix link error due to non-position-independent code
-
ZiWei Yuan authored
Add missing references to ik_llama.cpp
-
- 11 Apr, 2025 2 commits
-
-
Iwan Kawrakow authored
-
Iwan Kawrakow authored
-
- 10 Apr, 2025 2 commits
-
-
Aubrey Li authored
Add PROPERTIES POSITION_INDEPENDENT_CODE option to fix the following error: /usr/bin/ld: ../../third_party/xxHash/libxxhash.a(xxhash.c.o): relocation R_X86_64_32S against `.rodata' can not be used when making a shared object; recompile with -fPIC Trying to link a non-PIC static library libxxhash.a into a .so shared library, which is not allowed. The object file xxhash.c.o must be recompiled with explicit -fPIC support.
-
jizhilong authored
otherwise cmake error messages may be suppressed, making debugging difficult
-
- 09 Apr, 2025 20 commits
-
-
Atream authored
Create SECURITY.md
-
Atream authored
-
Atream authored
Update llama4.md
-
Atream authored
-
Atream authored
Update llama4.md
-
Atream authored
-
Atream authored
Update llama4.md
-
Atream authored
-
Atream authored
Update llama4.md
-
Atream authored
-
Jianwei Dong authored
update llama4 tutorial
-
djw authored
-
Jianwei Dong authored
update llama4 tutorial
-
djw authored
-
Jianwei Dong authored
update llama4 tutorial
-
djw authored
-
Azure authored
balance_serve: Add g++ to compiler list
-
Aubrey Li authored
In some OS distributions, g++ exists in the following form: # ls -l /usr/bin/g++* -rwxr-xr-x 4 root root 985784 Dec 9 12:51 /usr/bin/g++ So make sure to add g++ to the compiler list as well.
-
Atream authored
Update attention.py
-
Atream authored
-
- 08 Apr, 2025 5 commits
-
-
wang jiahao authored
Update balance-serve.md
-
wang jiahao authored
-
wang jiahao authored
format kvc2, delete quant_configs, move model_configs to ~/.ktransfor…
-
qiyuxinlin authored
-
Atream authored
fix compile, add abi check to setup.py
-