- 09 Apr, 2025 8 commits
-
-
Jianwei Dong authored
update llama4 tutorial
-
djw authored
-
Jianwei Dong authored
update llama4 tutorial
-
djw authored
-
Azure authored
balance_serve: Add g++ to compiler list
-
Aubrey Li authored
In some OS distributions, g++ exists in the following form: # ls -l /usr/bin/g++* -rwxr-xr-x 4 root root 985784 Dec 9 12:51 /usr/bin/g++ So make sure to add g++ to the compiler list as well.
-
Atream authored
Update attention.py
-
Atream authored
-
- 08 Apr, 2025 6 commits
-
-
wang jiahao authored
Update balance-serve.md
-
wang jiahao authored
-
wang jiahao authored
format kvc2, delete quant_configs, move model_configs to ~/.ktransfor…
-
qiyuxinlin authored
-
Atream authored
fix compile, add abi check to setup.py
-
Atream authored
-
- 07 Apr, 2025 10 commits
-
-
wang jiahao authored
Update balance-serve.md
-
wang jiahao authored
-
Azure authored
Fix TypeError when invoke KLinearCPUInfer.forward()
-
wang jiahao authored
fix temperature=0, flashinfer sample error
-
dongjw authored
-
Atream authored
Fix compile
-
Aubrey Li authored
Fix the following error: File "/home/aubrey/work/ktransformers/ktransformers/operators/linear.py", line 825, in forward y = self.generate_linear.forward(x, bsz_tensor) ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ TypeError: KLinearCPUInfer.forward() takes 2 positional arguments but 3 were given -
Atream authored
-
Atream authored
-
Atream authored
-
- 05 Apr, 2025 2 commits
-
-
wang jiahao authored
Update balance-serve.md
-
wang jiahao authored
-
- 04 Apr, 2025 6 commits
-
-
wang jiahao authored
📝 Docs: Clarify CMake version requirement for CUDA dialects -
255 authored
Adds a note explaining that default CMake versions on systems like Ubuntu 22.04 LTS might not support newer CUDA dialects (e.g., CUDA 20), leading to specific build errors. Recommends installing a newer CMake via the Kitware APT repository as a resolution. This helps users troubleshoot errors like: "Target ... requires the language dialect 'CUDA20', but CMake does not know the compile flags..."
-
ZiWei Yuan authored
🔖 release v0.2.4post1 -
ZiWei Yuan authored
-
wang jiahao authored
Fix bug with non-base-multiple chunk_size, update test examples, and …
-
dongjw authored
Fix bug with non-base-multiple chunk_size, update test examples, and resolve issue with writing model_config. Hugging Face URL input is still unsupported.
-
- 03 Apr, 2025 8 commits
-
-
ZiWei Yuan authored
🔧 update config.yaml setting default config -
liam authored
-
Atream authored
Update modeling_deepseek_v3.py
-
Atream authored
-
wang jiahao authored
slove [Bug] #1023
-
wang jiahao authored
文档更新:model_path名字要求以及在示例中添加force_think
-
wangkuigang-yewu-cmss authored
* add documentations about `--model_path` requirements * add `--force_think` in doc (most users would run R1 and would want it to provide reasoning process)
-
Qin's repo authored
Only modified the mixed single and double quotes in server/config/config.py
-