- 09 Apr, 2025 17 commits
-
-
Atream authored
-
Atream authored
Update llama4.md
-
Atream authored
-
Atream authored
Update llama4.md
-
Atream authored
-
Atream authored
Update llama4.md
-
Atream authored
-
Jianwei Dong authored
update llama4 tutorial
-
djw authored
-
Jianwei Dong authored
update llama4 tutorial
-
djw authored
-
Jianwei Dong authored
update llama4 tutorial
-
djw authored
-
Azure authored
balance_serve: Add g++ to compiler list
-
Aubrey Li authored
In some OS distributions, g++ exists in the following form: # ls -l /usr/bin/g++* -rwxr-xr-x 4 root root 985784 Dec 9 12:51 /usr/bin/g++ So make sure to add g++ to the compiler list as well.
-
Atream authored
Update attention.py
-
Atream authored
-
- 08 Apr, 2025 6 commits
-
-
wang jiahao authored
Update balance-serve.md
-
wang jiahao authored
-
wang jiahao authored
format kvc2, delete quant_configs, move model_configs to ~/.ktransfor…
-
qiyuxinlin authored
-
Atream authored
fix compile, add abi check to setup.py
-
Atream authored
-
- 07 Apr, 2025 10 commits
-
-
wang jiahao authored
Update balance-serve.md
-
wang jiahao authored
-
Azure authored
Fix TypeError when invoke KLinearCPUInfer.forward()
-
wang jiahao authored
fix temperature=0, flashinfer sample error
-
dongjw authored
-
Atream authored
Fix compile
-
Aubrey Li authored
Fix the following error: File "/home/aubrey/work/ktransformers/ktransformers/operators/linear.py", line 825, in forward y = self.generate_linear.forward(x, bsz_tensor) ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ TypeError: KLinearCPUInfer.forward() takes 2 positional arguments but 3 were given -
Atream authored
-
Atream authored
-
Atream authored
-
- 05 Apr, 2025 2 commits
-
-
wang jiahao authored
Update balance-serve.md
-
wang jiahao authored
-
- 04 Apr, 2025 5 commits
-
-
wang jiahao authored
📝 Docs: Clarify CMake version requirement for CUDA dialects -
255 authored
Adds a note explaining that default CMake versions on systems like Ubuntu 22.04 LTS might not support newer CUDA dialects (e.g., CUDA 20), leading to specific build errors. Recommends installing a newer CMake via the Kitware APT repository as a resolution. This helps users troubleshoot errors like: "Target ... requires the language dialect 'CUDA20', but CMake does not know the compile flags..."
-
ZiWei Yuan authored
🔖 release v0.2.4post1 -
ZiWei Yuan authored
-
wang jiahao authored
Fix bug with non-base-multiple chunk_size, update test examples, and …
-