- 08 Apr, 2025 1 commit
-
-
Atream authored
-
- 07 Apr, 2025 10 commits
-
-
wang jiahao authored
Update balance-serve.md
-
wang jiahao authored
-
Azure authored
Fix TypeError when invoke KLinearCPUInfer.forward()
-
wang jiahao authored
fix temperature=0, flashinfer sample error
-
dongjw authored
-
Atream authored
Fix compile
-
Aubrey Li authored
Fix the following error: File "/home/aubrey/work/ktransformers/ktransformers/operators/linear.py", line 825, in forward y = self.generate_linear.forward(x, bsz_tensor) ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ TypeError: KLinearCPUInfer.forward() takes 2 positional arguments but 3 were given -
Atream authored
-
Atream authored
-
Atream authored
-
- 05 Apr, 2025 2 commits
-
-
wang jiahao authored
Update balance-serve.md
-
wang jiahao authored
-
- 04 Apr, 2025 6 commits
-
-
wang jiahao authored
📝 Docs: Clarify CMake version requirement for CUDA dialects -
255 authored
Adds a note explaining that default CMake versions on systems like Ubuntu 22.04 LTS might not support newer CUDA dialects (e.g., CUDA 20), leading to specific build errors. Recommends installing a newer CMake via the Kitware APT repository as a resolution. This helps users troubleshoot errors like: "Target ... requires the language dialect 'CUDA20', but CMake does not know the compile flags..."
-
ZiWei Yuan authored
🔖 release v0.2.4post1 -
ZiWei Yuan authored
-
wang jiahao authored
Fix bug with non-base-multiple chunk_size, update test examples, and …
-
dongjw authored
Fix bug with non-base-multiple chunk_size, update test examples, and resolve issue with writing model_config. Hugging Face URL input is still unsupported.
-
- 03 Apr, 2025 15 commits
-
-
ZiWei Yuan authored
🔧 update config.yaml setting default config -
liam authored
-
Atream authored
Update modeling_deepseek_v3.py
-
Atream authored
-
wang jiahao authored
slove [Bug] #1023
-
wang jiahao authored
文档更新:model_path名字要求以及在示例中添加force_think
-
wangkuigang-yewu-cmss authored
* add documentations about `--model_path` requirements * add `--force_think` in doc (most users would run R1 and would want it to provide reasoning process)
-
Qin's repo authored
Only modified the mixed single and double quotes in server/config/config.py
-
wang jiahao authored
fix local_chat bug and update doc
-
dongjw authored
-
dongjw authored
-
ZiWei Yuan authored
Update SUMMARY.md
-
ZiWei Yuan authored
-
wang jiahao authored
delete sudo install
-
dongjw authored
-
- 02 Apr, 2025 6 commits
-
-
Atream authored
yaml: fix Marlin AssertionError
-
Atream authored
fix: refine backend error message to include ROCM_HOME
-
wang jiahao authored
Update balance-serve.md
-
wang jiahao authored
-
wang jiahao authored
In v0.2.4 version, we’ve added highly desired multi-concurrency support to the community through a major refactor of the whole architecture.
-
dongjw authored
-