- 25 Feb, 2025 23 commits
-
-
Azure authored
[update] Update doc.
-
Azure authored
-
ZiWei Yuan authored
⚡ release v0.2.2rc1 -
liam authored
-
Azure authored
[release] Release 0.2.2rc.
-
Azure authored
[update] Update readme.
-
Azure authored
Update README.md
-
Atream authored
-
Atream authored
-
Azure authored
-
Atream authored
-
Atream authored
-
Azure authored
-
ZiWei Yuan authored
📝 add benchmark.md -
liam authored
-
ZiWei Yuan authored
⚡ update git ignore add docker dev container -
liam authored
-
Azure authored
-
Azure authored
-
Atream authored
Feat absorb for long prefill
-
Atream authored
-
Azure authored
[feat] Support fp8 linear kernel;
-
Azure authored
-
- 24 Feb, 2025 9 commits
-
-
Azure authored
-
Atream authored
musa: support bf16
-
Atream authored
Ensure backward compatibility with PyTorch 2.2
-
Xiaodong Ye authored
Signed-off-by:Xiaodong Ye <xiaodong.ye@mthreads.com>
-
Azure authored
-
Azure authored
-
Atream authored
-
Atream authored
fix KExpertsMarlin on GPU with out CUDA Graph
-
Atream authored
-
- 23 Feb, 2025 8 commits
-
-
Atream authored
support moonlight, use ktransformers/optimize/optimize_rules/Moonlight-16B-A3B.yaml
-
Atream authored
-
Atream authored
-
DDong Jianwei authored
-
Atream authored
-
Atream authored
fix bf16 load, TODO: refactor cpu dequant
-
Atream authored
-
Xiaodong Ye authored
Signed-off-by:Xiaodong Ye <xiaodong.ye@mthreads.com>
-