- 31 Mar, 2026 2 commits
- 23 Mar, 2026 1 commit
-
-
lishen authored
-
- 19 Mar, 2026 1 commit
-
-
lijian6 authored
2. Add channels 2 code backup for ep128 support.
Signed-off-by: lijian6 <lijian6@sugon.com>
-
- 06 Mar, 2026 1 commit
-
-
lishen authored
-
- 10 Feb, 2026 1 commit
-
-
lishen authored
-
- 09 Feb, 2026 1 commit
-
-
lishen authored
-
- 04 Feb, 2026 4 commits
- 02 Feb, 2026 1 commit
-
-
lishen authored
-
- 23 Jan, 2026 2 commits
-
-
lishen authored
Signed-off-by: lishen <lishen@sugon.com>
-
lishen authored
-
- 15 Jan, 2026 1 commit
-
-
lijian6 authored
Signed-off-by: lijian <lijian6@sugon.com>
-
- 30 Dec, 2025 2 commits
- 26 Dec, 2025 1 commit
-
-
lishen authored
-
- 23 Dec, 2025 2 commits
- 15 Dec, 2025 1 commit
-
-
lishen authored
-
- 12 Dec, 2025 1 commit
-
-
lijian6 authored
Signed-off-by: lijian <lijian6@sugon.com>
-
- 02 Dec, 2025 2 commits
-
-
lijian6 authored
Signed-off-by: lijian <lijian6@sugon.com>
-
lijian6 authored
2. Fix rocshmem internode hang on 508.
Signed-off-by: lijian <lijian6@sugon.com>
-
- 25 Nov, 2025 1 commit
-
-
lishen authored
-
- 14 Nov, 2025 1 commit
-
-
lishen authored
-
- 07 Nov, 2025 1 commit
-
-
lishen authored
-
- 06 Nov, 2025 1 commit
-
-
lijian6 authored
2. Add internode ll mode.
3. Add test internode ll mode.
Signed-off-by: lijian <lijian6@sugon.com>
-
- 05 Nov, 2025 1 commit
-
-
lishen authored
-
- 03 Nov, 2025 1 commit
-
-
lishen authored
-
- 30 Oct, 2025 1 commit
-
-
lishen authored
-
- 21 Oct, 2025 2 commits
-
-
lijian6 authored
Signed-off-by: lijian <lijian6@sugon.com>
-
lijian6 authored
backup pass way.
Signed-off-by: lishen <lishen@sugon.com>
-
- 20 Oct, 2025 2 commits
-
-
lijian6 authored
Signed-off-by: lijian <lijian6@sugon.com>
-
lijian6 authored
Signed-off-by: lijian <lijian6@sugon.com>
-
- 17 Oct, 2025 1 commit
-
-
lijian6 authored
Signed-off-by: lijian <lijian6@sugon.com>
-
- 24 Sep, 2025 1 commit
-
-
Tailing Yuan authored
Co-authored-by: Yifei Zhang <219273404+yifeizhang-c@users.noreply.github.com>
-
- 22 Sep, 2025 1 commit
-
-
Shangyan Zhou authored
-
- 17 Sep, 2025 1 commit
-
-
Shangyan Zhou authored
* Fix hidden_size % 128 != 0
* Add `align_down()` function
* Use the full warp to wait TMA store
* Support arbitrary hidden sizes in fp8 cast
* lint
-
- 16 Sep, 2025 1 commit
-
-
Chenggang Zhao authored
* Remove redundant TMA flushes
* Less barrier initialization overhead
* Simplify `elect_one_sync`
* Use `elect_one_sync` instead of lanes
* Minor fix
* Polish testing prints
* Refactor for internode kernels
* Better performance
-