"wrappers/vscode:/vscode.git/clone" did not exist on "470c9b7f90cc284a85c9ffabb29f73840b7ee8a1"
  1. 27 Jun, 2025 3 commits
  2. 26 Jun, 2025 1 commit
  3. 25 Jun, 2025 2 commits
  4. 24 Jun, 2025 3 commits
  5. 23 Jun, 2025 2 commits
  6. 20 Jun, 2025 1 commit
  7. 18 Jun, 2025 1 commit
  8. 16 Jun, 2025 4 commits
  9. 13 Jun, 2025 2 commits
  10. 12 Jun, 2025 1 commit
  11. 11 Jun, 2025 2 commits
  12. 10 Jun, 2025 1 commit
  13. 09 Jun, 2025 4 commits
  14. 06 Jun, 2025 1 commit
    • Chenggang Zhao's avatar
      Use TMA instead of LD/ST for intra-node normal kernels (#191) · c8dceba1
      Chenggang Zhao authored
      * Update CMake files
      
      * Use TMA instead of LD/ST for intranode dispatch
      
      * Use TMA instead of LD/ST for intranode combine
      
      * Adjust configs
      
      * Test default configs as well
      
      * More warps for combine
      
      * Add inter-thread fence
      
      * Enable more warps
      
      * Do not use TMA for senders
      
      * Update configs
      
      * Remove useless wait
      c8dceba1
  15. 03 Jun, 2025 1 commit
  16. 28 May, 2025 1 commit
  17. 23 May, 2025 2 commits
  18. 12 May, 2025 1 commit
  19. 10 May, 2025 1 commit
  20. 08 May, 2025 1 commit
  21. 29 Apr, 2025 1 commit
  22. 22 Apr, 2025 4 commits