1. 04 Oct, 2024 1 commit
  2. 02 Aug, 2024 1 commit
  3. 31 Jul, 2024 1 commit
  4. 30 Jul, 2024 1 commit
  5. 21 Jul, 2024 1 commit
  6. 18 Jun, 2024 1 commit
  7. 14 Jun, 2024 1 commit
  8. 09 Jun, 2024 1 commit
  9. 31 May, 2024 2 commits
  10. 23 May, 2024 1 commit
  11. 22 May, 2024 1 commit
  12. 16 May, 2024 1 commit
  13. 24 Apr, 2024 1 commit
    • [Bugfix] Fix marlin kernel crash on H100 (#4218) · aae08249
      alexm-nm authored
      This PR fixes the Marlin kernel crash on H100 that was reported in neuralmagic#187.
      The crash was caused by the inline PTX assembly that issued async_copy with streaming behavior. The fix is to use the more standard PTX for async_copy, without the fractional L2 "evict_first" cache policy. There is no performance difference between the standard async_copy PTX and the previous version.
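      The commit message describes the change but does not show it. As a rough illustration only, a plain 16-byte async copy in inline PTX, with no L2 cache-policy operand, might look like the sketch below. The names `cp_async16`, `cp_async_commit_and_wait_all`, and `copy_kernel` are hypothetical and are not taken from the PR; the sketch assumes a GPU of compute capability 8.0 or newer and a build such as `nvcc -arch=sm_80`.

      ```cuda
      #include <cstdint>
      #include <cstdio>
      #include <cuda_runtime.h>

      // Hypothetical helper: plain 16-byte asynchronous global-to-shared copy.
      // No createpolicy / L2::cache_hint operand is used, i.e. no fractional
      // "evict_first" policy of the kind the commit message says was removed.
      __device__ inline void cp_async16(void* smem_ptr, const void* glob_ptr) {
        const int BYTES = 16;
        // cp.async expects a 32-bit address in the shared state space.
        uint32_t smem = static_cast<uint32_t>(__cvta_generic_to_shared(smem_ptr));
        asm volatile("cp.async.cg.shared.global [%0], [%1], %2;\n"
                     :: "r"(smem), "l"(glob_ptr), "n"(BYTES)
                     : "memory");
      }

      // Hypothetical helper: commit the outstanding copies and wait for all of them.
      __device__ inline void cp_async_commit_and_wait_all() {
        asm volatile("cp.async.commit_group;\n" ::: "memory");
        asm volatile("cp.async.wait_group 0;\n" ::: "memory");
      }

      // Each thread stages one int4 (16 bytes) through shared memory.
      __global__ void copy_kernel(const int4* in, int4* out) {
        __shared__ int4 tile[32];
        cp_async16(&tile[threadIdx.x], &in[threadIdx.x]);
        cp_async_commit_and_wait_all();
        __syncthreads();
        out[threadIdx.x] = tile[threadIdx.x];
      }

      int main() {
        const int n = 32;
        int4 h_in[n], h_out[n];
        for (int i = 0; i < n; ++i) h_in[i] = make_int4(i, i + 1, i + 2, i + 3);

        int4 *d_in = nullptr, *d_out = nullptr;
        cudaMalloc(&d_in, sizeof(h_in));
        cudaMalloc(&d_out, sizeof(h_out));
        cudaMemcpy(d_in, h_in, sizeof(h_in), cudaMemcpyHostToDevice);

        copy_kernel<<<1, n>>>(d_in, d_out);
        // The blocking memcpy also surfaces any error from the kernel launch.
        cudaError_t err =
            cudaMemcpy(h_out, d_out, sizeof(h_out), cudaMemcpyDeviceToHost);

        bool ok = (err == cudaSuccess);
        for (int i = 0; ok && i < n; ++i) {
          ok = h_out[i].x == h_in[i].x && h_out[i].y == h_in[i].y &&
               h_out[i].z == h_in[i].z && h_out[i].w == h_in[i].w;
        }
        printf(ok ? "copy ok\n" : "copy failed\n");

        cudaFree(d_in);
        cudaFree(d_out);
        return ok ? 0 : 1;
      }
      ```

      In PTX, the streaming variant would additionally build a policy register with `createpolicy.fractional.L2::evict_first.b64` and pass it to `cp.async` through the `.L2::cache_hint` qualifier. Whether the removed code had exactly that shape is an assumption based on the commit message.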
  14. 01 Mar, 2024 1 commit