"include/ck/utility/get_id.hpp" did not exist on "6dfb92bbef33b4caea55f6b4ed7c449927ae771c"
  • Lei Wang's avatar
    [Doc] Update README.md for deepseek_mla on AMD (#389) · e9d4ceda
    Lei Wang authored
    * Update README.md for deepseek_mla: Refine performance comparison details and add acknowledgment section. Adjusted performance metrics for TileLang, highlighting its efficiency over Triton and assembly kernels. Included gratitude to the AMD ROCm team for their contributions.
    
    * Update README.md for deepseek_mla: Clarify performance metrics for TileLang, specifying the range of performance parity with hand-optimized assembly kernels. This adjustment enhances the accuracy of the comparative analysis against Triton implementations.
    e9d4ceda
README.md 11.6 KB