"examples/pytorch/vscode:/vscode.git/clone" did not exist on "5e64481be37a28294041acb3793173a4f94aaf46"
  • Adam Osewski's avatar
    Multiple fixes to GroupedGemm+SplitK (#707) · 70e4eb56
    Adam Osewski authored
    
    
    * Add license header.
    
    * Reduce number of logged output. Add constant initialization.
    
    * Add functional tests for grouped_gemm with different kbatch value.
    
    * Add debug log informations + remove unused code.
    
    * Don't pass kbatch to CalculateKPadded.
    
    * Turn on logging in grouped gemm and gemm splitk profiler
    
    * Debug: limit number of test cases to run;
    
    * Log more information and initialize with constant value.
    
    * Turn on DEBUG_LOG
    
    * Add more debug log informations.
    
    * Limit the number of instances to compile.
    
    * Use GridwiseGemmPipeline
    
    * Use KBatch to calculate K0
    
    * Multiple DebugLog messages.
    
    * Unit tests for multiple KBatch values.
    
    * Refactoring
    
    * Disable logging
    * extract out of if statement KBatch update.
    
    * Uncomment instances.
    
    * Disable DebugLog.
    
    * Use Kbatch when calculate KPadded.
    
    * Fix CGridDesc padding.
    
    * Use available helper functions.
    
    * Uncomment code commented for debuggin.
    
    * Remove unnecessary debug log messages.
    
    * Uncomment previously commented code for debug purposes.
    
    * Add KBatch info to profiler output summary log.
    
    * Add gtests for gemm splitk using ckProfiler API.
    
    * Add more test-cases for different data layout.
    
    * Add more test cases for gemm splitk
    
    * Remove old test.
    
    * Unit tests for MKNK ggemm interface.
    
    * Fix and add more unit-tests.
    
    * Constepxr everything!
    
    * Increase error threshold for fp16 and splitk.
    
    Since we're using fp16 atomic add for splitk there's a
    known precision loss.
    
    ---------
    Co-authored-by: default avatarAdam Osewski <aosewski@amd.com>
    Co-authored-by: default avatarzjing14 <zhangjing14@gmail.com>
    70e4eb56
test_gemm_splitk.cpp 1.69 KB