"test/vscode:/vscode.git/clone" did not exist on "5b07c33e393b9d2f59101ddebe14d83494439a46"
Commit 8fb20fc0 authored by Adam Osewski

Change KPack value.

Hardcode for now, as without AK1/BK1 there's no good way to determine
its value.
parent 69bda850
@@ -79,7 +79,10 @@ struct BlockUniversalGemmAsBsCr
// TODO: Should we have two policies? Interwave & Intrawave ??
static constexpr index_t InterWaveSchedulingMacClusters = 1;
-static constexpr index_t KPack = WarpGemm::kKPerThread;
+// should be at least equal to: WarpGemm::Impl::kABKPerLane
+// and the question is how to assess upper limit or exact value?
+// TODO: Should we introduce AK1/BK1 parameters ?
+static constexpr index_t KPack = 8;
static constexpr index_t KPerThread = KIterPerWarp * KPack;
static constexpr index_t KRepeat = KPerThread / KPack;
};
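The lower bound mentioned in the new comment could be turned into a compile-time check. The following is a minimal sketch, not code from this commit: `WarpGemmStub`, its `kABKPerLane` value of 4, and the `KPackTraits` wrapper are hypothetical names and values used only for illustration. Only `KPack = 8`, `KPerThread = KIterPerWarp * KPack`, and `KRepeat = KPerThread / KPack` are taken from the hunk above.

```cpp
#include <cstdint>

using index_t = std::int32_t;

// Hypothetical stand-in for the warp-level GEMM type (assumption).
// The value 4 is illustrative and not taken from the library.
struct WarpGemmStub
{
    struct Impl
    {
        static constexpr index_t kABKPerLane = 4;
    };
};

// Hypothetical wrapper that mirrors the three constants from the hunk.
template <typename WarpGemm, index_t KIterPerWarp>
struct KPackTraits
{
    // Hardcoded, as in the commit.
    static constexpr index_t KPack = 8;

    // Lower bound stated in the commit's comment, enforced at compile time.
    static_assert(KPack >= WarpGemm::Impl::kABKPerLane,
                  "KPack must be at least WarpGemm::Impl::kABKPerLane");

    static constexpr index_t KPerThread = KIterPerWarp * KPack;
    static constexpr index_t KRepeat    = KPerThread / KPack;
};

// With KIterPerWarp = 2: KPerThread = 2 * 8 = 16 and KRepeat = 16 / 8 = 2.
static_assert(KPackTraits<WarpGemmStub, 2>::KPerThread == 16, "KPerThread mismatch");
static_assert(KPackTraits<WarpGemmStub, 2>::KRepeat == 2, "KRepeat mismatch");
```

With these definitions `KRepeat` always equals `KIterPerWarp`, so a hardcoded `KPack` changes only how many K elements each thread handles per repeat. The upper limit the comment asks about is not addressed by this check.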