Unverified Commit f5ec04f0 authored by zjing14's avatar zjing14 Committed by GitHub
Browse files

Grouped Gemm with Fixed K and N with SplitK (#818)



* move all arguments into device

* add b2c_tile_map

* add examples

* add SetDeviceKernelArgs

* dedicated fixed_nk solution

* init client api

* add grouped_gemm_bias example

* add a instance

* add instances

* formatting

* fixed cmake

* Update EnableCompilerWarnings.cmake

* Update cmake-ck-dev.sh

* clean; fixed comments

* fixed comment

* add instances for fp32 output

* add instances for fp32 output

* add fp32 out client example

* fixed CI

* init commit for kbatch

* add splitk gridwise

* format

* fixed

* clean deviceop

* clean code

* finish splitk

* fixed instances

* change m_loops to tile_loops

* add setkbatch

* clean code

* add splitK+bias

* add instances

* opt mk_nk instances

* clean examples

* fixed CI

* remove zero

* finished non-zero

* clean

* clean code

* optimized global_barrier

* fixed ci

* fixed CI

* removed AddBias

* format

* fixed CI

* fixed CI

* move 20_grouped_gemm to 21_grouped_gemm

---------
Co-authored-by: default avatarJing Zhang <jizha@amd.com>
parent 866377de
...@@ -37,6 +37,11 @@ void DeviceMem::ToDevice(const void* p) const ...@@ -37,6 +37,11 @@ void DeviceMem::ToDevice(const void* p) const
} }
} }
void DeviceMem::ToDevice(const void* p, const std::size_t cpySize) const
{
hip_check_error(hipMemcpy(mpDeviceBuf, const_cast<void*>(p), cpySize, hipMemcpyHostToDevice));
}
void DeviceMem::FromDevice(void* p) const void DeviceMem::FromDevice(void* p) const
{ {
if(mpDeviceBuf) if(mpDeviceBuf)
...@@ -49,6 +54,11 @@ void DeviceMem::FromDevice(void* p) const ...@@ -49,6 +54,11 @@ void DeviceMem::FromDevice(void* p) const
} }
} }
void DeviceMem::FromDevice(void* p, const std::size_t cpySize) const
{
hip_check_error(hipMemcpy(p, mpDeviceBuf, cpySize, hipMemcpyDeviceToHost));
}
void DeviceMem::SetZero() const void DeviceMem::SetZero() const
{ {
if(mpDeviceBuf) if(mpDeviceBuf)
......
...@@ -16,4 +16,3 @@ cmake ...@@ -16,4 +16,3 @@ cmake
-D CMAKE_VERBOSE_MAKEFILE:BOOL=ON \ -D CMAKE_VERBOSE_MAKEFILE:BOOL=ON \
-D USE_BITINT_EXTENSION_INT4=OFF \ -D USE_BITINT_EXTENSION_INT4=OFF \
${MY_PROJECT_SOURCE} ${MY_PROJECT_SOURCE}
Markdown is supported
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment