"docs/vscode:/vscode.git/clone" did not exist on "4cefa9b49b6cb2be6d7eac88315df65e0f0d8c9a"
Add RMSNorm (#45)
* Add rmsnorm kernels Signed-off-by:Reese Wang <rewang@nvidia.com> * Add rmsnorm cpp unit test Signed-off-by:
Reese Wang <rewang@nvidia.com> * Apply new Tensor struct Signed-off-by:
Reese Wang <rewang@nvidia.com> * Move scale/scale_inv/amax into the TE Tensor struct Signed-off-by:
Reese Wang <rewang@nvidia.com> * Add document Signed-off-by:
Reese Wang <rewang@nvidia.com> * Separate rmsnorm kernels from the layernorm Signed-off-by:
Reese Wang <rewang@nvidia.com> * fix indent Signed-off-by:
Reese Wang <rewang@nvidia.com> * Update rmsnorm test cases Signed-off-by:
Reese Wang <rewang@nvidia.com> * Update copyright year Signed-off-by:
Reese Wang <rewang@nvidia.com> * Fix the support matrix on the document Signed-off-by:
Reese Wang <rewang@nvidia.com> * Move register macro out of utils.cuh Signed-off-by:
Reese Wang <rewang@nvidia.com> Signed-off-by:
Reese Wang <rewang@nvidia.com>
Showing
Please register or sign in to comment