-
Shijie authored
* use separate qkv Signed-off-by:
jaywan <jaywan@nvidia.com> * add support for GQA Signed-off-by:
jaywan <jaywan@nvidia.com> * minor changes Signed-off-by:
Shijie Wang <jaywan@nvidia.com> * change rtol Signed-off-by:
Shijie Wang <jaywan@nvidia.com> * fix reshape issue Signed-off-by:
Shijie Wang <jaywan@nvidia.com> --------- Signed-off-by:
jaywan <jaywan@nvidia.com> Signed-off-by:
Shijie Wang <jaywan@nvidia.com>
bd7fd0a6