Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in / Register
Toggle navigation
Menu
Open sidebar
OpenDAS
Lmdeploy
Commits
40e07381
Commit
40e07381
authored
Nov 18, 2023
by
xiabo
Browse files
Adapt to rocm FT的修改补充
parent
ab8c95cb
Changes
1
Show whitespace changes
Inline
Side-by-side
Showing
1 changed file
with
2 additions
and
1 deletion
+2
-1
src/turbomind/triton_backend/llama/LlamaTritonModel.cc
src/turbomind/triton_backend/llama/LlamaTritonModel.cc
+2
-1
No files found.
src/turbomind/triton_backend/llama/LlamaTritonModel.cc
View file @
40e07381
...
@@ -209,7 +209,8 @@ std::unique_ptr<LlamaTritonSharedModelInstance<T>> LlamaTritonModel<T>::createSh
...
@@ -209,7 +209,8 @@ std::unique_ptr<LlamaTritonSharedModelInstance<T>> LlamaTritonModel<T>::createSh
ft
::
check_cuda_error
(
cudaGetDeviceProperties
(
cuda_device_prop_ptr
.
get
(),
device_id
));
ft
::
check_cuda_error
(
cudaGetDeviceProperties
(
cuda_device_prop_ptr
.
get
(),
device_id
));
if
(
std
::
is_same
<
T
,
half
>::
value
)
{
if
(
std
::
is_same
<
T
,
half
>::
value
)
{
cublas_wrapper
->
setGemmConfig
(
CUDA_R_16F
,
CUDA_R_16F
,
CUDA_R_16F
,
CUDA_R_32F
);
// cublas_wrapper->setGemmConfig(CUDA_R_16F, CUDA_R_16F, CUDA_R_16F, CUDA_R_32F);
cublas_wrapper
->
setGemmConfig
(
CUDA_R_16F
,
CUDA_R_16F
,
CUDA_R_16F
,
CUDA_R_16F
);
}
}
else
if
(
std
::
is_same
<
T
,
float
>::
value
)
{
else
if
(
std
::
is_same
<
T
,
float
>::
value
)
{
cublas_wrapper
->
setFP32GemmConfig
();
cublas_wrapper
->
setFP32GemmConfig
();
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment