Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in / Register
Toggle navigation
Menu
Open sidebar
OpenDAS
ollama
Commits
37708931
Commit
37708931
authored
Jan 09, 2024
by
Jeffrey Morgan
Browse files
update cuda overhead to 20% to fix crashes when switching between models and large context sizes
parent
f6cb0a55
Changes
1
Hide whitespace changes
Inline
Side-by-side
Showing
1 changed file
with
2 additions
and
8 deletions
+2
-8
gpu/gpu.go
gpu/gpu.go
+2
-8
No files found.
gpu/gpu.go
View file @
37708931
...
@@ -131,14 +131,8 @@ func getCPUMem() (memInfo, error) {
...
@@ -131,14 +131,8 @@ func getCPUMem() (memInfo, error) {
func
CheckVRAM
()
(
int64
,
error
)
{
func
CheckVRAM
()
(
int64
,
error
)
{
gpuInfo
:=
GetGPUInfo
()
gpuInfo
:=
GetGPUInfo
()
if
gpuInfo
.
FreeMemory
>
0
&&
(
gpuInfo
.
Library
==
"cuda"
||
gpuInfo
.
Library
==
"rocm"
)
{
if
gpuInfo
.
FreeMemory
>
0
&&
(
gpuInfo
.
Library
==
"cuda"
||
gpuInfo
.
Library
==
"rocm"
)
{
// leave 15% or 400MiB of VRAM free for overhead
// leave 20% of VRAM free for overhead
overhead
:=
gpuInfo
.
FreeMemory
*
3
/
20
return
int64
(
gpuInfo
.
FreeMemory
*
4
/
5
),
nil
minOverhead
:=
uint64
(
400
*
1024
*
1024
)
if
overhead
<
minOverhead
{
overhead
=
minOverhead
}
return
int64
(
gpuInfo
.
FreeMemory
-
overhead
),
nil
}
}
return
0
,
fmt
.
Errorf
(
"no GPU detected"
)
// TODO - better handling of CPU based memory determiniation
return
0
,
fmt
.
Errorf
(
"no GPU detected"
)
// TODO - better handling of CPU based memory determiniation
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment