Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in / Register
Toggle navigation
Menu
Open sidebar
OpenDAS
ollama
Commits
e835ef18
Commit
e835ef18
authored
Jun 21, 2024
by
Michael Yang
Browse files
fix: quantization with template
parent
c7c2f3bc
Changes
1
Hide whitespace changes
Inline
Side-by-side
Showing
1 changed file
with
10 additions
and
5 deletions
+10
-5
server/images.go
server/images.go
+10
-5
No files found.
server/images.go
View file @
e835ef18
...
...
@@ -414,17 +414,22 @@ func CreateModel(ctx context.Context, name model.Name, modelFileDir, quantizatio
return
err
}
layer
s
,
err
:=
parseFromFile
(
ctx
,
temp
,
""
,
fn
)
layer
,
err
:=
NewLayer
(
temp
,
baseLayer
.
MediaType
)
if
err
!=
nil
{
return
err
}
if
len
(
layers
)
!=
1
{
return
errors
.
New
(
"quantization failed"
)
if
_
,
err
:=
temp
.
Seek
(
0
,
io
.
SeekStart
);
err
!=
nil
{
return
err
}
ggml
,
_
,
err
:=
llm
.
DecodeGGML
(
temp
)
if
err
!=
nil
{
return
err
}
baseLayer
.
Layer
=
layer
s
[
0
]
.
Layer
baseLayer
.
GGML
=
layers
[
0
]
.
GGML
baseLayer
.
Layer
=
layer
baseLayer
.
GGML
=
ggml
}
}
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment