Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in / Register
Toggle navigation
Menu
Open sidebar
OpenDAS
ollama
Commits
8f440d57
Commit
8f440d57
authored
May 24, 2024
by
Michael Yang
Browse files
fix q5_0, q5_1
parent
4cc3be30
Changes
1
Hide whitespace changes
Inline
Side-by-side
Showing
1 changed file
with
1 addition
and
1 deletion
+1
-1
llm/ggml.go
llm/ggml.go
+1
-1
No files found.
llm/ggml.go
View file @
8f440d57
...
@@ -127,7 +127,7 @@ func (t Tensor) blockSize() uint64 {
...
@@ -127,7 +127,7 @@ func (t Tensor) blockSize() uint64 {
switch
t
.
Kind
{
switch
t
.
Kind
{
case
0
,
1
,
24
,
25
,
26
,
27
,
28
,
31
:
// F32, F16, I8, I16, I32, I64, F64, BF16
case
0
,
1
,
24
,
25
,
26
,
27
,
28
,
31
:
// F32, F16, I8, I16, I32, I64, F64, BF16
return
1
return
1
case
2
,
3
,
8
,
9
,
20
:
// Q4_0, Q4_1, Q8_0, Q8_1, IQ4_NL
case
2
,
3
,
4
,
5
,
6
,
7
,
8
,
9
,
20
:
// Q4_0, Q4_1,
Q5_0, Q5_1,
Q8_0, Q8_1, IQ4_NL
return
32
return
32
default
:
// All others
default
:
// All others
return
256
return
256
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment