Commits · 6bd0a983cd2cf74f27df2e5a5c80f1794a2ed7ef · OpenDAS / ollama

03 Apr, 2025 1 commit

model: support for mistral-small in the ollama runner · 6bd0a983

Bruce MacDonald authored Mar 14, 2025

Mistral is a popular research lab making open source models. This updates
the forward pass of llama architecture models to support both llama models
and mistral models by accounting for additional metadata present in mistral
models, and finding the correct dimensions for the output projection.

6bd0a983

02 Apr, 2025 1 commit

chore(all): replace instances of interface with any (#10067) · 9876c9fa

Bruce MacDonald authored Apr 02, 2025

Both interface{} and any (which is just an alias for interface{} introduced in Go 1.18) represent the empty interface that all types satisfy.

9876c9fa

18 Mar, 2025 1 commit
- convert: return name of unsupported architecture (#9862) · 61a88252
  Bruce MacDonald authored Mar 18, 2025
```
When a model's architecture cannot be converted return the name of the unsupported arch in the error message.
```
  61a88252
13 Mar, 2025 1 commit
- fix: change default context size for gemma3 (#9744) · 80c7ce38
  Patrick Devine authored Mar 13, 2025
  
  80c7ce38
11 Mar, 2025 12 commits
- all: address linter errors · 83f0ec82
  jmorganca authored Mar 11, 2025
  
  83f0ec82
- use 2d pooling · 63a39406
  Michael Yang authored Mar 11, 2025
  
  63a39406
- fix gemma3 1b conversion · 2e54d72f
  Patrick Devine authored Mar 10, 2025
  
  2e54d72f
- compat with upstream gguf · 6b32a2d5
  Michael Yang authored Mar 10, 2025
  
  6b32a2d5
- skip repacking vision tensors · d368c039
  Michael Yang authored Mar 09, 2025
  
  d368c039
- fix configs · 9b54267e
  Patrick Devine authored Mar 08, 2025
  
  9b54267e
- update model · 46bb0169
  Michael Yang authored Mar 08, 2025
  
  46bb0169
- fix conversion · c62861f4
  Patrick Devine authored Mar 07, 2025
  
  c62861f4
- set non-causal attention · 0df18004
  Michael Yang authored Mar 07, 2025
  
  0df18004
- temporary work around for converting spm · 631fecc6
  Patrick Devine authored Mar 07, 2025
  
  631fecc6
- add gemma vision encoder · 4b037a97
  Michael Yang authored Mar 06, 2025
  
  4b037a97
- gemma2 impl · 5f74d1fd
  Patrick Devine authored Feb 07, 2025
  
  5f74d1fd
14 Feb, 2025 1 commit

next ollama runner (#7913) · 58245413

Michael Yang authored Feb 14, 2025



feat: add new Ollama engine using ggml through cgo

This change introduces a new way to run pretrained models. It introduces 3 high level interfaces and a bunch of smaller helper interfaces to facilitate this.

- `model.Model` defines the interface for a model architecture. Models such as `llama` and `mllama`, which are provided as examples, can implement the model's forward propagation in the `Forward` method. This method will be called to generate completions. This interface can be found in `model/model.go`
- `ml.Backend` defines the interface for a backend tensor library, in this case `ggml`. Among other things, a Backend is responsible for loading a pretrained model into hardware (GPU, CPU, etc) and providing an interface for Models to access loaded tensors. This interface can be found in `ml/backend.go`
- `ml.Tensor` defines the interface for a tensor and tensor operations

This is the first implementation of the new engine. Follow up PRs will implement more features:

- non-greedy sampling (#8410)
- integration with Ollama and KV caching (#8301)
- more model support (#9080) with more coming soon
Co-authored-by: Bruce MacDonald <brucewmacdonald@gmail.com>

58245413

16 Jan, 2025 1 commit
- convert: import support for command-r models from safetensors (#6063) · 93a8daf2
  Josh authored Jan 15, 2025
```
---------
Co-authored-by: Patrick Devine <patrick@infrahq.com>
```
  93a8daf2
14 Jan, 2025 1 commit

convert: qwen2 from safetensors (#8408) · f6f37130

Bruce MacDonald authored Jan 14, 2025

Add native support for converting Qwen2 family models (including Qwen2.5)
from safetensors to gguf format so we can run it.

f6f37130

10 Dec, 2024 1 commit
- all: fix typos in documentation, code, and comments (#7021) · abfdc471
  Stefan Weil authored Dec 10, 2024
  
  abfdc471
04 Dec, 2024 1 commit
- fix unmarshaling merges · 44560129
  Michael Yang authored Dec 04, 2024
  
  44560129
18 Oct, 2024 1 commit

image processing for llama3.2 (#6963) · c7cb0f06

Patrick Devine authored Oct 18, 2024


Co-authored-by: jmorganca <jmorganca@gmail.com>
Co-authored-by: Michael Yang <mxyng@pm.me>
Co-authored-by: Jesse Gross <jesse@ollama.com>

c7cb0f06

10 Sep, 2024 1 commit
- catch when model vocab size is set correctly (#6714) · 84b84ce2
  Patrick Devine authored Sep 09, 2024
  
  84b84ce2
06 Sep, 2024 1 commit
- Fix gemma2 2b conversion (#6645) · 608e87bf
  Patrick Devine authored Sep 05, 2024
  
  608e87bf
28 Aug, 2024 1 commit
- throw an error when encountering unsupport tensor sizes (#6538) · 6c1c1ad6
  Patrick Devine authored Aug 27, 2024
  
  6c1c1ad6
27 Aug, 2024 3 commits
- more tokenizer tests · 60e47573
  Michael Yang authored Aug 27, 2024
  
  60e47573
- clean up convert tokenizer · eae3af68
  Michael Yang authored Aug 27, 2024
  
  eae3af68
- detect chat template from configs that contain lists · 3eb08377
  Michael Yang authored Aug 26, 2024
  
  3eb08377
23 Aug, 2024 1 commit
- convert safetensor adapters into GGUF (#6327) · 0c819e16
  Patrick Devine authored Aug 23, 2024
  
  0c819e16
21 Aug, 2024 3 commits
- llama3.1 · 77903ab8
  Michael Yang authored Jul 29, 2024
  
  77903ab8
- convert gemma2 · 3546bbd0
  Michael Yang authored Jun 28, 2024
  
  3546bbd0
- bert · 5a28b9cf
  Michael Yang authored Jun 06, 2024
  
  5a28b9cf
12 Aug, 2024 2 commits
- support new "longrope" attention factor · aec77d6a
  Bruce MacDonald authored Jul 02, 2024
  
  aec77d6a
- add conversion for microsoft phi 3 mini/medium 4k, 128 · 6ffb5cb0
  Michael Yang authored Jun 03, 2024
  
  6ffb5cb0
02 Aug, 2024 1 commit
- lint · b732beba
  Michael Yang authored Aug 01, 2024
  
  b732beba
31 Jul, 2024 5 commits
- convert: fix parse functions · d8e2664c
  Michael Yang authored Jul 31, 2024
  
  d8e2664c
- convert: only extract large files · eafc607a
  Michael Yang authored Jun 29, 2024
  
  eafc607a
- Update convert/reader_safetensors.go · 781fc2d5
  Michael Yang authored Jul 31, 2024
```
Co-authored-by: Jeffrey Morgan <jmorganca@gmail.com>
```
  781fc2d5
- comments · df993fa3
  Michael Yang authored Jul 08, 2024
  
  df993fa3
- refactor convert · 5e9db9fb
  Michael Yang authored May 31, 2024
  
  5e9db9fb