Commits · 716e36561530ce5d3f9fdc75d13cb95b37c87088 · OpenDAS / ollama

18 Feb, 2025 1 commit
- test: add test cases for HumanNumber (#9108) · 716e3656
  L. Jiang authored Feb 19, 2025
  
06 Feb, 2025 1 commit
- format: rename test file from byte_test.go to bytes_test.go (#8865) · 32285a6d
  Azis Alvriyanto authored Feb 07, 2025
  
05 Feb, 2025 1 commit

format: byte formatting test coverage (#8692) · 8d8b9f83

Azis Alvriyanto authored Feb 06, 2025

Removed redundant checks and streamlined the switch-case structure.
Added test cases for both HumanBytes and HumanBytes2 to cover a wide range of scenarios.

02 Aug, 2024 1 commit
- lint · b732beba
  Michael Yang authored Aug 01, 2024
  
04 Jun, 2024 1 commit
- lint · e40145a3
  Michael Yang authored May 21, 2024
  
14 May, 2024 1 commit
- Ollama `ps` command for showing currently loaded models (#4327) · 68459888
  Patrick Devine authored May 13, 2024
  
08 May, 2024 1 commit
- Record GPU usage information · bee2f4a3
  Daniel Hiltgen authored May 04, 2024
```
This records more GPU usage information for eventual UX inclusion.
```
07 May, 2024 1 commit

fix: store accurate model parameter size (#4058) · 527e9be0

Bruce MacDonald authored May 07, 2024

- add test for number formatting
- fix bug where 1B and 1M were not stored correctly
- display 2 decimal points for million param sizes
- display 1 decimal point for billion param sizes
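
The formatting rules listed above can be sketched roughly as follows. This is a minimal illustration, not the code from the commit: the function name `humanParams`, the thresholds, and the plain-integer fallback for counts below one million are all assumptions made for the example.

```go
package main

import "fmt"

// humanParams is a hypothetical helper that renders a parameter count
// using one decimal place for billions and two for millions, following
// the rules listed in the commit message. Counts below one million are
// printed as plain integers (an assumption for this sketch).
func humanParams(n uint64) string {
	const (
		million = 1_000_000
		billion = 1_000_000_000
	)
	switch {
	case n >= billion:
		return fmt.Sprintf("%.1fB", float64(n)/billion)
	case n >= million:
		return fmt.Sprintf("%.2fM", float64(n)/million)
	default:
		return fmt.Sprintf("%d", n)
	}
}

func main() {
	// Print a few sample parameter counts in their short form.
	for _, n := range []uint64{999, 1_500_000, 137_000_000, 6_740_000_000} {
		fmt.Println(n, "->", humanParams(n))
	}
}
```

Checking the larger unit first keeps exact boundaries such as 1B from being formatted as a count of millions.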

23 Apr, 2024 1 commit

Request and model concurrency · 34b9db5a

Daniel Hiltgen authored Mar 30, 2024

This change adds support for multiple concurrent requests, as well as
loading multiple models by spawning multiple runners. The default
settings are currently set at 1 concurrent request per model and only 1
loaded model at a time, but these can be adjusted by setting
OLLAMA_NUM_PARALLEL and OLLAMA_MAX_LOADED_MODELS.

10 Apr, 2024 1 commit
- partial offloading · 7e33a017
  Michael Yang authored Apr 05, 2024
  
01 Apr, 2024 1 commit
- update memory calculations · 91b3e4d2
  Michael Yang authored Mar 18, 2024
```
count each layer independently when deciding gpu offloading
```
24 Feb, 2024 1 commit
- remove format/openssh.go · fd10a2ad
  Michael Yang authored Feb 23, 2024
```
this is unnecessary now that x/crypto/ssh.MarshalPrivateKey has been
added
```
28 Nov, 2023 1 commit
- progress: fix bar rate · 424d53ac
  Michael Yang authored Nov 18, 2023
  
20 Nov, 2023 1 commit
- only show decimal points for smaller file size numbers · 93a10821
  Jeffrey Morgan authored Nov 20, 2023
  
17 Nov, 2023 1 commit
- format bytes · 9f04e5a8
  Michael Yang authored Nov 14, 2023
  
14 Nov, 2023 1 commit
- replace go-humanize with format.HumanBytes · 01ea6002
  Michael Yang authored Nov 14, 2023
  
09 Nov, 2023 1 commit

instead of static number of parameters for each model family, get the real number from the tensors (#1022) · c5e1bbab

Michael Yang authored Nov 08, 2023

* parse tensor info

* refactor decoder

* return actual parameter count

* explicit rounding

* s/Human/HumanNumber/

19 Oct, 2023 1 commit
- go fmt · 2ce1793a
  Michael Yang authored Oct 19, 2023
  
13 Oct, 2023 1 commit
- fix memory check · 92189a58
  Michael Yang authored Oct 12, 2023
  
11 Oct, 2023 2 commits
- add format bytes · b599946b
  Michael Yang authored Oct 11, 2023
  
- cleanup format time · b5e08e33
  Michael Yang authored Oct 11, 2023
  
06 Sep, 2023 1 commit
- remove unused openssh key types · 0dae34b6
  Michael Yang authored Sep 06, 2023
  
11 Aug, 2023 1 commit
- Generate private/public keypair for use w/ auth (#324) · 9770e3b3
  Patrick Devine authored Aug 11, 2023
  
18 Jul, 2023 1 commit
- add new list command (#97) · 5bea29f6
  Patrick Devine authored Jul 18, 2023
  