- 07 May, 2024 22 commits
-
-
Tobias Gårdhus authored
-
boessu authored
fixing pip code.
-
Eli Bendersky authored
* api: fill up API documentation. Followup for #2878. Now that the documentation is more complete, mention it in the README. Updates #2840
* fix typo/lint
* Update README.md
Co-authored-by: Jeffrey Morgan <jmorganca@gmail.com>
-
Giuseppe Lumia authored
Some terminals may send line feed characters when pasting text with newlines.
-
Michael Yang authored
update list handler to use model.Name
-
Michael Yang authored
quantize any fp16/fp32 model
-
Bruce MacDonald authored
- add test for number formatting
- fix bug where 1B and 1M were not stored correctly
- display 2 decimal points for million param sizes
- display 1 decimal point for billion param sizes
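The formatting rules in this commit can be illustrated with a minimal sketch. It is not the project's implementation: the function name `human_params`, the exact thresholds, and the fallback for values below one million are assumptions made for illustration only.

```python
def human_params(n: int) -> str:
    """Format a parameter count for display (hypothetical helper).

    Assumed rules, following the commit message:
    - billions are shown with 1 decimal place and a "B" suffix
    - millions are shown with 2 decimal places and an "M" suffix
    - anything smaller is shown as a plain integer (an assumption)
    """
    if n >= 1_000_000_000:
        return f"{n / 1_000_000_000:.1f}B"
    if n >= 1_000_000:
        return f"{n / 1_000_000:.2f}M"
    return str(n)


# Exact boundary values, the case the commit says was previously mishandled.
print(human_params(1_000_000_000))  # 1.0B
print(human_params(1_000_000))      # 1.00M
```

Checking the exact 1B and 1M boundaries first is deliberate, since those are the values the commit reports as having been stored incorrectly.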
-
Renat authored
-
Fernando Maclen authored
-
Michael Yang authored
types/model: fix parser for empty values
-
Michael Yang authored
fly example
-
Michael Yang authored
-
Michael Yang authored
Revert "fix golangci workflow not enable gofmt and goimports"
-
Michael Yang authored
This reverts commit 04f971c8.
-
Mélony QIN authored
* add details on Kubernetes deployment and separate the testing process
* Update examples/kubernetes/README.md
  Thanks for suggesting this change, I agree with you and let's make this project better together!
Co-authored-by: JonZeolla <Zeolla@gmail.com>
Co-authored-by: QIN Mélony <MQN1@dsone.3ds.com>
-
Hause Lin authored
* Update README.md: add Ollama for R (ollama-r library)
* Update README.md
Co-authored-by: Jeffrey Morgan <jmorganca@gmail.com>
-
Jeffrey Morgan authored
-
alwqx authored
-
Michael Yang authored
-
Michael Yang authored
llm: add minimum based on layer size
-
Michael Yang authored
Co-authored-by: Jeffrey Morgan <jmorganca@gmail.com>
-
Michael Yang authored
-
- 06 May, 2024 18 commits
-
-
CrispStrobe authored
* note on naming restrictions; otherwise push would fail with the cryptic message "retrieving manifest Error: file does not exist" (maybe change that in code too)
* Update docs/import.md
Co-authored-by: C-4-5-3 <154636388+C-4-5-3@users.noreply.github.com>
Co-authored-by: Jeffrey Morgan <jmorganca@gmail.com>
-
Jeffrey Morgan authored
-
Jackie Li authored
Co-authored-by: Patrick Devine <patrick@infrahq.com>
-
Michael Yang authored
-
Michael Yang authored
-
Michael Yang authored
-
Michael Yang authored
-
Michael Yang authored
-
Michael Yang authored
-
Michael Yang authored
-
Michael Yang authored
-
Michael Yang authored
- FROM /path/to/{safetensors,pytorch}
- FROM /path/to/fp{16,32}.bin
- FROM model:fp{16,32}
-
Jeffrey Chen authored
-
Mohamed A. Fouad authored
Add -e to the log-viewing command in order to show the end of the ollama logs
-
Daniel Hiltgen authored
Use our bundled libraries (cuda) instead of the host library
-
Darinka authored
* Update api.md: changed the calculation of tps (token/s) in the documentation
* Update docs/api.md
Co-authored-by: Jeffrey Morgan <jmorganca@gmail.com>
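For context, a tokens-per-second figure of this kind can be computed as sketched below. The commit message does not state the formula, so this is an assumption: it takes the response to report a token count (`eval_count`) and a generation time in nanoseconds (`eval_duration`), and the function name is hypothetical.

```python
def tokens_per_second(eval_count: int, eval_duration_ns: int) -> float:
    """Return generation speed in tokens per second.

    Assumes eval_count is the number of generated tokens and
    eval_duration_ns is the generation time in nanoseconds.
    """
    if eval_duration_ns <= 0:
        raise ValueError("eval_duration_ns must be positive")
    # Multiply by 1e9 to convert the nanosecond duration to seconds.
    return eval_count / eval_duration_ns * 1e9


# Example with made-up numbers: 100 tokens generated in 2 seconds.
print(round(tokens_per_second(100, 2_000_000_000), 2))
```

The conversion factor is the whole point: dividing the token count by a nanosecond duration without scaling by 10^9 gives a value that is off by nine orders of magnitude.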
-
Daniel Hiltgen authored
Support Fedora's standard ROCm location
-
Daniel Hiltgen authored
Trying to live off the land for CUDA libraries was not the right strategy. We need to use the version we compiled against to ensure things work properly.
-