Commits · b3e5491e41811294de9d81649a96581af6522d08 · OpenDAS / ollama

22 Jul, 2024 1 commit
- server: collect nested tool call objects when parsing (#5824) · b3e5491e
  Jeffrey Morgan authored Jul 22, 2024
  
21 Jul, 2024 2 commits
- Remove out of space test temporarily (#5825) · 80ee9b5e
  Jeffrey Morgan authored Jul 21, 2024
  
- llm: consider `head_dim` in llama arch (#5817) · 5534f2cc
  Jeffrey Morgan authored Jul 20, 2024
  
20 Jul, 2024 7 commits
- Merge pull request #5815 from dhiltgen/win_rocm_gfx_features · d321297d
  Daniel Hiltgen authored Jul 20, 2024
```
Adjust windows ROCm discovery
```
- Merge pull request #5506 from dhiltgen/sched_tests · 06e5d74e
  Daniel Hiltgen authored Jul 20, 2024
```
Refine scheduler unit tests for reliability
```
- Merge pull request #5583 from dhiltgen/integration_improvements · 5d707e6f
  Daniel Hiltgen authored Jul 20, 2024
```
Fix context exhaustion integration test for small gpus
```
- Adjust windows ROCm discovery · 283948c8
  Daniel Hiltgen authored Jul 19, 2024
```
The v5 hip library returns unsupported GPUs which won't enumerate at
inference time in the runner so this makes sure we align discovery.  The
gfx906 cards are no longer supported so we shouldn't compile with that
GPU type as it won't enumerate at runtime.
```
- add patch for tekken (#5807) · 1475eab9
  Jeffrey Morgan authored Jul 20, 2024
  
- preserve last assistant message (#5802) · 20090f31
  Jeffrey Morgan authored Jul 19, 2024
  
- Fix generate test flakiness (#5804) · 69a2d4cc
  Jeffrey Morgan authored Jul 19, 2024
  
19 Jul, 2024 3 commits
- server: validate template (#5734) · e8b954c6
  Josh authored Jul 19, 2024
```
add template validation to modelfile
```
- OpenAI: Function Based Testing (#5752) · c57317cb
  royjhan authored Jul 19, 2024
```
* distinguish error forwarding

* more coverage

* rm comment
```
- adjust openai chat msg processing (#5729) · 51b2fd29
  royjhan authored Jul 19, 2024
  
18 Jul, 2024 5 commits
- Merge pull request #5780 from ollama/mxyng/tools · d0634b15
  Michael Yang authored Jul 18, 2024
```
fix parsing tool calls: break on unexpected eofs
```
- fix parsing tool calls · 43606d6d
  Michael Yang authored Jul 18, 2024
  
- server: check for empty tools array too (#5779) · 70b1010f
  Jeffrey Morgan authored Jul 18, 2024
  
- always provide content even if empty (#5778) · 84e5721f
  Jeffrey Morgan authored Jul 18, 2024
  
- server: only parse tool calls if tools are provided (#5771) · 319fb1ce
  Jeffrey Morgan authored Jul 18, 2024
```
* server: only parse tool calls if tools are provided

* still set `resp.Message.Content`
```
17 Jul, 2024 8 commits
- marshal json automatically for some template values (#5758) · b2554455
  Michael Yang authored Jul 17, 2024
  
- Merge pull request #5753 from ollama/mxyng/parse-tool-call · b23424bb
  Michael Yang authored Jul 17, 2024
```
parse tool call as individual objects
```
- parse tool call as individual objects · 5fd69881
  Michael Yang authored Jul 17, 2024
  
- stub response (#5750) · 5b82960d
  Michael Yang authored Jul 17, 2024
  
- Merge pull request #5732 from ollama/mxyng/cleanup · cc9a252d
  Michael Yang authored Jul 17, 2024
```
remove ToolCall from GenerateResponse
```
- add sidellama link (#5702) · d281a6e6
  Pákozdi György authored Jul 17, 2024
  
- OpenAI: Support Tools (#5614) · 154f6f45
  royjhan authored Jul 16, 2024
```
* reopen pr

* tools

* remove tc from stream for now

* ID and Function

* openai expects arguments to be a string (#5739)

* mutually exclusive content and tool calls

* clean up

---------
Co-authored-by: Jeffrey Morgan <jmorganca@gmail.com>
```
- OpenAI: Add Suffix to `v1/completions` (#5611) · 0d41623b
  royjhan authored Jul 16, 2024
```
* add suffix

* remove todo

* remove TODO

* add to test

* rm outdated prompt tokens info md

* fix test

* fix test
```
16 Jul, 2024 14 commits
- remove ToolCall from GenerateResponse · c279f963
  Michael Yang authored Jul 16, 2024
  
- Merge pull request #5730 from ollama/mxyng/cleanup · 499e87c9
  Michael Yang authored Jul 16, 2024
```
remove unneeded tool calls
```
- Merge pull request #5207 from ollama/mxyng/suffix · cd0853f2
  Michael Yang authored Jul 16, 2024
```
add insert support to generate endpoint
```
- add suffix support to generate endpoint · d290e875
  Michael Yang authored Jun 20, 2024
```
this change is triggered by the presence of "suffix", particularly
useful for code completion tasks
```
- README: Added AI Studio to the list of UIs (#5721) · 97c20ede
  Thorsten Sommer authored Jul 16, 2024
```
* Added AI Studio to the list of UIs
```
- remove unneeded tool calls · 5a83f79a
  Michael Yang authored Jul 16, 2024
  
- OpenAI: /v1/embeddings compatibility (#5285) · 987dbab0
  royjhan authored Jul 16, 2024
```
* OpenAI v1 models

* Empty List Testing

* Add back envconfig

* v1/models docs

* Remove Docs

* OpenAI batch embed compatibility

* merge conflicts

* integrate with api/embed

* ep

* merge conflicts

* request tests

* rm resp test

* merge conflict

* merge conflict

* test fixes

* test fn renaming

* input validation for empty string

---------
Co-authored-by: jmorganca <jmorganca@gmail.com>
```
- Merge pull request #5726 from ollama/mxyng/tools-templates · a8388beb
  Michael Yang authored Jul 16, 2024
```
fix unmarshal type errors
```
- fix unmarshal type errors · 5afbb60f
  Michael Yang authored Jul 16, 2024
  
- server: omit model system prompt if empty (#5717) · 4cb5d7de
  Jeffrey Morgan authored Jul 16, 2024
  
- Merge pull request #5684 from ollama/mxyng/tests · 8eac50dd
  Michael Yang authored Jul 16, 2024
```
add chat and generate tests with mock runner
```
- add chat and generate tests with mock runner · 4a565cbf
  Michael Yang authored Jul 13, 2024
  
- Merge pull request #5284 from ollama/mxyng/tools · 64039df6
  Michael Yang authored Jul 15, 2024
```
tools
```
- server: return empty slice on empty `/api/embed` request (#5713) · 7ac6d462
  Jeffrey Morgan authored Jul 15, 2024
```
* server: return empty slice on empty `/api/embed` request

* fix tests
```