- 20 Jul, 2024 4 commits
-
-
Daniel Hiltgen authored
The v5 hip library returns unsupported GPUs which wont enumerate at inference time in the runner so this makes sure we align discovery. The gfx906 cards are no longer supported so we shouldn't compile with that GPU type as it wont enumerate at runtime.
-
Jeffrey Morgan authored
-
Jeffrey Morgan authored
-
Jeffrey Morgan authored
-
- 19 Jul, 2024 3 commits
- 18 Jul, 2024 5 commits
-
-
Michael Yang authored
fix parsing tool calls: break on unexpected eofs
-
Michael Yang authored
-
Jeffrey Morgan authored
-
Jeffrey Morgan authored
-
Jeffrey Morgan authored
* server: only parse tool calls if tools are provided * still set `resp.Message.Content`
-
- 17 Jul, 2024 8 commits
-
-
Michael Yang authored
-
Michael Yang authored
parse tool call as individual objects
-
Michael Yang authored
-
Michael Yang authored
-
Michael Yang authored
remove ToolCall from GenerateResponse
-
Pákozdi György authored
-
royjhan authored
* reopen pr * tools * remove tc from stream for now * ID and Function * openai expects arguments to be a string (#5739) * mutually exclusive content and tool calls * clean up --------- Co-authored-by:Jeffrey Morgan <jmorganca@gmail.com>
-
royjhan authored
* add suffix * remove todo * remove TODO * add to test * rm outdated prompt tokens info md * fix test * fix test
-
- 16 Jul, 2024 15 commits
-
-
Michael Yang authored
-
Michael Yang authored
remove unneeded tool calls
-
Michael Yang authored
add insert support to generate endpoint
-
Michael Yang authored
this change is triggered by the presence of "suffix", particularly useful for code completion tasks
-
Thorsten Sommer authored
* Added AI Studio to the list of UIs
-
Michael Yang authored
-
royjhan authored
* OpenAI v1 models * Empty List Testing * Add back envconfig * v1/models docs * Remove Docs * OpenAI batch embed compatibility * merge conflicts * integrate with api/embed * ep * merge conflicts * request tests * rm resp test * merge conflict * merge conflict * test fixes * test fn renaming * input validation for empty string --------- Co-authored-by:jmorganca <jmorganca@gmail.com>
-
Michael Yang authored
fix unmarshal type errors
-
Michael Yang authored
-
Jeffrey Morgan authored
-
Michael Yang authored
add chat and generate tests with mock runner
-
Michael Yang authored
-
Michael Yang authored
tools
-
Jeffrey Morgan authored
* server: return empty slice on empty `/api/embed` request * fix tests
-
Michael Yang authored
-
- 15 Jul, 2024 5 commits
-
-
Daniel Hiltgen authored
Bump linux ROCm to 6.1.2
-
Michael Yang authored
-
Daniel Hiltgen authored
-
Jeffrey Morgan authored
-
royjhan authored
* Initial Batch Embedding * Revert "Initial Batch Embedding" This reverts commit c22d54895a280b54c727279d85a5fc94defb5a29. * Initial Draft * mock up notes * api/embed draft * add server function * check normalization * clean up * normalization * playing around with truncate stuff * Truncation * Truncation * move normalization to go * Integration Test Template * Truncation Integration Tests * Clean up * use float32 * move normalize * move normalize test * refactoring * integration float32 * input handling and handler testing * Refactoring of legacy and new * clear comments * merge conflicts * touches * embedding type 64 * merge conflicts * fix hanging on single string * refactoring * test values * set context length * clean up * testing clean up * testing clean up * remove function closure * Revert "remove function closure" This reverts commit 55d48c6ed17abe42e7a122e69d603ef0c1506787. * remove function closure * remove redundant error check * clean up * more clean up * clean up
-