- 20 Jul, 2024 4 commits
-
-
Daniel Hiltgen authored
Fix context exhaustion integration test for small gpus
-
Jeffrey Morgan authored
-
Jeffrey Morgan authored
-
Jeffrey Morgan authored
-
- 19 Jul, 2024 3 commits
- 18 Jul, 2024 5 commits
-
-
Michael Yang authored
fix parsing tool calls: break on unexpected eofs
-
Michael Yang authored
-
Jeffrey Morgan authored
-
Jeffrey Morgan authored
-
Jeffrey Morgan authored
* server: only parse tool calls if tools are provided * still set `resp.Message.Content`
-
- 17 Jul, 2024 8 commits
-
-
Michael Yang authored
-
Michael Yang authored
parse tool call as individual objects
-
Michael Yang authored
-
Michael Yang authored
-
Michael Yang authored
remove ToolCall from GenerateResponse
-
Pákozdi György authored
-
royjhan authored
* reopen pr * tools * remove tc from stream for now * ID and Function * openai expects arguments to be a string (#5739) * mutually exclusive content and tool calls * clean up --------- Co-authored-by:Jeffrey Morgan <jmorganca@gmail.com>
-
royjhan authored
* add suffix * remove todo * remove TODO * add to test * rm outdated prompt tokens info md * fix test * fix test
-
- 16 Jul, 2024 15 commits
-
-
Michael Yang authored
-
Michael Yang authored
remove unneeded tool calls
-
Michael Yang authored
add insert support to generate endpoint
-
Michael Yang authored
this change is triggered by the presence of "suffix", particularly useful for code completion tasks
-
Thorsten Sommer authored
* Added AI Studio to the list of UIs
-
Michael Yang authored
-
royjhan authored
* OpenAI v1 models * Empty List Testing * Add back envconfig * v1/models docs * Remove Docs * OpenAI batch embed compatibility * merge conflicts * integrate with api/embed * ep * merge conflicts * request tests * rm resp test * merge conflict * merge conflict * test fixes * test fn renaming * input validation for empty string --------- Co-authored-by:jmorganca <jmorganca@gmail.com>
-
Michael Yang authored
fix unmarshal type errors
-
Michael Yang authored
-
Jeffrey Morgan authored
-
Michael Yang authored
add chat and generate tests with mock runner
-
Michael Yang authored
-
Michael Yang authored
tools
-
Jeffrey Morgan authored
* server: return empty slice on empty `/api/embed` request * fix tests
-
Michael Yang authored
-
- 15 Jul, 2024 5 commits
-
-
Daniel Hiltgen authored
Bump linux ROCm to 6.1.2
-
Michael Yang authored
-
Daniel Hiltgen authored
-
Jeffrey Morgan authored
-
royjhan authored
* Initial Batch Embedding * Revert "Initial Batch Embedding" This reverts commit c22d54895a280b54c727279d85a5fc94defb5a29. * Initial Draft * mock up notes * api/embed draft * add server function * check normalization * clean up * normalization * playing around with truncate stuff * Truncation * Truncation * move normalization to go * Integration Test Template * Truncation Integration Tests * Clean up * use float32 * move normalize * move normalize test * refactoring * integration float32 * input handling and handler testing * Refactoring of legacy and new * clear comments * merge conflicts * touches * embedding type 64 * merge conflicts * fix hanging on single string * refactoring * test values * set context length * clean up * testing clean up * testing clean up * remove function closure * Revert "remove function closure" This reverts commit 55d48c6ed17abe42e7a122e69d603ef0c1506787. * remove function closure * remove redundant error check * clean up * more clean up * clean up
-