1. 27 Feb, 2024 2 commits
  2. 16 Jan, 2024 1 commit
  3. 15 Jan, 2024 2 commits
  4. 13 Jan, 2024 3 commits
  5. 11 Jan, 2024 7 commits
  6. 10 Jan, 2024 7 commits
  7. 07 Jan, 2024 6 commits
  8. 04 Jan, 2024 1 commit
  9. 02 Jan, 2024 2 commits
  10. 27 Dec, 2023 2 commits
  11. 23 Dec, 2023 1 commit
  12. 22 Dec, 2023 3 commits
    • Anjor Kanekar's avatar
    • Upstream Mamba Support (`mamba_ssm`) (#1110) · 5503b274
      Hailey Schoelkopf authored
      * modularize HFLM code
      
      * pass through extra kwargs to AutoModel.from_pretrained call
      
      * remove explicit model_kwargs
      
      * rename gptq -> autogptq
      
      * fix tokenizer pad token errors
      
      * ensure model always respects device_map and autogptq's selected devices
      
      * add a _get_config helper fn
      
      * add mambaLMWrapper
      
      * add mamba extra
      
      * add mamba extra
      
      * fix conditional import
      
      * Fix botched merge commit
      
      * Remove beginning-of-file comment for consistency
      
      * Add docstring for mambaLM re: supported kwargs
      
      * Alphabetize extras
      
      * Update extras table
      
      * appease precommit
      
      * run precommit on mamba_lm
    • Generic decorator for handling rate limit errors (#1109) · 046ea6e2
      Zach Schillaci authored
      
      
      * Add retry error handler
      
      * fixup! Add retry error handler
      
      * Move to utils.py
      
      * Run isort on utils.py
      
      * Catch multiple exceptions
      
      * Update LMs with exception handler
      
      * Fixes to anthropic retry handler
      
      * fix callback kwarg
      
      * Update textsynth.py
      
      * fix python 3.8 incompatibility
      
      * fix indenterror I introduced
      
      * placate linter?
      
      * Update on_exception_callback kwarg name
      
      * fixup! Merge branch 'main' into add-retry-error-handler
      
      * fixup! fixup! Merge branch 'main' into add-retry-error-handler
      
      * Merge conflicts are fun
      
      * Run pre-commit
      
      ---------
      Co-authored-by: Hailey Schoelkopf <65563625+haileyschoelkopf@users.noreply.github.com>
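The retry-handler commit above describes a generic decorator that catches several exception types and reports each failure through a callback. A minimal sketch of that pattern follows. The function name `retry_on_exceptions`, its parameters, the backoff defaults, and the injectable `sleep` argument are all assumptions made for illustration. They are not taken from the repository's `utils.py`.

```python
import time
from functools import wraps
from typing import Callable, Optional, Sequence, Type


def retry_on_exceptions(
    on_exceptions: Sequence[Type[Exception]],
    max_retries: Optional[int] = None,  # None means retry indefinitely
    backoff_time: float = 3.0,  # hypothetical default, in seconds
    backoff_multiplier: float = 1.5,  # hypothetical default
    on_exception_callback: Optional[Callable[[Exception, float], None]] = None,
    sleep: Callable[[float], None] = time.sleep,  # injectable so tests need not wait
):
    """Retry the wrapped function when it raises one of `on_exceptions`."""
    caught = tuple(on_exceptions)

    def decorator(func):
        @wraps(func)
        def wrapper(*args, **kwargs):
            sleep_time = backoff_time
            attempt = 0
            while max_retries is None or attempt < max_retries:
                try:
                    return func(*args, **kwargs)
                except caught as exc:
                    # Report the failure and the upcoming wait, then back off.
                    if on_exception_callback is not None:
                        on_exception_callback(exc, sleep_time)
                    sleep(sleep_time)
                    sleep_time *= backoff_multiplier
                    attempt += 1
            # Retries exhausted: one last call whose exception propagates.
            return func(*args, **kwargs)

        return wrapper

    return decorator
```

In this sketch a rate-limited API call would be wrapped as `@retry_on_exceptions([SomeRateLimitError], max_retries=5)`, where `SomeRateLimitError` stands in for whatever exception class the client library raises.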
  13. 21 Dec, 2023 2 commits
  14. 20 Dec, 2023 1 commit
    • Implementing local OpenAI API-style chat completions on any given inference server (#1174) · fcfc0c60
      Vicki Boykis authored
      * LocalChatCompletionsLM add
      
      * clean up completions class
      
      * clean up completions class
      
      * update tokens
      
      * README
      
      * fix constructor
      
      * eos token
      
      * folding local-chat-completions into OpenAIChatCompletions
      
      * refactoring to include gen_kwargs as passable option
      
      * add todo on chat completion kwarg validation
      
      * Ruff and README fix
      
      * generalize to **kwargs
      
      * remove unnecessary kwargs
      
      * README and remove kwargs
      
      * README