- 22 Feb, 2025 1 commit
-
-
zhuwenwen authored
-
- 18 Feb, 2025 1 commit
-
-
Woosuk Kwon authored
Signed-off-by:Woosuk Kwon <woosuk.kwon@berkeley.edu>
-
- 16 Feb, 2025 1 commit
-
-
Michael Goin authored
-
- 14 Feb, 2025 1 commit
-
-
Russell Bryant authored
-
- 13 Feb, 2025 1 commit
-
-
Nicolò Lucchesi authored
-
- 05 Feb, 2025 1 commit
-
-
Dipika Sikka authored
-
- 03 Feb, 2025 1 commit
-
-
Arthur authored
# Adds support for `transformers` as a backend Following https://github.com/huggingface/transformers/pull/35235 , a bunch of models should already be supported, we are ramping up support for more models. Thanks @Isotr0py for the TP support, and @hmellor for his help as well! This includes: - `trust_remote_code=True` support: any model on the hub, if it implements attention the correct way can be natively supported!! - tensor parallel support --------- Signed-off-by:
Harry Mellor <19981378+hmellor@users.noreply.github.com> Signed-off-by:
Isotr0py <2037008807@qq.com> Co-authored-by:
Isotr0py <41363108+Isotr0py@users.noreply.github.com> Co-authored-by:
Harry Mellor <19981378+hmellor@users.noreply.github.com> Co-authored-by:
Isotr0py <2037008807@qq.com> Co-authored-by:
Cyrus Leung <cyrus.tl.leung@gmail.com> Co-authored-by:
Michael Goin <mgoin64@gmail.com> Co-authored-by:
Isotr0py <mozf@mail2.sysu.edu.cn>
-
- 01 Feb, 2025 1 commit
-
-
Kevin H. Luu authored
-
- 24 Jan, 2025 1 commit
-
-
Dipika Sikka authored
-
- 23 Jan, 2025 1 commit
-
-
zhuwenwen authored
-
- 21 Jan, 2025 1 commit
-
-
Thomas Parnell authored
Signed-off-by:Thomas Parnell <tpa@zurich.ibm.com>
-
- 08 Jan, 2025 1 commit
-
-
zhuwenwen authored
-
- 20 Dec, 2024 1 commit
-
-
youkaichao authored
Signed-off-by:
drikster80 <ed.sealing@gmail.com> Signed-off-by:
youkaichao <youkaichao@gmail.com> Co-authored-by:
drikster80 <ed.sealing@gmail.com> Co-authored-by:
cenzhiyao <2523403608@qq.com>
-
- 18 Dec, 2024 2 commits
-
-
Kunshang Ji authored
Signed-off-by:Kunshang Ji <kunshang.ji@intel.com>
-
Wallas Henrique authored
-
- 14 Dec, 2024 2 commits
-
-
Russell Bryant authored
Signed-off-by:Russell Bryant <rbryant@redhat.com>
-
dhuangnm authored
Co-authored-by:dhuangnm <dhuang@MacBook-Pro-2.local>
-
- 12 Dec, 2024 2 commits
-
-
zhuwenwen authored
-
Alexander Matveev authored
Signed-off-by:
Roger Wang <ywang@roblox.com> Signed-off-by:
Alexander Matveev <alexm@neuralmagic.com> Co-authored-by:
Michael Goin <michael@neuralmagic.com> Co-authored-by:
Roger Wang <ywang@roblox.com>
-
- 11 Dec, 2024 2 commits
-
-
youkaichao authored
Signed-off-by:youkaichao <youkaichao@gmail.com>
-
Kevin H. Luu authored
-
- 10 Dec, 2024 1 commit
-
-
Russell Bryant authored
Signed-off-by:Russell Bryant <rbryant@redhat.com>
-
- 09 Dec, 2024 1 commit
-
-
Russell Bryant authored
Signed-off-by:Russell Bryant <rbryant@redhat.com>
-
- 03 Dec, 2024 2 commits
-
-
Michael Goin authored
Signed-off-by:mgoin <michael@neuralmagic.com>
-
Aaron Pham authored
Signed-off-by:
Aaron Pham <contact@aarnphm.xyz> Signed-off-by:
mgoin <michael@neuralmagic.com> Co-authored-by:
mgoin <michael@neuralmagic.com>
-
- 28 Nov, 2024 1 commit
-
-
youkaichao authored
Signed-off-by:youkaichao <youkaichao@gmail.com>
-
- 15 Nov, 2024 2 commits
-
-
Simon Mo authored
Signed-off-by:simon-mo <simon.mo@hey.com>
-
Guillaume Calmettes authored
[Bugfix] Ensure special tokens are properly filtered out for guided structured output with MistralTokenizer (#10363) Signed-off-by:Guillaume Calmettes <gcalmettes@scaleway.com>
-
- 13 Nov, 2024 1 commit
-
-
Dipika Sikka authored
Signed-off-by:Dipika <dipikasikka1@gmail.com>
-
- 31 Oct, 2024 1 commit
-
-
Guillaume Calmettes authored
[Misc][OpenAI] deprecate max_tokens in favor of new max_completion_tokens field for chat completion endpoint (#9837)
-
- 18 Oct, 2024 1 commit
-
-
Dipika Sikka authored
-
- 16 Oct, 2024 1 commit
-
-
Cyrus Leung authored
-
- 15 Oct, 2024 1 commit
-
-
Michael Goin authored
Co-authored-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
- 30 Sep, 2024 1 commit
-
-
Roger Wang authored
-
- 28 Sep, 2024 1 commit
-
-
Tyler Titsworth authored
Signed-off-by:
tylertitsworth <tyler.titsworth@intel.com> Co-authored-by:
youkaichao <youkaichao@126.com>
-
- 26 Sep, 2024 1 commit
-
-
Cyrus Leung authored
-
- 25 Sep, 2024 1 commit
-
-
Chen Zhang authored
Co-authored-by:
simon-mo <xmo@berkeley.edu> Co-authored-by:
Chang Su <chang.s.su@oracle.com> Co-authored-by:
Simon Mo <simon.mo@hey.com> Co-authored-by:
Roger Wang <136131678+ywang96@users.noreply.github.com> Co-authored-by:
Roger Wang <ywang@roblox.com>
-
- 22 Sep, 2024 2 commits
-
-
youkaichao authored
-
youkaichao authored
-
- 17 Sep, 2024 1 commit
-
-
Joe Runde authored
Signed-off-by:Joe Runde <Joseph.Runde@ibm.com>
-