- 02 Nov, 2024 8 commits
-
-
youkaichao authored
Signed-off-by: youkaichao <youkaichao@gmail.com>
-
Yongzao authored
-
youkaichao authored
Signed-off-by: youkaichao <youkaichao@gmail.com>
-
youkaichao authored
Signed-off-by: youkaichao <youkaichao@gmail.com>
-
Robert Shaw authored
-
youkaichao authored
Signed-off-by: youkaichao <youkaichao@gmail.com>
Co-authored-by: Nick Hill <nhill@redhat.com>
-
youkaichao authored
Signed-off-by: youkaichao <youkaichao@gmail.com>
-
sroy745 authored
-
- 01 Nov, 2024 15 commits
-
-
Peter Salas authored
Signed-off-by: Peter Salas <peter@fixie.ai>
-
Gene Der Su authored
Signed-off-by: Gene Su <e870252314@gmail.com>
-
youkaichao authored
Signed-off-by: youkaichao <youkaichao@gmail.com>
-
Pavani Majety authored
-
youkaichao authored
Signed-off-by: youkaichao <youkaichao@gmail.com>
-
André Jonasson authored
Signed-off-by: André Jonasson <andre.jonasson@gmail.com>
-
Travis Johnson authored
Signed-off-by: Travis Johnson <tsjohnso@us.ibm.com>
Signed-off-by: Prashant Gupta <prashantgupta@us.ibm.com>
Co-authored-by: Prashant Gupta <prashantgupta@us.ibm.com>
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
-
Cyrus Leung authored
-
Cyrus Leung authored
-
Yongzao authored
Signed-off-by: youkaichao <youkaichao@gmail.com>
Co-authored-by: youkaichao <youkaichao@gmail.com>
-
Michael Goin authored
Signed-off-by: mgoin <michael@neuralmagic.com>
-
youkaichao authored
Signed-off-by: youkaichao <youkaichao@gmail.com>
-
Michael Goin authored
Signed-off-by: mgoin <michael@neuralmagic.com>
-
youkaichao authored
Signed-off-by: youkaichao <youkaichao@gmail.com>
-
Joe Runde authored
Signed-off-by: Joe Runde <Joseph.Runde@ibm.com>
-
- 31 Oct, 2024 6 commits
-
-
sasha0552 authored
[Bugfix] Fix `illegal memory access` error with chunked prefill, prefix caching, block manager v2 and xformers enabled together (#9532)
Signed-off-by: sasha0552 <admin@sasha0552.org>
-
Jee Jee Li authored
-
Roger Wang authored
Signed-off-by: Roger Wang <ywang@roblox.com>
-
Michael Goin authored
Signed-off-by: mgoin <michael@neuralmagic.com>
-
Kevin H. Luu authored
-
Guillaume Calmettes authored
[Misc][OpenAI] Deprecate `max_tokens` in favor of new `max_completion_tokens` field for chat completion endpoint (#9837)
-
- 30 Oct, 2024 9 commits
-
-
Joe Runde authored
Signed-off-by: Joe Runde <Joseph.Runde@ibm.com>
-
Elfie Guo authored
-
Went-Liang authored
Signed-off-by: Went-Liang <wenteng_liang@163.com>
-
Alex Brooks authored
Signed-off-by: Alex-Brooks <Alex.Brooks@ibm.com>
-
Woosuk Kwon authored
-
youkaichao authored
Signed-off-by: youkaichao <youkaichao@gmail.com>
-
Yan Ma authored
Signed-off-by: YiSheng5 <syhm@mail.ustc.edu.cn>
Signed-off-by: yan ma <yan.ma@intel.com>
Co-authored-by: YiSheng5 <syhm@mail.ustc.edu.cn>
-
Michael Goin authored
-
youkaichao authored
Signed-off-by: youkaichao <youkaichao@gmail.com>
-
- 29 Oct, 2024 2 commits
-
-
Michael Goin authored
-
Will Eaton authored
Signed-off-by: Max de Bayser <mbayser@br.ibm.com>
Co-authored-by: Max de Bayser <mbayser@br.ibm.com>
Co-authored-by: Maximilien de Bayser <maxdebayser@gmail.com>
-