- 01 Nov, 2024 12 commits
-
-
André Jonasson authored
Signed-off-by:André Jonasson <andre.jonasson@gmail.com>
-
Travis Johnson authored
Signed-off-by:
Travis Johnson <tsjohnso@us.ibm.com> Signed-off-by:
Prashant Gupta <prashantgupta@us.ibm.com> Co-authored-by:
Prashant Gupta <prashantgupta@us.ibm.com> Co-authored-by:
Patrick von Platen <patrick.v.platen@gmail.com>
-
Cyrus Leung authored
-
Michael Goin authored
-
Cyrus Leung authored
-
Cyrus Leung authored
-
Yongzao authored
Signed-off-by:
youkaichao <youkaichao@gmail.com> Co-authored-by:
youkaichao <youkaichao@gmail.com>
-
Michael Goin authored
Signed-off-by:mgoin <michael@neuralmagic.com>
-
youkaichao authored
Signed-off-by:youkaichao <youkaichao@gmail.com>
-
Michael Goin authored
Signed-off-by:mgoin <michael@neuralmagic.com>
-
youkaichao authored
Signed-off-by:youkaichao <youkaichao@gmail.com>
-
Joe Runde authored
Signed-off-by:Joe Runde <Joseph.Runde@ibm.com>
-
- 31 Oct, 2024 10 commits
-
-
Kevin H. Luu authored
Signed-off-by:kevin <kevin@anyscale.com>
-
Mor Zusman authored
Signed-off-by:mzusman <mor.zusmann@gmail.com>
-
sasha0552 authored
[Bugfix] Fix `illegal memory access` error with chunked prefill, prefix caching, block manager v2 and xformers enabled together (#9532) Signed-off-by:sasha0552 <admin@sasha0552.org>
-
Alexei-V-Ivanov-AMD authored
-
Alex Brooks authored
Signed-off-by:
Alex-Brooks <Alex.Brooks@ibm.com> Co-authored-by:
Cyrus Leung <cyrus.tl.leung@gmail.com> Co-authored-by:
DarkLight1337 <tlleungac@connect.ust.hk>
-
Jee Jee Li authored
-
Roger Wang authored
Signed-off-by:Roger Wang <ywang@roblox.com>
-
Michael Goin authored
Signed-off-by:mgoin <michael@neuralmagic.com>
-
Kevin H. Luu authored
-
Guillaume Calmettes authored
[Misc][OpenAI] deprecate max_tokens in favor of new max_completion_tokens field for chat completion endpoint (#9837)
-
- 30 Oct, 2024 18 commits
-
-
youkaichao authored
Signed-off-by:youkaichao <youkaichao@gmail.com>
-
Yongzao authored
Signed-off-by:
youkaichao <youkaichao@gmail.com> Co-authored-by:
youkaichao <youkaichao@gmail.com>
-
youkaichao authored
Signed-off-by:youkaichao <youkaichao@gmail.com>
-
Harsha vardhan manoj Bikki authored
-
Joe Runde authored
Signed-off-by:Joe Runde <Joseph.Runde@ibm.com>
-
Joe Runde authored
Signed-off-by:Joe Runde <Joseph.Runde@ibm.com>
-
Elfie Guo authored
-
Went-Liang authored
Signed-off-by:Went-Liang <wenteng_liang@163.com>
-
Alex Brooks authored
Signed-off-by:Alex-Brooks <Alex.Brooks@ibm.com>
-
Woosuk Kwon authored
-
Jee Jee Li authored
Signed-off-by:Jee Jee Li <pandaleefree@gmail.com>
-
youkaichao authored
Signed-off-by:youkaichao <youkaichao@gmail.com>
-
Russell Bryant authored
Signed-off-by:
Russell Bryant <rbryant@redhat.com> Co-authored-by:
Michael Goin <michael@neuralmagic.com> Co-authored-by:
Cyrus Leung <cyrus.tl.leung@gmail.com>
-
Yan Ma authored
Signed-off-by:
YiSheng5 <syhm@mail.ustc.edu.cn> Signed-off-by:
yan ma <yan.ma@intel.com> Co-authored-by:
YiSheng5 <syhm@mail.ustc.edu.cn>
-
Kevin H. Luu authored
Signed-off-by:kevin <kevin@anyscale.com>
-
Michael Goin authored
-
Lily Liu authored
-
youkaichao authored
Signed-off-by:youkaichao <youkaichao@gmail.com>
-