- 28 Jan, 2025 6 commits
-
-
Mengqing Cao authored
Signed-off-by: Mengqing Cao <cmq0113@163.com>
-
Gabriel Marinho authored
Signed-off-by: Gabriel Marinho <gmarinho@ibm.com>
-
Hossein Sarshar authored
-
Michael Goin authored
Signed-off-by: mgoin <michael@neuralmagic.com>
-
Liangfu Chen authored
Signed-off-by: Liangfu Chen <liangfc@amazon.com>
Co-authored-by: Jiangfei Duan <jfduan@outlook.com>
-
Harry Mellor authored
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
- 27 Jan, 2025 13 commits
-
-
Nicolò Lucchesi authored
[Feature] [Spec decode]: Enable MLPSpeculator/Medusa and `prompt_logprobs` with ChunkedPrefill (#10132)
Signed-off-by: NickLucche <nlucches@redhat.com>
Signed-off-by: wallashss <wallashss@ibm.com>
Co-authored-by: wallashss <wallashss@ibm.com>
-
Bowen Wang authored
Signed-off-by: Bowen Wang <abmfy@icloud.com>
Signed-off-by: youkaichao <youkaichao@gmail.com>
Co-authored-by: youkaichao <youkaichao@gmail.com>
-
Woosuk Kwon authored
Signed-off-by: Woosuk Kwon <woosuk.kwon@berkeley.edu>
-
Mark McLoughlin authored
Signed-off-by: Mark McLoughlin <markmc@redhat.com>
-
Lucas Wilkinson authored
Signed-off-by: Lucas Wilkinson <lwilkinson@neuralmagic.com>
-
Isotr0py authored
Signed-off-by: Isotr0py <2037008807@qq.com>
-
Woosuk Kwon authored
Signed-off-by: Woosuk Kwon <woosuk.kwon@berkeley.edu>
-
Isotr0py authored
Signed-off-by: Isotr0py <2037008807@qq.com>
-
Cyrus Leung authored
Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
-
Pooya Davoodi authored
Signed-off-by: Pooya Davoodi <pooya.davoodi@parasail.io>
-
Woosuk Kwon authored
Signed-off-by: Woosuk Kwon <woosuk.kwon@berkeley.edu>
-
Yuan Tang authored
Signed-off-by: Yuan Tang <terrytangyuan@gmail.com>
-
Kyle Mistele authored
Signed-off-by: Kyle Mistele <kyle@mistele.com>
-
- 26 Jan, 2025 9 commits
-
-
Tyler Michael Smith authored
Signed-off-by: Tyler Michael Smith <tyler@neuralmagic.com>
-
Lucas Wilkinson authored
Signed-off-by: Lucas Wilkinson <lwilkinson@neuralmagic.com>
-
Tyler Michael Smith authored
-
Tyler Michael Smith authored
Signed-off-by: Tyler Michael Smith <tyler@neuralmagic.com>
Co-authored-by: mgoin <michael@neuralmagic.com>
-
Matthew Hendrey authored
Signed-off-by: Matthew Hendrey <matthew.hendrey@gmail.com>
Signed-off-by: Shangming Cai <caishangming@linux.alibaba.com>
Signed-off-by: youkaichao <youkaichao@gmail.com>
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
Signed-off-by: Yuan Tang <terrytangyuan@gmail.com>
Signed-off-by: Isotr0py <2037008807@qq.com>
Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
Signed-off-by: Chen Zhang <zhangch99@outlook.com>
Signed-off-by: wangxiyuan <wangxiyuan1007@gmail.com>
Co-authored-by: shangmingc <caishangming@linux.alibaba.com>
Co-authored-by: youkaichao <youkaichao@gmail.com>
Co-authored-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
Co-authored-by: Yuan Tang <terrytangyuan@gmail.com>
Co-authored-by: Isotr0py <mozf@mail2.sysu.edu.cn>
Co-authored-by: Cyrus Leung <tlleungac@connect.ust.hk>
Co-authored-by: Chen Zhang <zhangch99@outlook.com>
Co-authored-by: wangxiyuan <wangxiyuan1007@gmail.com>
-
Roger Wang authored
-
Roger Wang authored
Signed-off-by: Roger Wang <ywang@roblox.com>
-
Keyun Tong authored
Signed-off-by: Keyun Tong <tongkeyun@gmail.com>
-
Roger Wang authored
Signed-off-by: Roger Wang <ywang@roblox.com>
Signed-off-by: Isotr0py <2037008807@qq.com>
Co-authored-by: Isotr0py <2037008807@qq.com>
-
- 25 Jan, 2025 5 commits
-
-
Siyuan Liu authored
Signed-off-by: Siyuan Liu <lsiyuan@google.com>
-
Isotr0py authored
Signed-off-by: Isotr0py <2037008807@qq.com>
-
Divakar Verma authored
Signed-off-by: Divakar Verma <divakar.verma@amd.com>
-
Cyrus Leung authored
Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
-
ElizaWszola authored
-
- 24 Jan, 2025 7 commits
-
-
Lucas Wilkinson authored
[Bugfix][Kernel] FA3 Fix - RuntimeError: This flash attention build only supports pack_gqa (for build size reasons). (#12405)
Signed-off-by: Lucas Wilkinson <lwilkinson@neuralmagic.com>
-
Cyrus Leung authored
Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
-
Lucas Wilkinson authored
Signed-off-by: Lucas Wilkinson <lwilkinson@neuralmagic.com>
-
Junichi Sato authored
Signed-off-by: Junichi Sato <junichi.sato@sbintuitions.co.jp>
-
youkaichao authored
Signed-off-by: youkaichao <youkaichao@gmail.com>
-
Mohit Deopujari authored
-
youkaichao authored
Signed-off-by: youkaichao <youkaichao@gmail.com>
-