- 17 Feb, 2025 1 commit
-
-
shangmingc authored
Signed-off-by:Shangming Cai <caishangming@linux.alibaba.com>
-
- 16 Feb, 2025 4 commits
-
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
youkaichao authored
Signed-off-by:youkaichao <youkaichao@gmail.com>
-
wchen61 authored
Signed-off-by:wchen61 <wchen61@foxmail.com>
-
Lily Liu authored
Signed-off-by:LiuXiaoxuanPKU <lilyliupku@gmail.com>
-
- 15 Feb, 2025 3 commits
-
-
Cody Yu authored
-
Mark McLoughlin authored
-
Woosuk Kwon authored
Signed-off-by:Woosuk Kwon <woosuk.kwon@berkeley.edu>
-
- 14 Feb, 2025 10 commits
-
-
Aoyu authored
Signed-off-by:
Aoyu <aoyuzhan@amazon.com> Co-authored-by:
Aoyu <aoyuzhan@amazon.com>
-
Joe Runde authored
Signed-off-by:
Joe Runde <Joseph.Runde@ibm.com> Signed-off-by:
Prashant Gupta <prashantgupta@us.ibm.com> Co-authored-by:
Prashant Gupta <prashantgupta@us.ibm.com> Co-authored-by:
Cody Yu <hao.yu.cody@gmail.com>
-
Michael Goin authored
Signed-off-by:mgoin <mgoin64@gmail.com>
-
Lu Fang authored
-
Alexander Matveev authored
-
Kero Liang authored
[Bugfix][V1] GPUModelRunner._update_states should return True when there is a finished request in batch (#13126)
-
Varun Sundar Rabindranath authored
Signed-off-by:
Varun Sundar Rabindranath <varun@neuralmagic.com> Co-authored-by:
Varun Sundar Rabindranath <varun@neuralmagic.com>
-
Harry Mellor authored
-
Tyler Michael Smith authored
-
Tyler Michael Smith authored
Signed-off-by:Tyler Michael Smith <tyler@neuralmagic.com>
-
- 13 Feb, 2025 8 commits
-
-
Nicolò Lucchesi authored
-
Vaibhav Jain authored
-
Cyrus Leung authored
-
Cyrus Leung authored
-
Rui Qiao authored
-
LikeSundayLikeRain authored
[Bugfix] deepseek_r1_reasoning_parser put reason content in wrong field in certain edge case (#13097)
-
Isotr0py authored
-
Kaixi Hou authored
-
- 12 Feb, 2025 7 commits
-
-
Michael Goin authored
Signed-off-by:mgoin <mgoin64@gmail.com>
-
Qubitium-ModelCloud authored
-
Jee Jee Li authored
-
Rafael Vasquez authored
-
Lingfan Yu authored
[Neuron][Kernel] Support Longer Sequences in NKI-based Flash PagedAttention and Improve Efficiency (#12921) Signed-off-by:Lingfan Yu <lingfany@amazon.com>
-
Christian Pinto authored
-
Keyun Tong authored
Signed-off-by:Keyun Tong <tongkeyun@gmail.com>
-
- 11 Feb, 2025 7 commits
-
-
ℍ𝕠𝕝𝕝𝕠𝕨 𝕄𝕒𝕟 authored
Signed-off-by:Hollow Man <hollowman@opensuse.org>
-
Mark McLoughlin authored
Signed-off-by:Mark McLoughlin <markmc@redhat.com>
-
மனோஜ்குமார் பழனிச்சாமி authored
Signed-off-by:மனோஜ்குமார் பழனிச்சாமி <smartmanoj42857@gmail.com>
-
Cody Yu authored
-
Ce Gao authored
Signed-off-by:Ce Gao <cegao@tensorchord.ai>
-
Varun Sundar Rabindranath authored
Signed-off-by:
Varun Sundar Rabindranath <varun@neuralmagic.com> Co-authored-by:
Varun Sundar Rabindranath <varun@neuralmagic.com>
-
Florian Greinacher authored
Signed-off-by:Florian Greinacher <florian.greinacher@siemens.com>
-