- 14 Feb, 2025 8 commits
-
Kero Liang authored
[Bugfix][V1] GPUModelRunner._update_states should return True when there is a finished request in batch (#13126)
-
Sage Moore authored
-
Varun Sundar Rabindranath authored
Signed-off-by: Varun Sundar Rabindranath <varun@neuralmagic.com>
Co-authored-by: Varun Sundar Rabindranath <varun@neuralmagic.com>
-
Michael Goin authored
-
Wang Ran (汪然) authored
-
Roger Wang authored
-
Gregory Shtrasberg authored
Signed-off-by: Gregory Shtrasberg <Gregory.Shtrasberg@amd.com>
-
Tyler Michael Smith authored
Signed-off-by: Tyler Michael Smith <tyler@neuralmagic.com>
-
- 13 Feb, 2025 17 commits
-
Michael Goin authored
Signed-off-by: mgoin <mgoin64@gmail.com>
-
Nicolò Lucchesi authored
-
Vaibhav Jain authored
-
Cyrus Leung authored
-
燃 authored
-
Aoyu authored
Signed-off-by: Aoyu <aoyuzhan@amazon.com>
Signed-off-by: youkaichao <youkaichao@gmail.com>
Co-authored-by: Aoyu <aoyuzhan@amazon.com>
Co-authored-by: youkaichao <youkaichao@gmail.com>
-
Cyrus Leung authored
-
Roger Wang authored
-
Russell Bryant authored
-
Isotr0py authored
-
Rui Qiao authored
-
Daniel Han authored
-
Russell Bryant authored
-
LikeSundayLikeRain authored
[Bugfix] deepseek_r1_reasoning_parser put reason content in wrong field in certain edge case (#13097)
-
Lu Fang authored
Signed-off-by: Lu Fang <lufang@fb.com>
-
Isotr0py authored
-
Kaixi Hou authored
-
- 12 Feb, 2025 13 commits
-
Murali Andoorveedu authored
Signed-off-by: andoorve <37849411+andoorve@users.noreply.github.com>
-
Michael Goin authored
-
Qubitium-ModelCloud authored
-
Lu Fang authored
Signed-off-by: Lu Fang <lufang@fb.com>
-
Jee Jee Li authored
-
Rafael Vasquez authored
-
Cyrus Leung authored
Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
-
bnellnm authored
-
Shiyan Deng authored
-
Maximilien de Bayser authored
-
Lingfan Yu authored
[Neuron][Kernel] Support Longer Sequences in NKI-based Flash PagedAttention and Improve Efficiency (#12921)
Signed-off-by: Lingfan Yu <lingfany@amazon.com>
-
Christian Pinto authored
-
Keyun Tong authored
Signed-off-by: Keyun Tong <tongkeyun@gmail.com>
-
- 11 Feb, 2025 2 commits
-
Yuan Tang authored
Signed-off-by: Yuan Tang <terrytangyuan@gmail.com>
-
Li, Jiang authored
-