- 18 Dec, 2024 3 commits
-
-
Konrad Zawora authored
Signed-off-by:Konrad Zawora <kzawora@habana.ai>
-
Cody Yu authored
Signed-off-by:Cody Yu <hao.yu.cody@gmail.com>
-
Michael Goin authored
-
- 17 Dec, 2024 10 commits
-
-
Joe Runde authored
-
Isotr0py authored
Signed-off-by:Isotr0py <2037008807@qq.com>
-
Roger Wang authored
Signed-off-by:
Roger Wang <ywang@roblox.com> Co-authored-by:
Xiaoyu Zhang <BBuf@users.noreply.github.com>
-
Tyler Michael Smith authored
Signed-off-by:Tyler Michael Smith <tyler@neuralmagic.com>
-
wangxiyuan authored
Signed-off-by:wangxiyuan <wangxiyuan1007@gmail.com>
-
Roger Wang authored
Signed-off-by:
Roger Wang <ywang@roblox.com> Co-authored-by:
ywang96 <ywang@example.com>
-
kYLe authored
Signed-off-by:
DarkLight1337 <tlleungac@connect.ust.hk> Co-authored-by:
DarkLight1337 <tlleungac@connect.ust.hk>
-
Michael Goin authored
[CI] Add test case with JSON schema using references + use xgrammar by default with OpenAI parse (#10935) Signed-off-by:mgoin <michael@neuralmagic.com>
-
youkaichao authored
Signed-off-by:youkaichao <youkaichao@gmail.com>
-
youkaichao authored
Signed-off-by:
youkaichao <youkaichao@gmail.com> Co-authored-by:
Tyler Michael Smith <tyler@neuralmagic.com>
-
- 16 Dec, 2024 11 commits
-
-
youkaichao authored
Signed-off-by:youkaichao <youkaichao@gmail.com>
-
bk-TurbaAI authored
[Docs] hint to enable use of GPU performance counters in profiling tools for multi-node distributed serving (#11235) Co-authored-by:Michael Goin <michael@neuralmagic.com>
-
youkaichao authored
Signed-off-by:youkaichao <youkaichao@gmail.com>
-
Varun Sundar Rabindranath authored
Signed-off-by:
Varun Sundar Rabindranath <varun@neuralmagic.com> Co-authored-by:
Varun Sundar Rabindranath <varun@neuralmagic.com>
-
Isotr0py authored
Signed-off-by:Isotr0py <2037008807@qq.com>
-
Isotr0py authored
Signed-off-by:
Isotr0py <2037008807@qq.com> Co-authored-by:
Cyrus Leung <cyrus.tl.leung@gmail.com>
-
Jani Monoses authored
-
cennn authored
Signed-off-by:
drikster80 <ed.sealing@gmail.com> Co-authored-by:
drikster80 <ed.sealing@gmail.com>
-
yansh97 authored
-
chenqianfzh authored
-
AlexHe99 authored
-
- 15 Dec, 2024 7 commits
-
-
Woosuk Kwon authored
Signed-off-by:Woosuk Kwon <woosuk.kwon@berkeley.edu>
-
shangmingc authored
Signed-off-by:Shangming Cai <caishangming@linux.alibaba.com>
-
Kuntai Du authored
Signed-off-by:Kuntai Du <kuntai@uchicago.edu>
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
youkaichao authored
Signed-off-by:youkaichao <youkaichao@gmail.com>
-
Jee Jee Li authored
-
- 14 Dec, 2024 9 commits
-
-
Sungjae Lee authored
[Performance][Core] Optimize the performance of evictor v1 and v2 by applying a priority queue and lazy deletion (#7209)
-
Mark McLoughlin authored
Signed-off-by:Mark McLoughlin <markmc@redhat.com>
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
Brad Hilton authored
Signed-off-by:Brad Hilton <brad.hilton.nw@gmail.com>
-
Jee Jee Li authored
Signed-off-by:Jee Jee Li <pandaleefree@gmail.com>
-
Tyler Michael Smith authored
Signed-off-by:Tyler Michael Smith <tyler@neuralmagic.com>
-
Russell Bryant authored
Signed-off-by:Russell Bryant <rbryant@redhat.com>
-
dhuangnm authored
Co-authored-by:dhuangnm <dhuang@MacBook-Pro-2.local>
-
Cody Yu authored
Signed-off-by:Cody Yu <hao.yu.cody@gmail.com>
-