- 23 Jan, 2025 2 commits
-
-
Junichi Sato authored
Signed-off-by:Junichi Sato <junichi.sato@sbintuitions.co.jp>
-
Gregory Shtrasberg authored
Signed-off-by:
Gregory Shtrasberg <Gregory.Shtrasberg@amd.com> Co-authored-by:
Micah Williamson <micah.williamson@amd.com>
-
- 22 Jan, 2025 1 commit
-
-
Nick Hill authored
Signed-off-by:Nick Hill <nhill@redhat.com>
-
- 21 Jan, 2025 1 commit
-
-
Divakar Verma authored
Signed-off-by:Divakar Verma <divakar.verma@amd.com>
-
- 19 Jan, 2025 1 commit
-
-
gujing authored
Signed-off-by:
zibai <zibai.gj@alibaba-inc.com> Co-authored-by:
Roger Wang <136131678+ywang96@users.noreply.github.com>
-
- 17 Jan, 2025 1 commit
-
-
Divakar Verma authored
Signed-off-by:Divakar Verma <divakar.verma@amd.com>
-
- 16 Jan, 2025 1 commit
-
-
Varun Sundar Rabindranath authored
-
- 13 Jan, 2025 1 commit
-
-
elijah authored
Signed-off-by:elijah <f1renze.142857@gmail.com>
-
- 10 Jan, 2025 2 commits
-
-
minmin authored
Signed-off-by:
Ren MinMin <renmm6@chinaunicom.cn> Co-authored-by:
Ren MinMin <renmm6@chinaunicom.cn>
-
Kuntai Du authored
Signed-off-by:Kuntai Du <kuntai@uchicago.edu>
-
- 09 Jan, 2025 1 commit
-
-
Ye (Charlotte) Qi authored
Signed-off-by:
Ye Qi <yeq@meta.com> Co-authored-by:
yeq <yeq@devgpu004.lla3.facebook.com>
-
- 08 Jan, 2025 1 commit
-
-
Divakar Verma authored
-
- 01 Jan, 2025 1 commit
-
-
Yihua Cheng authored
Signed-off-by:
ApostaC <yihua98@uchicago.edu> Co-authored-by:
KuntaiDu <kuntai@uchicago.edu>
-
- 25 Dec, 2024 1 commit
-
-
Jiaxin Shan authored
Signed-off-by:Jiaxin Shan <seedjeffwan@gmail.com>
-
- 19 Dec, 2024 1 commit
-
-
Varun Sundar Rabindranath authored
Signed-off-by:
Varun Sundar Rabindranath <varun@neuralmagic.com> Co-authored-by:
Varun Sundar Rabindranath <varun@neuralmagic.com>
-
- 18 Dec, 2024 1 commit
-
-
Dipika Sikka authored
Co-authored-by:
Faraz Shahsavan <faraz.shahsavan@gmail.com> Co-authored-by:
ilmarkov <markovilya197@gmail.com> Co-authored-by:
Rahul Tuli <rahul@neuralmagic.com> Co-authored-by:
rshaw@neuralmagic.com <rshaw@neuralmagic.com>
-
- 17 Dec, 2024 1 commit
-
-
Roger Wang authored
Signed-off-by:
Roger Wang <ywang@roblox.com> Co-authored-by:
Xiaoyu Zhang <BBuf@users.noreply.github.com>
-
- 13 Dec, 2024 2 commits
-
-
Alexander Matveev authored
Signed-off-by:Alexander Matveev <alexm@neuralmagic.com>
-
Luka Govedič authored
Signed-off-by:
luka <luka@neuralmagic.com> Co-authored-by:
Varun Sundar Rabindranath <varun@neuralmagic.com>
-
- 04 Dec, 2024 2 commits
-
-
Chendi.Xue authored
Signed-off-by:
Chendi Xue <chendi.xue@intel.com> Co-authored-by:
Michael Goin <michael@neuralmagic.com>
-
Chendi.Xue authored
Signed-off-by:
Aaron Pham <contact@aarnphm.xyz> Signed-off-by:
Chendi Xue <chendi.xue@intel.com> Co-authored-by:
Aaron Pham <contact@aarnphm.xyz>
-
- 03 Dec, 2024 1 commit
-
-
Michael Goin authored
-
- 02 Dec, 2024 1 commit
-
-
Kuntai Du authored
This PR provides initial support for single-node disaggregated prefill in 1P1D scenario. Signed-off-by:
KuntaiDu <kuntai@uchicago.edu> Co-authored-by:
ApostaC <yihua98@uchicago.edu> Co-authored-by:
YaoJiayi <120040070@link.cuhk.edu.cn>
-
- 01 Dec, 2024 1 commit
-
-
Roger Wang authored
Signed-off-by:
Roger Wang <ywang@roblox.com> Co-authored-by:
Chen Zhang <zhangch99@outlook.com> Co-authored-by:
Isotr0py <2037008807@qq.com>
-
- 21 Nov, 2024 1 commit
-
-
Wang, Yi authored
Signed-off-by:Wang, Yi A <yi.a.wang@intel.com>
-
- 19 Nov, 2024 1 commit
-
-
ElizaWszola authored
Signed-off-by:ElizaWszola <eliza@neuralmagic.com>
-
- 18 Nov, 2024 2 commits
-
-
Ricky Xu authored
Signed-off-by:rickyx <rickyx@anyscale.com>
-
Lucas Wilkinson authored
Signed-off-by:Lucas Wilkinson <lwilkinson@neuralmagic.com>
-
- 16 Nov, 2024 1 commit
-
-
Jaehyun An authored
Signed-off-by:rbbang <anjaehyun87@gmail.com>
-
- 08 Nov, 2024 3 commits
-
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
DearPlanet authored
-
Cody Yu authored
Signed-off-by:Cody Yu <hao.yu.cody@gmail.com>
-
- 07 Nov, 2024 2 commits
-
-
Russell Bryant authored
Signed-off-by:Russell Bryant <rbryant@redhat.com>
-
Atlas authored
Signed-off-by:
Mozhou <spli161006@gmail.com> Co-authored-by:
Roger Wang <136131678+ywang96@users.noreply.github.com>
-
- 06 Nov, 2024 1 commit
-
-
Aaron Pham authored
Signed-off-by:Aaron Pham <contact@aarnphm.xyz>
-
- 05 Nov, 2024 1 commit
-
-
lkchen authored
Signed-off-by:
Linkun Chen <github+anyscale@lkchen.net> Co-authored-by:
Linkun Chen <github+anyscale@lkchen.net>
-
- 04 Nov, 2024 2 commits
-
-
lkchen authored
Signed-off-by:
Linkun Chen <github+anyscale@lkchen.net> Co-authored-by:
Linkun Chen <lkchen@github.com> Co-authored-by:
Linkun Chen <github+anyscale@lkchen.net>
-
Tran Quang Dai authored
Signed-off-by:daitran2k1 <tranquangdai7a@gmail.com>
-
- 31 Oct, 2024 1 commit
-
-
Guillaume Calmettes authored
[Misc][OpenAI] deprecate max_tokens in favor of new max_completion_tokens field for chat completion endpoint (#9837)
-
- 29 Oct, 2024 1 commit
-
-
wangshuai09 authored
Signed-off-by:wangshuai09 <391746016@qq.com>
-