- 23 Jan, 2025 1 commit
-
-
youkaichao authored
Signed-off-by: youkaichao <youkaichao@gmail.com>
-
- 22 Jan, 2025 1 commit
-
-
youkaichao authored
Signed-off-by: youkaichao <youkaichao@gmail.com>
-
- 18 Jan, 2025 1 commit
-
-
youkaichao authored
Signed-off-by: youkaichao <youkaichao@gmail.com>
-
- 16 Jan, 2025 1 commit
-
-
youkaichao authored
Signed-off-by: youkaichao <youkaichao@gmail.com>
-
- 13 Jan, 2025 1 commit
-
-
Chenguang Li authored
Signed-off-by: Chenguang Li <757486878@qq.com>
-
- 10 Jan, 2025 1 commit
-
-
Chen Zhang authored
Signed-off-by: Chen Zhang <zhangch99@outlook.com>
-
- 16 Dec, 2024 1 commit
-
-
youkaichao authored
Signed-off-by: youkaichao <youkaichao@gmail.com>
-
- 11 Dec, 2024 2 commits
-
-
Cyrus Leung authored
Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
-
Cyrus Leung authored
Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
-
- 02 Dec, 2024 1 commit
-
-
Kuntai Du authored
This PR provides initial support for single-node disaggregated prefill in 1P1D scenario.
Signed-off-by: KuntaiDu <kuntai@uchicago.edu>
Co-authored-by: ApostaC <yihua98@uchicago.edu>
Co-authored-by: YaoJiayi <120040070@link.cuhk.edu.cn>
-
- 01 Dec, 2024 1 commit
-
-
Cyrus Leung authored
Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
-
- 27 Nov, 2024 1 commit
-
-
Chendi.Xue authored
Signed-off-by: Chendi Xue <chendi.xue@intel.com>
-
- 20 Nov, 2024 1 commit
-
-
youkaichao authored
Signed-off-by: youkaichao <youkaichao@gmail.com>
-
- 15 Nov, 2024 1 commit
-
-
Jee Jee Li authored
Signed-off-by: Jee Jee Li <pandaleefree@gmail.com>
-
- 07 Nov, 2024 1 commit
-
-
Cyrus Leung authored
Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
-
- 02 Nov, 2024 1 commit
-
-
youkaichao authored
Signed-off-by: youkaichao <youkaichao@gmail.com>
Co-authored-by: Nick Hill <nhill@redhat.com>
-
- 19 Oct, 2024 1 commit
-
-
Joe Runde authored
Signed-off-by: Joe Runde <Joseph.Runde@ibm.com>
-
- 18 Oct, 2024 2 commits
-
-
Cyrus Leung authored
-
Joe Runde authored
Signed-off-by: Joe Runde <Joseph.Runde@ibm.com>
-
- 11 Oct, 2024 1 commit
-
-
Tyler Michael Smith authored
-
- 18 Sep, 2024 1 commit
-
-
Cyrus Leung authored
-
- 11 Sep, 2024 1 commit
-
-
bnellnm authored
Co-authored-by: Sage Moore <sage@neuralmagic.com>
-
- 30 Aug, 2024 1 commit
-
-
afeldman-nm authored
-
- 22 Aug, 2024 1 commit
-
-
Abhinav Goyal authored
-
- 21 Aug, 2024 1 commit
-
-
William Lin authored
Co-authored-by: Cody Yu <hao.yu.cody@gmail.com>
-
- 19 Aug, 2024 1 commit
-
-
SangBin Cho authored
-
- 17 Aug, 2024 2 commits
-
-
Roger Wang authored
-
youkaichao authored
-
- 11 Aug, 2024 1 commit
-
-
William Lin authored
-
- 09 Aug, 2024 2 commits
-
-
Mahesh Keralapura authored
-
Cyrus Leung authored
-
- 06 Aug, 2024 1 commit
-
-
afeldman-nm authored
[Core] Subclass ModelRunner to support cross-attention & encoder sequences (towards eventual encoder/decoder model support) (#4942)
Co-authored-by: Andrew Feldman <afeld2012@gmail.com>
Co-authored-by: Nick Hill <nickhill@us.ibm.com>
-
- 02 Aug, 2024 1 commit
-
-
youkaichao authored
-
- 17 Jul, 2024 1 commit
-
-
Woosuk Kwon authored
-
- 10 Jul, 2024 1 commit
-
-
Abhinav Goyal authored
-
- 09 Jul, 2024 1 commit
-
-
Swapnil Parekh authored
Co-authored-by: Swapnil Parekh <swapnilp@ibm.com>
Co-authored-by: Joe G <joseph.granados@h2o.ai>
Co-authored-by: Antoni Baum <antoni.baum@protonmail.com>
-
- 03 Jul, 2024 3 commits
-
-
youkaichao authored
-
xwjiang2010 authored
Signed-off-by: Xiaowei Jiang <xwjiang2010@gmail.com>
Co-authored-by: Roger Wang <ywang@roblox.com>
-
youkaichao authored
-
- 02 Jul, 2024 1 commit
-
-
Murali Andoorveedu authored
Signed-off-by: Muralidhar Andoorveedu <muralidhar.andoorveedu@centml.ai>
-