- 15 Jan, 2025 1 commit
-
-
youkaichao authored
Signed-off-by:youkaichao <youkaichao@gmail.com>
-
- 09 Jan, 2025 1 commit
-
-
youkaichao authored
Signed-off-by:youkaichao <youkaichao@gmail.com>
-
- 06 Jan, 2025 1 commit
-
-
cennn authored
-
- 05 Jan, 2025 1 commit
-
-
cennn authored
-
- 04 Jan, 2025 1 commit
-
-
Yan Burman authored
Signed-off-by:
Yan Burman <yanburman@users.noreply.github.com> Signed-off-by:
Ido Asraff <idoa@atero.ai>
-
- 30 Dec, 2024 1 commit
-
-
youkaichao authored
Signed-off-by:youkaichao <youkaichao@gmail.com>
-
- 12 Dec, 2024 1 commit
-
-
youkaichao authored
Signed-off-by:youkaichao <youkaichao@gmail.com>
-
- 02 Dec, 2024 1 commit
-
-
Kuntai Du authored
This PR provides initial support for single-node disaggregated prefill in 1P1D scenario. Signed-off-by:
KuntaiDu <kuntai@uchicago.edu> Co-authored-by:
ApostaC <yihua98@uchicago.edu> Co-authored-by:
YaoJiayi <120040070@link.cuhk.edu.cn>
-
- 26 Nov, 2024 1 commit
-
-
Sage Moore authored
Signed-off-by:
youkaichao <youkaichao@gmail.com> Co-authored-by:
youkaichao <youkaichao@gmail.com>
-
- 08 Nov, 2024 1 commit
-
-
Yan Ma authored
Signed-off-by:yan ma <yan.ma@intel.com>
-
- 06 Nov, 2024 2 commits
-
-
Russell Bryant authored
Signed-off-by:
Russell Bryant <rbryant@redhat.com> Signed-off-by:
DarkLight1337 <tlleungac@connect.ust.hk> Co-authored-by:
DarkLight1337 <tlleungac@connect.ust.hk>
-
Konrad Zawora authored
Signed-off-by:
yuwenzho <yuwen.zhou@intel.com> Signed-off-by:
Chendi.Xue <chendi.xue@intel.com> Signed-off-by:
Bob Zhu <bob.zhu@intel.com> Signed-off-by:
zehao-intel <zehao.huang@intel.com> Signed-off-by:
Konrad Zawora <kzawora@habana.ai> Co-authored-by:
Kunshang Ji <kunshang.ji@intel.com> Co-authored-by:
Sanju C Sudhakaran <scsudhakaran@habana.ai> Co-authored-by:
Michal Adamczyk <madamczyk@habana.ai> Co-authored-by:
Marceli Fylcek <mfylcek@habana.ai> Co-authored-by:
Himangshu Lahkar <49579433+hlahkar@users.noreply.github.com> Co-authored-by:
Vivek Goel <vgoel@habana.ai> Co-authored-by:
yuwenzho <yuwen.zhou@intel.com> Co-authored-by:
Dominika Olszewska <dolszewska@habana.ai> Co-authored-by:
barak goldberg <149692267+bgoldberg-habana@users.noreply.github.com> Co-authored-by:
Michal Szutenberg <37601244+szutenberg@users.noreply.github.com> Co-authored-by:
Jan Kaniecki <jkaniecki@habana.ai> Co-authored-by: Agata Dobrzynie...
-
- 01 Nov, 2024 1 commit
-
-
youkaichao authored
Signed-off-by:youkaichao <youkaichao@gmail.com>
-
- 30 Oct, 2024 1 commit
-
-
Yan Ma authored
Signed-off-by:
YiSheng5 <syhm@mail.ustc.edu.cn> Signed-off-by:
yan ma <yan.ma@intel.com> Co-authored-by:
YiSheng5 <syhm@mail.ustc.edu.cn>
-
- 24 Oct, 2024 1 commit
-
-
Yongzao authored
Signed-off-by:
youkaichao <youkaichao@gmail.com> Co-authored-by:
youkaichao <youkaichao@gmail.com>
-
- 22 Oct, 2024 1 commit
-
-
wangshuai09 authored
-
- 18 Oct, 2024 1 commit
-
-
Cody Yu authored
-
- 04 Oct, 2024 1 commit
-
-
youkaichao authored
-
- 21 Sep, 2024 1 commit
-
-
Kunshang Ji authored
Co-authored-by:youkaichao <youkaichao@126.com>
-
- 18 Sep, 2024 1 commit
-
-
Cyrus Leung authored
-
- 17 Sep, 2024 1 commit
-
-
youkaichao authored
-
- 13 Aug, 2024 1 commit
-
-
Cyrus Leung authored
-
- 05 Aug, 2024 1 commit
-
-
Rui Qiao authored
Signed-off-by:Rui Qiao <ruisearch42@gmail.com>
-
- 01 Aug, 2024 1 commit
-
-
Aurick Qiao authored
Co-authored-by:Aurick Qiao <aurick.qiao@snowflake.com>
-
- 31 Jul, 2024 1 commit
-
-
Cyrus Leung authored
-
- 27 Jul, 2024 1 commit
-
-
Woosuk Kwon authored
-
- 26 Jul, 2024 1 commit
-
-
Li, Jiang authored
[Hardware] [Intel] Enable Multiprocessing and tensor parallel in CPU backend and update documentation (#6125)
-
- 25 Jul, 2024 1 commit
-
-
Tyler Michael Smith authored
Co-authored-by:Lucas Wilkinson <lwilkinson@neuralmagic.com>
-
- 10 Jul, 2024 1 commit
-
-
youkaichao authored
[core][distributed] add zmq fallback for broadcasting large objects (#6183)
-
- 03 Jul, 2024 1 commit
-
-
youkaichao authored
-
- 02 Jul, 2024 1 commit
-
-
Murali Andoorveedu authored
Signed-off-by:Muralidhar Andoorveedu <muralidhar.andoorveedu@centml.ai>
-
- 29 Jun, 2024 1 commit
-
-
Cyrus Leung authored
-
- 28 Jun, 2024 2 commits
-
-
xwjiang2010 authored
Signed-off-by:Xiaowei Jiang <xwjiang2010@gmail.com>
-
xwjiang2010 authored
Signed-off-by:
Xiaowei Jiang <xwjiang2010@gmail.com> Co-authored-by:
Roger Wang <ywang@roblox.com>
-
- 27 Jun, 2024 1 commit
-
-
xwjiang2010 authored
Signed-off-by:Xiaowei Jiang <xwjiang2010@gmail.com>
-
- 25 Jun, 2024 1 commit
-
-
Woo-Yeon Lee authored
[Speculative Decoding] Support draft model on different tensor-parallel size than target model (#5414)
-
- 23 Jun, 2024 1 commit
-
-
Murali Andoorveedu authored
-
- 21 Jun, 2024 1 commit
-
-
youkaichao authored
Co-authored-by:Cody Yu <hao.yu.cody@gmail.com>
-
- 17 Jun, 2024 1 commit
-
-
Kunshang Ji authored
Co-authored-by:
Jiang Li <jiang1.li@intel.com> Co-authored-by:
Abhilash Majumder <abhilash.majumder@intel.com> Co-authored-by:
Abhilash Majumder <30946547+abhilash1910@users.noreply.github.com>
-
- 14 Jun, 2024 1 commit
-
-
youkaichao authored
-