- 31 Jul, 2025 3 commits
-
-
Faraz authored
Signed-off-by:Faraz Khoubsirat <58580514+farazkh80@users.noreply.github.com>
-
Chang Su authored
-
Cheng Wan authored
-
- 29 Jul, 2025 1 commit
-
-
Lifu Huang authored
Co-authored-by:Stefan He <hebiaobuaa@gmail.com>
-
- 28 Jul, 2025 1 commit
-
-
Qiaolin Yu authored
Co-authored-by:
tianqilin.99 <tianqilin.99@bytedance.com> Co-authored-by:
Baizhou Zhang <sobereddiezhang@gmail.com>
-
- 25 Jul, 2025 2 commits
-
-
Lianmin Zheng authored
-
Cheng Wan authored
-
- 23 Jul, 2025 3 commits
-
-
Xinyuan Tong authored
Signed-off-by:
Xinyuan Tong <xinyuantong.cs@gmail.com> Co-authored-by:
Xinyuan Tong <xinyuantong.cs@gmail.com>
-
Zhiqiang Xie authored
Co-authored-by:pansicheng <sicheng.pan.chn@gmail.com>
-
Lifu Huang authored
-
- 19 Jul, 2025 2 commits
-
-
Lifu Huang authored
-
Mick authored
-
- 17 Jul, 2025 1 commit
-
-
Cheng Wan authored
-
- 16 Jul, 2025 1 commit
-
-
Xiaoze Fan authored
Signed-off-by:
jason-fxz <jason341132@qq.com> Co-authored-by:
gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
-
- 14 Jul, 2025 2 commits
-
-
ykcombat authored
-
Lifu Huang authored
-
- 13 Jul, 2025 1 commit
-
-
Hanming Lu authored
Co-authored-by:Ying Sheng <sqy1415@gmail.com>
-
- 04 Jul, 2025 1 commit
-
-
Lianmin Zheng authored
Move mem_fraction_static adjustment for multimodal models to `server_args.py` & Fix session control & Other cleanups (#7748)
-
- 03 Jul, 2025 5 commits
-
-
Chunyuan WU authored
[CPU] support the case where num_attention_heads or intermediate_size is not divisible by the TP size (#6771)
-
ronnie_zheng authored
Co-authored-by:
Maksim <makcum888e@mail.ru> Co-authored-by:
VDV1985 <vladdv85@mail.ru>
-
Chunyuan WU authored
Co-authored-by:blzheng <beilei.zheng@intel.com>
-
Chunyuan WU authored
Co-authored-by:srinarayan-srikanthan <srinarayan.srikanthan@intel.com>
-
Zilin Zhu authored
-
- 01 Jul, 2025 1 commit
-
-
lukec authored
Co-authored-by:
shuaills <shishuaiuoe@gmail.com> Co-authored-by:
Shenggui Li <somerlee.9@gmail.com> Co-authored-by:
Yingyi Huang <yingyihuang2000@outlook.com> Co-authored-by:
yizhang2077 <1109276519@qq.com>
-
- 30 Jun, 2025 1 commit
-
-
Lianmin Zheng authored
Co-authored-by:Kan Wu <wukanustc@gmail.com>
-
- 29 Jun, 2025 1 commit
-
-
fzyzcjy authored
-
- 28 Jun, 2025 2 commits
-
-
Lifu Huang authored
-
tarinkk authored
Co-authored-by:
Cheng Wan <54331508+ch-wan@users.noreply.github.com> Co-authored-by:
tarinkk <rt572@physics.rutger.edu> Co-authored-by:
tarinkk <rt572@rutgers.physics.edu> Co-authored-by:
Hanming Lu <69857889+hanming-lu@users.noreply.github.com>
-
- 27 Jun, 2025 1 commit
-
-
Qiaolin Yu authored
Co-authored-by:Cheng Wan <54331508+ch-wan@users.noreply.github.com>
-
- 25 Jun, 2025 1 commit
-
-
Yuhong Guo authored
-
- 24 Jun, 2025 1 commit
-
-
xianzhiT authored
-
- 22 Jun, 2025 1 commit
-
-
Liangsheng Yin authored
-
- 21 Jun, 2025 1 commit
-
-
Lifu Huang authored
Refactor LoRAManager and LoRAMemoryPool state management logic for dynamic LoRA loading support (#7412)
-
- 19 Jun, 2025 1 commit
-
-
Stefan He authored
-
- 18 Jun, 2025 1 commit
-
-
YanbingJiang authored
Co-authored-by:
Wu, Chunyuan <chunyuan.wu@intel.com> Co-authored-by:
jianan-gu <jianan.gu@intel.com> Co-authored-by:
sdp <sdp@gnr799219.jf.intel.com>
-
- 17 Jun, 2025 1 commit
-
-
u4lr451 authored
Co-authored-by:
austindeng <austindeng@tencent.com> Co-authored-by:
tianqilin.99 <tianqilin.99@bytedance.com> Co-authored-by:
Qiaolin Yu <liin1211@outlook.com> Co-authored-by:
ch-wan <cwan39@gatech.edu>
-
- 16 Jun, 2025 1 commit
-
-
KavioYu authored
Co-authored-by:kavioyu <kavioyu@tencent.com>
-
- 14 Jun, 2025 1 commit
-
-
fzyzcjy authored
-
- 10 Jun, 2025 2 commits
-
-
Byron Hsu authored
-
Baizhou Zhang authored
-