- 22 Jul, 2025 31 commits
-
-
Rui Qiao authored
Signed-off-by: Rui Qiao <ruisearch42@gmail.com>
-
Yiheng Xu authored
Signed-off-by: simon-mo <xmo@berkeley.edu>
Co-authored-by: simon-mo <xmo@berkeley.edu>
-
Xin Li authored
Fix Flashinfer Allreduce+Norm enable disable calculation based on `fi_allreduce_fusion_max_token_num` (#21325)
Signed-off-by: XIn Li <xinli@nvidia.com>
-
Cyrus Leung authored
Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
-
Aritra Roy Gosthipaty authored
[Bugfix] Decode Tokenized IDs to Strings for `hf_processor` in `llm.chat()` with `model_impl=transformers` (#21353)
Signed-off-by: ariG23498 <aritra.born2fly@gmail.com>
-
Wang Yijun authored
-
Wentao Ye authored
Signed-off-by: yewentao256 <zhyanwentao@126.com>
-
Raushan Turganbay authored
Signed-off-by: raushan <raushan@huggingface.co>
-
Benjamin Bartels authored
Signed-off-by: bbartels <benjamin@bartels.dev>
Co-authored-by: Cyrus Leung <cyrus.tl.leung@gmail.com>
-
Wentao Ye authored
Signed-off-by: yewentao256 <zhyanwentao@126.com>
-
Duncan Moss authored
Signed-off-by: Duncan Moss <djm.moss@gmail.com>
Signed-off-by: jiahanc <173873397+jiahanc@users.noreply.github.com>
Co-authored-by: jiahanc <173873397+jiahanc@users.noreply.github.com>
Co-authored-by: mgoin <mgoin64@gmail.com>
-
Mickaël Seznec authored
Signed-off-by: Mickael Seznec <mickael@mistral.ai>
Co-authored-by: mgoin <mgoin64@gmail.com>
-
Ning Xie authored
Signed-off-by: Andy Xie <andy.xning@gmail.com>
-
Jialin Ouyang authored
[Core] Introduce popleft_n and append_n in FreeKVCacheBlockQueue to further optimize block_pool (#21222)
Signed-off-by: Jialin Ouyang <Jialin.Ouyang@gmail.com>
-
Jialin Ouyang authored
Signed-off-by: Jialin Ouyang <Jialin.Ouyang@gmail.com>
-
Jialin Ouyang authored
Signed-off-by: Jialin Ouyang <Jialin.Ouyang@gmail.com>
-
Kebe authored
Signed-off-by: Kebe <mail@kebe7jun.com>
-
Simon Mo authored
Signed-off-by: simon-mo <simon.mo@hey.com>
-
Raghav Ravishankar authored
Signed-off-by: alyosha-swamy <raghav@arcee.ai>
Signed-off-by: Jee Jee Li <pandaleefree@gmail.com>
Co-authored-by: Jee Jee Li <pandaleefree@gmail.com>
-
rongfu.leng authored
Signed-off-by: rongfu.leng <rongfu.leng@daocloud.io>
-
Shu Wang authored
Signed-off-by: Shu Wang <shuw@nvidia.com>
-
Wentao Ye authored
Signed-off-by: yewentao256 <zhyanwentao@126.com>
-
Varun Sundar Rabindranath authored
Signed-off-by: Varun Sundar Rabindranath <vsundarr@redhat.com>
Co-authored-by: Varun Sundar Rabindranath <vsundarr@redhat.com>
-
Konrad Zawora authored
Signed-off-by: Konrad Zawora <kzawora@habana.ai>
Signed-off-by: Chendi.Xue <chendi.xue@intel.com>
Co-authored-by: Chendi.Xue <chendi.xue@intel.com>
-
Wentao Ye authored
Signed-off-by: yewentao256 <zhyanwentao@126.com>
-
Thomas Parnell authored
Signed-off-by: Thomas Parnell <tpa@zurich.ibm.com>
-
Jialin Ouyang authored
Signed-off-by: Jialin Ouyang <Jialin.Ouyang@gmail.com>
-
Ming Yang authored
Signed-off-by: Ming Yang <minos.future@gmail.com>
-
Ratnam Parikh authored
Signed-off-by: ratnampa <ratnam.parikh@intel.com>
-
Li, Jiang authored
Signed-off-by: jiang1.li <jiang1.li@intel.com>
-
Chaojun Zhang authored
Signed-off-by: chzhang <chaojun.zhang@intel.com>
-
- 21 Jul, 2025 9 commits
-
-
Lu Fang authored
Signed-off-by: Lu Fang <lufang@fb.com>
-
Himanshu Jaju authored
Signed-off-by: Himanshu Jaju <hj@mistral.ai>
-
Michael Goin authored
-
Robert Shaw authored
Signed-off-by: Robert Shaw <robshaw@redhat.com>
Co-authored-by: Robert Shaw <robshaw@redhat.com>
-
Lucas Wilkinson authored
Signed-off-by: Lucas Wilkinson <lwilkins@redhat.com>
Co-authored-by: Michael Goin <mgoin64@gmail.com>
-
Ming Yang authored
Signed-off-by: Ming Yang <minos.future@gmail.com>
Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
-
simpx authored
Signed-off-by: simpx <simpxx@gmail.com>
-
Li, Jiang authored
Signed-off-by: jiang1.li <jiang1.li@intel.com>
-
Woosuk Kwon authored
Signed-off-by: Woosuk Kwon <woosuk.kwon@berkeley.edu>
-