- 28 Jan, 2025 2 commits
-
-
Mengqing Cao authored
Signed-off-by:Mengqing Cao <cmq0113@163.com>
-
Harry Mellor authored
Signed-off-by:Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
- 27 Jan, 2025 1 commit
-
-
Nicolò Lucchesi authored
[Feature] [Spec decode]: Enable MLPSpeculator/Medusa and `prompt_logprobs` with ChunkedPrefill (#10132) Signed-off-by:
NickLucche <nlucches@redhat.com> Signed-off-by:
wallashss <wallashss@ibm.com> Co-authored-by:
wallashss <wallashss@ibm.com>
-
- 20 Jan, 2025 1 commit
-
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
- 15 Jan, 2025 1 commit
-
-
youkaichao authored
Signed-off-by:youkaichao <youkaichao@gmail.com>
-
- 06 Jan, 2025 1 commit
-
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
- 23 Dec, 2024 1 commit
-
-
Rafael Vasquez authored
Signed-off-by:Rafael Vasquez <rafvasq21@gmail.com>
-
- 11 Dec, 2024 1 commit
-
-
王敏 authored
-
- 07 Dec, 2024 1 commit
-
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
- 06 Dec, 2024 1 commit
-
-
王敏 authored
-
- 05 Dec, 2024 2 commits
-
-
王敏 authored
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
- 04 Dec, 2024 1 commit
-
-
王敏 authored
-
- 27 Nov, 2024 2 commits
-
-
王敏 authored
2.更新medusa readme 3.解决benchmark_moe报错问题
-
Chendi.Xue authored
Signed-off-by:Chendi Xue <chendi.xue@intel.com>
-
- 26 Nov, 2024 1 commit
-
-
Murali Andoorveedu authored
Signed-off-by:
andoorve <37849411+andoorve@users.noreply.github.com> Signed-off-by:
Sourashis Roy <sroy@roblox.com> Co-authored-by:
Sourashis Roy <sroy@roblox.com>
-
- 18 Nov, 2024 1 commit
-
-
王敏 authored
-
- 07 Nov, 2024 1 commit
-
-
Nicolò Lucchesi authored
Signed-off-by:NickLucche <nlucches@redhat.com>
-
- 06 Nov, 2024 1 commit
-
-
王敏 authored
2.examples中添加medusa readme 3.修复model_runner中input_positions配置错误的笔误,解决多个模型运行失败问题
-
- 04 Nov, 2024 1 commit
-
-
youkaichao authored
Signed-off-by:youkaichao <youkaichao@gmail.com>
-
- 02 Nov, 2024 1 commit
-
-
youkaichao authored
Signed-off-by:
youkaichao <youkaichao@gmail.com> Co-authored-by:
Nick Hill <nhill@redhat.com>
-
- 27 Oct, 2024 1 commit
-
-
科英 authored
Signed-off-by:Abatom <abatom@163.com>
-
- 24 Oct, 2024 1 commit
-
-
王敏 authored
-
- 21 Oct, 2024 1 commit
-
-
Thomas Parnell authored
Signed-off-by:Thomas Parnell <tpa@zurich.ibm.com>
-
- 12 Oct, 2024 1 commit
-
-
Lily Liu authored
-
- 11 Oct, 2024 1 commit
-
-
Wallas Henrique authored
Signed-off-by:Wallas Santos <wallashss@ibm.com>
-
- 01 Oct, 2024 1 commit
-
-
Lily Liu authored
-
- 25 Sep, 2024 1 commit
-
-
Travis Johnson authored
Signed-off-by:Travis Johnson <tsjohnso@us.ibm.com>
-
- 22 Sep, 2024 1 commit
-
-
Lily Liu authored
-
- 02 Sep, 2024 1 commit
-
-
Lily Liu authored
-
- 30 Aug, 2024 1 commit
-
-
afeldman-nm authored
-
- 25 Aug, 2024 1 commit
-
-
Nick Hill authored
-
- 22 Aug, 2024 1 commit
-
-
Abhinav Goyal authored
-
- 20 Aug, 2024 1 commit
-
-
Abhinav Goyal authored
-
- 09 Aug, 2024 1 commit
-
-
William Lin authored
-
- 05 Aug, 2024 1 commit
-
-
Cade Daniel authored
-
- 30 Jul, 2024 1 commit
-
-
Nick Hill authored
-
- 24 Jul, 2024 1 commit
-
-
Allen.Dou authored
-
- 21 Jul, 2024 1 commit
-
-
sroy745 authored
[Spec Decode] Disable Log Prob serialization to CPU for spec decoding for both draft and target models. (#6485)
-
- 19 Jul, 2024 1 commit
-
-
Woo-Yeon Lee authored
-