- 27 Nov, 2024 1 commit
-
-
Chendi.Xue authored
Signed-off-by:Chendi Xue <chendi.xue@intel.com>
-
- 26 Nov, 2024 1 commit
-
-
Murali Andoorveedu authored
Signed-off-by:
andoorve <37849411+andoorve@users.noreply.github.com> Signed-off-by:
Sourashis Roy <sroy@roblox.com> Co-authored-by:
Sourashis Roy <sroy@roblox.com>
-
- 07 Nov, 2024 1 commit
-
-
Nicolò Lucchesi authored
Signed-off-by:NickLucche <nlucches@redhat.com>
-
- 04 Nov, 2024 1 commit
-
-
youkaichao authored
Signed-off-by:youkaichao <youkaichao@gmail.com>
-
- 02 Nov, 2024 1 commit
-
-
youkaichao authored
Signed-off-by:
youkaichao <youkaichao@gmail.com> Co-authored-by:
Nick Hill <nhill@redhat.com>
-
- 27 Oct, 2024 1 commit
-
-
科英 authored
Signed-off-by:Abatom <abatom@163.com>
-
- 21 Oct, 2024 1 commit
-
-
Thomas Parnell authored
Signed-off-by:Thomas Parnell <tpa@zurich.ibm.com>
-
- 12 Oct, 2024 1 commit
-
-
Lily Liu authored
-
- 11 Oct, 2024 1 commit
-
-
Wallas Henrique authored
Signed-off-by:Wallas Santos <wallashss@ibm.com>
-
- 01 Oct, 2024 1 commit
-
-
Lily Liu authored
-
- 25 Sep, 2024 1 commit
-
-
Travis Johnson authored
Signed-off-by:Travis Johnson <tsjohnso@us.ibm.com>
-
- 22 Sep, 2024 1 commit
-
-
Lily Liu authored
-
- 02 Sep, 2024 1 commit
-
-
Lily Liu authored
-
- 30 Aug, 2024 1 commit
-
-
afeldman-nm authored
-
- 25 Aug, 2024 1 commit
-
-
Nick Hill authored
-
- 22 Aug, 2024 1 commit
-
-
Abhinav Goyal authored
-
- 20 Aug, 2024 1 commit
-
-
Abhinav Goyal authored
-
- 09 Aug, 2024 1 commit
-
-
William Lin authored
-
- 05 Aug, 2024 1 commit
-
-
Cade Daniel authored
-
- 30 Jul, 2024 1 commit
-
-
Nick Hill authored
-
- 24 Jul, 2024 1 commit
-
-
Allen.Dou authored
-
- 21 Jul, 2024 1 commit
-
-
sroy745 authored
[Spec Decode] Disable Log Prob serialization to CPU for spec decoding for both draft and target models. (#6485)
-
- 19 Jul, 2024 2 commits
-
-
Woo-Yeon Lee authored
-
Thomas Parnell authored
Signed-off-by:
Thomas Parnell <tpa@zurich.ibm.com> Co-authored-by:
Nick Hill <nickhill@us.ibm.com>
-
- 17 Jul, 2024 1 commit
-
-
shangmingc authored
Co-authored-by:caishangming.csm <caishangming.csm@alibaba-inc.com>
-
- 10 Jul, 2024 2 commits
-
-
sroy745 authored
[Speculative Decoding] Enabling bonus token in speculative decoding for KV cache based models (#5765)
-
Abhinav Goyal authored
-
- 02 Jul, 2024 1 commit
-
-
Sirej Dua authored
Co-authored-by:Sirej Dua <sirej.dua@databricks.com> Co-authored-by: Sirej Dua <Sirej Dua>
-
- 01 Jul, 2024 1 commit
-
-
sroy745 authored
-
- 28 Jun, 2024 1 commit
-
-
Cody Yu authored
-
- 25 Jun, 2024 1 commit
-
-
Woo-Yeon Lee authored
[Speculative Decoding] Support draft model on different tensor-parallel size than target model (#5414)
-
- 21 Jun, 2024 1 commit
-
-
Joshua Rosenkranz authored
Signed-off-by:
Thomas Parnell <tpa@zurich.ibm.com> Co-authored-by:
Thomas Parnell <tpa@zurich.ibm.com> Co-authored-by:
Nick Hill <nickhill@us.ibm.com> Co-authored-by:
Davis Wertheimer <Davis.Wertheimer@ibm.com>
-
- 15 Jun, 2024 1 commit
-
-
Cyrus Leung authored
-
- 11 Jun, 2024 1 commit
-
-
Nick Hill authored
-
- 05 Jun, 2024 1 commit
-
-
Nick Hill authored
-
- 25 May, 2024 1 commit
-
-
Lily Liu authored
-
- 22 May, 2024 1 commit
-
-
Nick Hill authored
-
- 16 May, 2024 1 commit
-
-
Cody Yu authored
Co-authored-by:
Cade Daniel <edacih@gmail.com> Co-authored-by:
Cade Daniel <cade@anyscale.com>
-
- 08 May, 2024 1 commit
-
-
Cody Yu authored
Co-authored-by:Cade Daniel <edacih@gmail.com>
-
- 07 May, 2024 1 commit
-
-
leiwen83 authored
Co-authored-by:
Lei Wen <wenlei03@qiyi.com> Co-authored-by:
Cade Daniel <edacih@gmail.com> Co-authored-by:
Cody Yu <hao.yu.cody@gmail.com>
-