- 03 Aug, 2024 1 commit
-
-
zhuwenwen authored
-
- 31 Jul, 2024 1 commit
-
-
zhuwenwen authored
-
- 08 Jun, 2024 1 commit
-
-
Benjamin Kitor authored
-
- 22 May, 2024 1 commit
-
-
Cody Yu authored
The 2nd PR for #4532. This PR supports loading FP8 kv-cache scaling factors from a FP8 checkpoint (with .kv_scale parameter).
-
- 16 May, 2024 2 commits
- 24 Apr, 2024 1 commit
-
-
zifeitong authored
-
- 18 Apr, 2024 1 commit
-
-
Michael Goin authored
-
- 11 Apr, 2024 1 commit
-
-
SangBin Cho authored
-
- 10 Apr, 2024 1 commit
-
-
Zedong Peng authored
-
- 04 Apr, 2024 1 commit
-
-
TianYu GUO authored
Co-authored-by:Roger Wang <ywang@roblox.com>
-
- 03 Apr, 2024 1 commit
-
-
Adrian Abeyta authored
Co-authored-by:
Gregory Shtrasberg <Gregory.Shtrasberg@amd.com> Co-authored-by:
HaiShaw <hixiao@gmail.com> Co-authored-by:
AdrianAbeyta <Adrian.Abeyta@amd.com> Co-authored-by:
Matthew Wong <Matthew.Wong2@amd.com> Co-authored-by:
root <root@gt-pla-u18-08.pla.dcgpu> Co-authored-by:
mawong-amd <156021403+mawong-amd@users.noreply.github.com> Co-authored-by:
ttbachyinsda <ttbachyinsda@outlook.com> Co-authored-by:
guofangze <guofangze@kuaishou.com> Co-authored-by:
Michael Goin <mgoin64@gmail.com> Co-authored-by:
jacobthebanana <50071502+jacobthebanana@users.noreply.github.com> Co-authored-by:
Woosuk Kwon <woosuk.kwon@berkeley.edu>
-
- 29 Mar, 2024 1 commit
-
-
Yile (Michael) Gu authored
-
- 27 Mar, 2024 1 commit
-
-
AmadeusChan authored
-
- 25 Mar, 2024 1 commit
-
-
SangBin Cho authored
-
- 04 Mar, 2024 1 commit
-
-
Allen.Dou authored
Co-authored-by:zixiao <shunli.dsl@alibaba-inc.com>
-
- 03 Mar, 2024 1 commit
-
-
Zhuohan Li authored
-
- 02 Mar, 2024 1 commit
-
-
Sage Moore authored
Co-authored-by:
ElizaWszola <eliza@neuralmagic.com> Co-authored-by:
Michael Goin <michael@neuralmagic.com>
-
- 01 Feb, 2024 1 commit
-
-
Kunshang Ji authored
Co-authored-by:
Jiang Li <jiang1.li@intel.com> Co-authored-by:
Kunshang Ji <kunshang.ji@intel.com>
-
- 29 Jan, 2024 1 commit
-
-
zhaoyang-star authored
Co-authored-by:
zhaoyang <zhao.yang16@zte.com.cn> Co-authored-by:
Zhuohan Li <zhuohan123@gmail.com>
-
- 17 Dec, 2023 1 commit
-
-
Woosuk Kwon authored
Co-authored-by:
Chen Shen <scv119@gmail.com> Co-authored-by:
Antoni Baum <antoni.baum@protonmail.com>
-
- 15 Dec, 2023 1 commit
-
-
CHU Tianxiang authored
-
- 30 Nov, 2023 1 commit
-
-
aisensiy authored
-
- 20 Nov, 2023 1 commit
-
-
Simon Mo authored
-
- 17 Nov, 2023 1 commit
-
-
Zhuofan authored
-
- 14 Nov, 2023 1 commit
-
-
Woosuk Kwon authored
-
- 22 Oct, 2023 1 commit
-
-
chooper1 authored
Co-authored-by:
squeeze-ai-lab <squeezeailab.bair@gmail.com> Co-authored-by:
Woosuk Kwon <woosuk.kwon@berkeley.edu>
-
- 03 Oct, 2023 1 commit
-
-
Antoni Baum authored
-
- 01 Oct, 2023 1 commit
-
-
kg6-sleipnir authored
-
- 16 Sep, 2023 1 commit
-
-
Woosuk Kwon authored
Co-authored-by:
Robert Irvine <robert@seamlessml.com> Co-authored-by:
root <rirv938@gmail.com> Co-authored-by:
Casper <casperbh.96@gmail.com> Co-authored-by:
julian-q <julianhquevedo@gmail.com>
-
- 20 Jul, 2023 2 commits
-
-
Ricardo Lu authored
-
WRH authored
-
- 28 Jun, 2023 1 commit
-
-
Woosuk Kwon authored
-
- 17 Jun, 2023 3 commits
-
-
Woosuk Kwon authored
-
Zhuohan Li authored
-
Woosuk Kwon authored
-
- 15 Jun, 2023 1 commit
-
-
Woosuk Kwon authored
-
- 04 Jun, 2023 1 commit
-
-
Woosuk Kwon authored
-
- 28 May, 2023 1 commit
-
-
Woosuk Kwon authored
-