- 15 Mar, 2024 1 commit
-
-
youkaichao authored
Co-authored-by: Simon Mo <simon.mo@hey.com>
-
- 14 Mar, 2024 4 commits
-
-
Enrique Shockwave authored
-
陈序 authored
Co-authored-by: Cade Daniel <edacih@gmail.com>
-
Dan Clark authored
Co-authored-by: Daniel Clark <daniel.clark@ibm.com>
-
youkaichao authored
[Kernel] change benchmark script so that result can be directly used; tune moe kernel in A100/H100 with tp=2,4,8 (#3389)
-
- 13 Mar, 2024 7 commits
-
-
Zhuohan Li authored
-
Antoni Baum authored
-
Terry authored
-
Hui Liu authored
-
Bo-Wen Wang authored
Co-authored-by: Woosuk Kwon <woosuk.kwon@berkeley.edu>
-
Woosuk Kwon authored
-
Breno Faria authored
-
- 11 Mar, 2024 5 commits
-
-
DAIZHENWEI authored
-
Zhuohan Li authored
-
Zhuohan Li authored
-
Nick Hill authored
-
Roy authored
-
- 09 Mar, 2024 2 commits
-
-
Cade Daniel authored
-
Zhuohan Li authored
-
- 08 Mar, 2024 5 commits
-
-
Michael Goin authored
-
Woosuk Kwon authored
-
whyiug authored
-
Nick Hill authored
-
ElizaWszola authored
-
- 07 Mar, 2024 4 commits
-
-
jacobthebanana authored
Possible fix for conflict between Automated Prefix Caching (#2762) and multi-LoRA support (#1804) (#3263)
-
Michael Goin authored
-
Woosuk Kwon authored
-
TechxGenus authored
-
- 06 Mar, 2024 3 commits
-
-
Chujie Zheng authored
-
Cade Daniel authored
-
Nick Hill authored
Co-authored-by: Antoni Baum <antoni.baum@protonmail.com>
-
- 05 Mar, 2024 2 commits
-
-
Nick Hill authored
-
Hongxia Yang authored
Co-authored-by: lcskrishna <lollachaitanya@gmail.com>
-
- 04 Mar, 2024 4 commits
-
-
Antoni Baum authored
Co-authored-by: Zhuohan Li <zhuohan123@gmail.com>
-
Antoni Baum authored
Co-authored-by: Avnish Narayan <avnish@anyscale.com>
-
ttbachyinsda authored
Co-authored-by: guofangze <guofangze@kuaishou.com>
-
Philipp Moritz authored
Co-authored-by: Roger Wang <136131678+ywang96@users.noreply.github.com>
-
- 03 Mar, 2024 2 commits
-
-
Zhuohan Li authored
-
Jason Cox authored
-
- 02 Mar, 2024 1 commit
-
-
Sage Moore authored
Co-authored-by: ElizaWszola <eliza@neuralmagic.com>
Co-authored-by: Michael Goin <michael@neuralmagic.com>
-