- 15 Mar, 2024 10 commits
-
-
Harry Mellor authored
-
youkaichao authored
-
Tao He authored
Signed-off-by:
Tao He <sighingnow@gmail.com> Co-authored-by:
simon-mo <simon.mo@hey.com>
-
Dan Clark authored
Co-authored-by:declark1 <daniel.clark@ibm.com>
-
Yang Fan authored
-
Junda Chen authored
-
Dinghow Yang authored
-
Dinghow Yang authored
-
youkaichao authored
Co-authored-by:Simon Mo <simon.mo@hey.com>
-
akhoroshev authored
-
- 14 Mar, 2024 8 commits
-
-
Enrique Shockwave authored
-
陈序 authored
Co-authored-by:Cade Daniel <edacih@gmail.com>
-
youkaichao authored
-
Dan Clark authored
Co-authored-by:Daniel Clark <daniel.clark@ibm.com>
-
Thomas Parnell authored
-
youkaichao authored
[Kernel] change benchmark script so that result can be directly used; tune moe kernel in A100/H100 with tp=2,4,8 (#3389)
-
Allen.Dou authored
-
Simon Mo authored
-
- 13 Mar, 2024 10 commits
-
-
Zhuohan Li authored
-
Antoni Baum authored
-
Terry authored
-
Or Sharir authored
Add missing kernel for CodeLlama-34B on A/H100 (no tensor parallelism) when using Multi-LoRA. (#3350)
-
陈序 authored
-
Hui Liu authored
-
Ronan McGovern authored
-
Bo-Wen Wang authored
Co-authored-by:Woosuk Kwon <woosuk.kwon@berkeley.edu>
-
Woosuk Kwon authored
-
Breno Faria authored
-
- 12 Mar, 2024 1 commit
-
-
Sherlock Xu authored
Signed-off-by:Sherlock113 <sherlockxu07@gmail.com>
-
- 11 Mar, 2024 7 commits
-
-
DAIZHENWEI authored
-
kliuae authored
-
Zhuohan Li authored
-
Philipp Moritz authored
-
Zhuohan Li authored
-
Nick Hill authored
-
Roy authored
-
- 10 Mar, 2024 2 commits
-
-
Douglas Lehr authored
-
Terry authored
-
- 09 Mar, 2024 2 commits
-
-
Cade Daniel authored
-
Zhuohan Li authored
-