- 18 Mar, 2024 5 commits
-
-
Zhuohan Li authored
-
Robert Shaw authored
-
Cade Daniel authored
-
Simon Mo authored
-
Woosuk Kwon authored
-
- 17 Mar, 2024 2 commits
-
-
Simon Mo authored
-
Woosuk Kwon authored
Co-authored-by:youkaichao <youkaichao@126.com>
-
- 16 Mar, 2024 9 commits
-
-
Simon Mo authored
-
Simon Mo authored
-
Simon Mo authored
-
simon-mo authored
-
Dinghow Yang authored
-
Ronen Schaffer authored
-
Tao He authored
-
youkaichao authored
Co-authored-by:Zhuohan Li <zhuohan123@gmail.com>
-
Robert Shaw authored
-
- 15 Mar, 2024 12 commits
-
-
Antoni Baum authored
-
laneeee authored
-
Harry Mellor authored
-
youkaichao authored
-
Tao He authored
Signed-off-by:
Tao He <sighingnow@gmail.com> Co-authored-by:
simon-mo <simon.mo@hey.com>
-
Dan Clark authored
Co-authored-by:declark1 <daniel.clark@ibm.com>
-
Yang Fan authored
-
Junda Chen authored
-
Dinghow Yang authored
-
Dinghow Yang authored
-
youkaichao authored
Co-authored-by:Simon Mo <simon.mo@hey.com>
-
akhoroshev authored
-
- 14 Mar, 2024 8 commits
-
-
Enrique Shockwave authored
-
陈序 authored
Co-authored-by:Cade Daniel <edacih@gmail.com>
-
youkaichao authored
-
Dan Clark authored
Co-authored-by:Daniel Clark <daniel.clark@ibm.com>
-
Thomas Parnell authored
-
youkaichao authored
[Kernel] change benchmark script so that result can be directly used; tune moe kernel in A100/H100 with tp=2,4,8 (#3389)
-
Allen.Dou authored
-
Simon Mo authored
-
- 13 Mar, 2024 4 commits
-
-
Zhuohan Li authored
-
Antoni Baum authored
-
Terry authored
-
Or Sharir authored
Add missing kernel for CodeLlama-34B on A/H100 (no tensor parallelism) when using Multi-LoRA. (#3350)
-