- 26 Jul, 2024 1 commit
-
-
Li, Jiang authored
[Hardware] [Intel] Enable Multiprocessing and tensor parallel in CPU backend and update documentation (#6125)
-
- 18 Jul, 2024 1 commit
-
-
Rui Qiao authored
Signed-off-by:
Rui Qiao <ruisearch42@gmail.com> Co-authored-by:
Stephanie Wang <swang@cs.berkeley.edu>
-
- 16 Jul, 2024 1 commit
-
-
Cyrus Leung authored
-
- 15 Jul, 2024 1 commit
-
-
DefTruth authored
-
- 13 Jul, 2024 1 commit
-
-
Woosuk Kwon authored
-
- 01 Jul, 2024 1 commit
-
-
Avshalom Manevich authored
-
- 28 Jun, 2024 1 commit
-
-
Ilya Lavrenov authored
-
- 21 Jun, 2024 2 commits
-
-
youkaichao authored
Co-authored-by:Cody Yu <hao.yu.cody@gmail.com>
-
youkaichao authored
-
- 12 Jun, 2024 1 commit
-
-
Woosuk Kwon authored
-
- 11 Jun, 2024 1 commit
-
-
Woosuk Kwon authored
-
- 07 Jun, 2024 2 commits
-
-
Roger Wang authored
Co-authored-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
youkaichao authored
-
- 25 May, 2024 1 commit
-
-
youkaichao authored
-
- 20 May, 2024 1 commit
-
-
Wenwei Zhang authored
-
- 13 May, 2024 1 commit
-
-
Sanger Steel authored
[Frontend] [Core] perf: Automatically detect vLLM-tensorized model, update `tensorizer` to version 2.9.0 (#4208)
-
- 03 May, 2024 2 commits
-
-
youkaichao authored
-
youkaichao authored
-
- 02 May, 2024 1 commit
-
-
youkaichao authored
-