- 27 Jul, 2024 22 commits
-
-
Michael Goin authored
-
Alexander Matveev authored
-
Woosuk Kwon authored
-
Chenggang Wu authored
-
Cyrus Leung authored
Co-authored-by:ywang96 <ywang@roblox.com>
-
Roger Wang authored
-
Wang Ran (汪然) authored
-
Roger Wang authored
-
Roger Wang authored
Co-authored-by:Cyrus Leung <tlleungac@connect.ust.hk>
-
Travis Johnson authored
[Bugfix] Use torch.set_num_threads() to configure parallelism in multiproc_gpu_executor (#6802) Signed-off-by:Travis Johnson <tsjohnso@us.ibm.com>
-
Harry Mellor authored
-
Woosuk Kwon authored
-
Joe authored
-
tomeras91 authored
-
omrishiv authored
Signed-off-by:omrishiv <327609+omrishiv@users.noreply.github.com>
-
Woosuk Kwon authored
-
Sanger Steel authored
-
Lucas Wilkinson authored
-
Cyrus Leung authored
-
Woosuk Kwon authored
-
chenqianfzh authored
-
Gurpreet Singh Dhami authored
-
- 26 Jul, 2024 13 commits
-
-
Zhanghao Wu authored
-
Michael Goin authored
-
Li, Jiang authored
[Hardware] [Intel] Enable Multiprocessing and tensor parallel in CPU backend and update documentation (#6125)
-
Woosuk Kwon authored
-
Woosuk Kwon authored
-
Tyler Michael Smith authored
-
Michael Goin authored
-
youkaichao authored
-
Peng Guanwen authored
-
Anthony Platanios authored
-
QQSong authored
-
Kevin H. Luu authored
[ci] Mark tensorizer test as soft fail and separate it from grouped test in fast check (#6810) Signed-off-by:kevin <kevin@anyscale.com>
-
youkaichao authored
-
- 25 Jul, 2024 5 commits
-
-
SangBin Cho authored
-
Woosuk Kwon authored
-
youkaichao authored
-
Lucas Wilkinson authored
[Bugfix] Fix empty (nullptr) channelwise scales when loading wNa16 using compressed tensors (#6798)
-
Kuntai Du authored
-