- 24 Jul, 2024 2 commits
- 20 Jul, 2024 2 commits
- 17 Jul, 2024 1 commit
-
-
zhuwenwen authored
-
- 14 Jul, 2024 1 commit
-
-
Robert Shaw authored
-
- 06 Jul, 2024 1 commit
-
-
zhuwenwen authored
-
- 03 Jul, 2024 2 commits
-
-
xwjiang2010 authored
Signed-off-by: Xiaowei Jiang <xwjiang2010@gmail.com>
Co-authored-by: Roger Wang <ywang@roblox.com>
-
youkaichao authored
-
- 02 Jul, 2024 1 commit
-
-
xwjiang2010 authored
Signed-off-by: Xiaowei Jiang <xwjiang2010@gmail.com>
Co-authored-by: Cyrus Leung <cyrus.tl.leung@gmail.com>
Co-authored-by: Roger Wang <ywang@roblox.com>
-
- 01 Jul, 2024 2 commits
-
-
youkaichao authored
-
zhuwenwen authored
-
- 28 Jun, 2024 2 commits
-
-
Ilya Lavrenov authored
-
zhuwenwen authored
-
- 27 Jun, 2024 1 commit
-
-
Cyrus Leung authored
-
- 20 Jun, 2024 1 commit
-
-
Michael Goin authored
-
- 15 Jun, 2024 1 commit
-
-
Cyrus Leung authored
-
- 12 Jun, 2024 2 commits
-
-
Travis Johnson authored
Signed-off-by: Travis Johnson <tsjohnso@us.ibm.com>
Co-authored-by: Sanger Steel <sangersteel@gmail.com>
Co-authored-by: Roger Wang <ywang@roblox.com>
-
Woosuk Kwon authored
-
- 10 Jun, 2024 1 commit
-
-
Dipika Sikka authored
Co-authored-by: Michael Goin <michael@neuralmagic.com>
-
- 01 Jun, 2024 2 commits
-
-
chenqianfzh authored
-
Ye Cao authored
Signed-off-by: Ye Cao <caoye.cao@alibaba-inc.com>
-
- 30 May, 2024 1 commit
-
-
zhuwenwen authored
-
- 25 May, 2024 2 commits
- 24 May, 2024 1 commit
-
-
Robert Shaw authored
Co-authored-by: Cody Yu <hao.yu.cody@gmail.com>
-
- 23 May, 2024 1 commit
-
-
Dipika Sikka authored
Co-authored-by: Varun Sundar Rabindranath <varunsundar08@gmail.com>
Co-authored-by: Varun Sundar Rabindranath <varun@neuralmagic.com>
-
- 20 May, 2024 2 commits
-
-
Aurick Qiao authored
-
Mor Zusman authored
Allow dummy load format for fp8, torch.uniform_ doesn't support FP8 at the moment
Co-authored-by: Mor Zusman <morz@ai21.com>
-
- 19 May, 2024 1 commit
-
-
Cyrus Leung authored
-
- 16 May, 2024 1 commit
-
-
Aurick Qiao authored
Co-authored-by: Woosuk Kwon <woosuk.kwon@berkeley.edu>
-
- 13 May, 2024 2 commits
-
-
Sanger Steel authored
[Frontend] [Core] perf: Automatically detect vLLM-tensorized model, update `tensorizer` to version 2.9.0 (#4208)
-
Woosuk Kwon authored
-
- 10 May, 2024 1 commit
-
-
SangBin Cho authored
Storing an exception frame is extremely prone to circular references because the frame holds references to objects. When tensorizer is not installed, the llm instance leaks because the error frame references various modules, which creates a circular reference. I also found that spec decoding has a circular reference issue, and I solved it using weakref.proxy.
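The weakref.proxy fix mentioned above can be illustrated with a minimal sketch. This is not the code from the commit: `Engine`, `Worker`, and `engine_is_freed_without_gc` are hypothetical names chosen for illustration, and the immediate cleanup relies on CPython's reference counting.

```python
import gc
import weakref


class Worker:
    """Hypothetical child object that needs to call back into its owner."""

    def __init__(self, owner):
        # weakref.proxy does not increase the owner's reference count,
        # so owner -> worker -> owner never becomes a strong cycle.
        self.owner = weakref.proxy(owner)

    def owner_name(self):
        # Attribute access is forwarded to the owner while it is alive.
        return self.owner.name


class Engine:
    """Hypothetical owner that holds a strong reference to its worker."""

    def __init__(self, name):
        self.name = name
        self.worker = Worker(self)


def engine_is_freed_without_gc():
    """Return True if the engine is released by reference counting alone."""
    was_enabled = gc.isenabled()
    gc.disable()  # rule out the cycle collector doing the cleanup
    try:
        engine = Engine("demo")
        probe = weakref.ref(engine)  # observes the engine without owning it
        assert engine.worker.owner_name() == "demo"
        del engine  # drops the only strong reference
        return probe() is None
    finally:
        if was_enabled:
            gc.enable()
```

If `Worker` stored `owner` directly instead of a proxy, the two objects would keep each other alive until the cycle collector ran, which is the kind of leak the commit message describes.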
-
- 02 May, 2024 1 commit
-
-
youkaichao authored
-
- 30 Apr, 2024 1 commit
-
-
Alpay Ariyak authored
-
- 29 Apr, 2024 1 commit
-
-
SangBin Cho authored
-
- 27 Apr, 2024 1 commit
-
-
Prashant Gupta authored
Signed-off-by: Prashant Gupta <prashantgupta@us.ibm.com>
Co-authored-by: Travis Johnson <tjohnson31415@gmail.com>
-
- 26 Apr, 2024 2 commits
-
-
Cody Yu authored
-
SangBin Cho authored
Co-authored-by: Danny Guinther <dguinther@neuralmagic.com>
-