- 13 Aug, 2024 1 commit
-
-
Cyrus Leung authored
-
- 09 Aug, 2024 1 commit
-
-
Siyuan Liu authored
-
- 08 Aug, 2024 1 commit
-
-
Isotr0py authored
-
- 06 Aug, 2024 1 commit
-
-
Cyrus Leung authored
Co-authored-by:Roger Wang <136131678+ywang96@users.noreply.github.com>
-
- 05 Aug, 2024 1 commit
-
-
Isotr0py authored
Co-authored-by:Michael Goin <michael@neuralmagic.com>
-
- 01 Aug, 2024 1 commit
-
-
Woosuk Kwon authored
-
- 31 Jul, 2024 1 commit
-
-
Michael Goin authored
-
- 24 Jul, 2024 1 commit
-
-
liuyhwangyh authored
-
- 23 Jul, 2024 4 commits
-
-
dongmao zhang authored
Co-authored-by:Michael Goin <michael@neuralmagic.com>
-
Simon Mo authored
-
youkaichao authored
-
zhaotyer authored
Co-authored-by:
tianyi.zhao <tianyi.zhao@transwarp.io> Co-authored-by:
youkaichao <youkaichao@126.com>
-
- 17 Jul, 2024 1 commit
-
-
youkaichao authored
-
- 16 Jul, 2024 2 commits
-
-
Michael Goin authored
-
Mor Zusman authored
Co-authored-by:Mor Zusman <morz@ai21.com>
-
- 15 Jul, 2024 1 commit
-
-
Woosuk Kwon authored
-
- 14 Jul, 2024 1 commit
-
-
Robert Shaw authored
-
- 03 Jul, 2024 2 commits
-
-
xwjiang2010 authored
Signed-off-by:
Xiaowei Jiang <xwjiang2010@gmail.com> Co-authored-by:
Roger Wang <ywang@roblox.com>
-
youkaichao authored
-
- 02 Jul, 2024 1 commit
-
-
xwjiang2010 authored
Signed-off-by:
Xiaowei Jiang <xwjiang2010@gmail.com> Co-authored-by:
Cyrus Leung <cyrus.tl.leung@gmail.com> Co-authored-by:
Roger Wang <ywang@roblox.com>
-
- 01 Jul, 2024 1 commit
-
-
youkaichao authored
-
- 28 Jun, 2024 1 commit
-
-
Ilya Lavrenov authored
-
- 27 Jun, 2024 1 commit
-
-
Cyrus Leung authored
-
- 20 Jun, 2024 1 commit
-
-
Michael Goin authored
-
- 15 Jun, 2024 1 commit
-
-
Cyrus Leung authored
-
- 12 Jun, 2024 2 commits
-
-
Travis Johnson authored
Signed-off-by:
Travis Johnson <tsjohnso@us.ibm.com> Co-authored-by:
Sanger Steel <sangersteel@gmail.com> Co-authored-by:
Roger Wang <ywang@roblox.com>
-
Woosuk Kwon authored
-
- 10 Jun, 2024 1 commit
-
-
Dipika Sikka authored
Co-authored-by:Michael Goin <michael@neuralmagic.com>
-
- 01 Jun, 2024 2 commits
-
-
chenqianfzh authored
-
Ye Cao authored
Signed-off-by:Ye Cao <caoye.cao@alibaba-inc.com>
-
- 24 May, 2024 1 commit
-
-
Robert Shaw authored
Co-authored-by:Cody Yu <hao.yu.cody@gmail.com>
-
- 23 May, 2024 1 commit
-
-
Dipika Sikka authored
Co-authored-by:
Varun Sundar Rabindranath <varunsundar08@gmail.com> Co-authored-by:
Varun Sundar Rabindranath <varun@neuralmagic.com>
-
- 20 May, 2024 2 commits
-
-
Aurick Qiao authored
-
Mor Zusman authored
Allow dummy load format for fp8, torch.uniform_ doesn't support FP8 at the moment Co-authored-by:Mor Zusman <morz@ai21.com>
-
- 19 May, 2024 1 commit
-
-
Cyrus Leung authored
-
- 16 May, 2024 1 commit
-
-
Aurick Qiao authored
Co-authored-by:Woosuk Kwon <woosuk.kwon@berkeley.edu>
-
- 13 May, 2024 2 commits
-
-
Sanger Steel authored
[Frontend] [Core] perf: Automatically detect vLLM-tensorized model, update `tensorizer` to version 2.9.0 (#4208)
-
Woosuk Kwon authored
-
- 10 May, 2024 1 commit
-
-
SangBin Cho authored
Storing exception frame is extremely prone to circular refernece because it contains the reference to objects. When tensorizer is not installed, it leaks llm instance because error frame has references to various modules which cause circular reference problem. I also found spec decoding has a circular reference issue, and I solved it using weakref.proxy.
-
- 02 May, 2024 1 commit
-
-
youkaichao authored
-