- 12 Aug, 2025 4 commits
-
-
Po-Han Huang (NVIDIA) authored
Signed-off-by:Po-Han Huang <pohanh@nvidia.com>
-
Yongye Zhu authored
Signed-off-by:Yongye Zhu <zyy1102000@gmail.com>
-
Jun-Howie authored
Signed-off-by:
JunHowie <JunHowie@aliyun.com> Co-authored-by:
JunHowie <JunHowie@aliyun.com> Co-authored-by:
gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
-
Chen Zhang authored
Signed-off-by:Chen Zhang <zhangch99@outlook.com>
-
- 11 Aug, 2025 1 commit
-
-
Wentao Ye authored
Signed-off-by:yewentao256 <zhyanwentao@126.com>
-
- 09 Aug, 2025 1 commit
-
-
Kyuyeun Kim authored
Signed-off-by:Kyuyeun Kim <kyuyeunk@google.com>
-
- 08 Aug, 2025 3 commits
-
-
Yongye Zhu authored
Signed-off-by: <zyy1102000@gmail.com> Signed-off-by:Yongye Zhu <zyy1102000@gmail.com>
-
Po-Han Huang (NVIDIA) authored
Signed-off-by:Po-Han Huang <pohanh@nvidia.com>
-
Shu Wang authored
Signed-off-by:
Shu Wang <shuw@nvidia.com> Signed-off-by:
Po-Han Huang <pohanh@nvidia.com> Signed-off-by:
Shu Wang. <shuw@nvidia.com> Signed-off-by:
XIn Li <xinli@nvidia.com> Co-authored-by:
XIn Li <xinli@nvidia.com>
-
- 07 Aug, 2025 3 commits
-
-
Woosuk Kwon authored
Signed-off-by:Woosuk Kwon <woosuk.kwon@berkeley.edu>
-
Syed Muhammad Bin Asif authored
Signed-off-by:Syed Muhammad Bin Asif <syedmba7@connect.hku.hk>
-
Wentao Ye authored
Signed-off-by:yewentao256 <zhyanwentao@126.com>
-
- 06 Aug, 2025 2 commits
-
-
Yongye Zhu authored
Signed-off-by:
simon-mo <xmo@berkeley.edu> Signed-off-by:
Yongye Zhu <zyy1102000@gmail.com> Co-authored-by:
simon-mo <xmo@berkeley.edu>
-
Jee Jee Li authored
Signed-off-by:Jee Jee Li <pandaleefree@gmail.com>
-
- 02 Aug, 2025 1 commit
-
-
JartX authored
-
- 31 Jul, 2025 2 commits
-
-
Wentao Ye authored
Signed-off-by:yewentao256 <zhyanwentao@126.com>
-
amirkl94 authored
Signed-off-by:
Amir Klein <203507526+amirkl94@users.noreply.github.com> Signed-off-by:
mgoin <mgoin64@gmail.com> Co-authored-by:
mgoin <mgoin64@gmail.com>
-
- 30 Jul, 2025 1 commit
-
-
Po-Han Huang (NVIDIA) authored
Signed-off-by:Po-Han Huang <pohanh@nvidia.com>
-
- 29 Jul, 2025 2 commits
-
-
Wenhua Cheng authored
Signed-off-by:Wenhua Cheng <wenhua.cheng@intel.com>
-
Wentao Ye authored
[Refactor] Merge Compressed Tensor FP8 `CompressedTensorsW8A8Fp8MoEMethod` and `CompressedTensorsW8A8Fp8MoECutlassMethod` (#21775) Signed-off-by:yewentao256 <zhyanwentao@126.com>
-
- 28 Jul, 2025 2 commits
-
-
Nikhil Gupta authored
Signed-off-by:
Nikhil Gupta <nikhil.gupta2@arm.com> Co-authored-by:
mgoin <mgoin64@gmail.com>
-
TJian authored
-
- 27 Jul, 2025 1 commit
-
-
Wentao Ye authored
Signed-off-by:yewentao256 <zhyanwentao@126.com>
-
- 26 Jul, 2025 2 commits
-
-
Alex Kogan authored
Signed-off-by:Alex Kogan <alex.kogan@oracle.com>
-
Wentao Ye authored
Signed-off-by:yewentao256 <zhyanwentao@126.com>
-
- 25 Jul, 2025 1 commit
-
-
Wentao Ye authored
Signed-off-by:yewentao256 <zhyanwentao@126.com>
-
- 24 Jul, 2025 1 commit
-
-
Shu Wang authored
Signed-off-by:Shu Wang. <shuw@nvidia.com>
-
- 22 Jul, 2025 4 commits
-
-
Wentao Ye authored
Signed-off-by:yewentao256 <zhyanwentao@126.com>
-
Duncan Moss authored
Signed-off-by:
Duncan Moss <djm.moss@gmail.com> Signed-off-by:
jiahanc <173873397+jiahanc@users.noreply.github.com> Co-authored-by:
jiahanc <173873397+jiahanc@users.noreply.github.com> Co-authored-by:
mgoin <mgoin64@gmail.com>
-
Mickaël Seznec authored
Signed-off-by:
Mickael Seznec <mickael@mistral.ai> Co-authored-by:
mgoin <mgoin64@gmail.com>
-
Ming Yang authored
Signed-off-by:Ming Yang <minos.future@gmail.com>
-
- 21 Jul, 2025 1 commit
-
-
Zhiyu authored
Signed-off-by:Zhiyu Cheng <zhiyuc@nvidia.com>
-
- 19 Jul, 2025 1 commit
-
-
Kaixi Hou authored
Signed-off-by:
kaixih <kaixih@nvidia.com> Signed-off-by:
mgoin <mgoin64@gmail.com> Co-authored-by:
mgoin <mgoin64@gmail.com>
-
- 18 Jul, 2025 1 commit
-
-
Shu Wang authored
Signed-off-by:
shuw <shuw@nvidia.com> Signed-off-by:
mgoin <mgoin64@gmail.com> Co-authored-by:
mgoin <mgoin64@gmail.com>
-
- 17 Jul, 2025 2 commits
-
-
ElizaWszola authored
Signed-off-by:ElizaWszola <ewszola@redhat.com>
-
Michael Goin authored
Signed-off-by:mgoin <mgoin64@gmail.com>
-
- 16 Jul, 2025 3 commits
-
-
Nir David authored
Support FP8 Quantization and Inference Run on Intel Gaudi (HPU) using INC (Intel Neural Compressor) (#12010) Signed-off-by:
Nir David <ndavid@habana.ai> Signed-off-by:
Uri Livne <ulivne@habana.ai> Co-authored-by:
Uri Livne <ulivne@habana.ai>
-
Wentao Ye authored
Signed-off-by:yewentao256 <zhyanwentao@126.com>
-
Ming Yang authored
Signed-off-by:Ming Yang <minos.future@gmail.com>
-
- 15 Jul, 2025 1 commit
-
-
Ruheena Suhani Shaik authored
Signed-off-by:Ruheena Suhani Shaik <rsshaik@habana.ai>
-