- 09 Apr, 2026 1 commit
Maral authored
[W8A8 Block Linear Refactor][2/N] Remove W8A8Fp8BlockLinearOp and adopt Fp8 block linear kernel selections. (#33892)
Signed-off-by: maral <maralbahari.98@gmail.com>
Signed-off-by: Maral <maralbahari.98@gmail.com>
-
- 22 Dec, 2025 1 commit
CedricHuang authored
[Feature]: Support NVIDIA ModelOpt HF FP8 variants FP8_PER_CHANNEL_PER_TOKEN and FP8_PB_WO in vLLM (#30957)
-
- 18 Nov, 2025 1 commit
Alex authored
Signed-off-by: Alex Yun <alexyun04@gmail.com>
-
- 05 Oct, 2025 1 commit
Harry Mellor authored
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
- 20 Sep, 2025 1 commit
Cyrus Leung authored
Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
-
- 03 Sep, 2025 1 commit
co63oc authored
Signed-off-by: co63oc <co63oc@users.noreply.github.com>
-
- 21 Jul, 2025 1 commit
Zhiyu authored
Signed-off-by: Zhiyu Cheng <zhiyuc@nvidia.com>