- 19 Jun, 2024 1 commit
-
-
Dipika Sikka authored
[Misc] Add per channel support for static activation quantization; update w8a8 schemes to share base classes (#5650)
-
- 18 Jun, 2024 1 commit
-
-
Dipika Sikka authored
[Misc] Add channel-wise quantization support for w8a8 dynamic per token activation quantization (#5542)
-
- 17 Jun, 2024 1 commit
-
-
Dipika Sikka authored
-
- 13 Jun, 2024 1 commit
-
-
Dipika Sikka authored
Co-authored-by:Robert Shaw <114415538+robertgshaw2-neuralmagic@users.noreply.github.com>
-
- 10 Jun, 2024 1 commit
-
-
Dipika Sikka authored
Co-authored-by:Michael Goin <michael@neuralmagic.com>
-
- 08 Jun, 2024 1 commit
-
-
youkaichao authored
[CI/Test] improve robustness of test by replacing del with context manager (vllm_runner) (#5357)
-
- 07 Jun, 2024 1 commit
-
-
Dipika Sikka authored
Co-authored-by:
Varun Sundar Rabindranath <varunsundar08@gmail.com> Co-authored-by:
Varun Sundar Rabindranath <varun@neuralmagic.com>
-
- 23 May, 2024 1 commit
-
-
Dipika Sikka authored
Co-authored-by:
Varun Sundar Rabindranath <varunsundar08@gmail.com> Co-authored-by:
Varun Sundar Rabindranath <varun@neuralmagic.com>
-