- 08 Aug, 2024 9 commits
-
-
Daniele authored
-
Michael Goin authored
-
Zach Zheng authored
-
Joe Runde authored
Signed-off-by:
Joe Runde <joe@joerun.de> Signed-off-by:
Joe Runde <Joseph.Runde@ibm.com>
-
Luka Govedič authored
-
Jee Jee Li authored
-
Murali Andoorveedu authored
-
Cherilyn Buren authored
-
Rui Qiao authored
Signed-off-by:Rui Qiao <ruisearch42@gmail.com>
-
- 07 Aug, 2024 19 commits
-
-
Lily Liu authored
-
Michael Goin authored
-
Nick Hill authored
-
Lucas Wilkinson authored
-
Kevin H. Luu authored
Signed-off-by:kevin <kevin@anyscale.com>
-
Michael Goin authored
-
Maximilien de Bayser authored
Signed-off-by:
Max de Bayser <mbayser@br.ibm.com> Co-authored-by:
Robert Shaw <114415538+robertgshaw2-neuralmagic@users.noreply.github.com>
-
Stas Bekman authored
-
Ilya Lavrenov authored
-
Isotr0py authored
Co-authored-by:Cyrus Leung <cyrus.tl.leung@gmail.com>
-
Rafael Vasquez authored
Signed-off-by:Rafael Vasquez <rafvasq21@gmail.com>
-
Robert Shaw authored
-
Dipika Sikka authored
[Misc] Refactor linear layer weight loading; introduce `BasevLLMParameter` and `weight_loader_v2` (#5874)
-
youkaichao authored
-
Cyrus Leung authored
Co-authored-by:Roger Wang <ywang@roblox.com>
-
Atilla Akkuş authored
-
Roger Wang authored
-
Nick Hill authored
-
Michael Goin authored
-
- 06 Aug, 2024 10 commits
-
-
afeldman-nm authored
[Core] Subclass ModelRunner to support cross-attention & encoder sequences (towards eventual encoder/decoder model support) (#4942) Co-authored-by:
Andrew Feldman <afeld2012@gmail.com> Co-authored-by:
Nick Hill <nickhill@us.ibm.com>
-
xiaobochen123 authored
-
Luka Govedič authored
Co-authored-by:Tyler Michael Smith <tyler@neuralmagic.com>
-
Lily Liu authored
-
Katarzyna Papis authored
Co-authored-by:katarzyna.papis <kpapis@kpapis-u20.sclab.intel.com>
-
Robert Shaw authored
-
Dipika Sikka authored
-
Cyrus Leung authored
Co-authored-by:Roger Wang <136131678+ywang96@users.noreply.github.com>
-
Jee Jee Li authored
-
Simon Mo authored
-
- 05 Aug, 2024 2 commits
-
-
Isotr0py authored
Co-authored-by:Michael Goin <michael@neuralmagic.com>
-
Cody Yu authored
-