- 07 Aug, 2024 6 commits
-
-
Dipika Sikka authored
[Misc] Refactor linear layer weight loading; introduce `BasevLLMParameter` and `weight_loader_v2` (#5874)
-
youkaichao authored
-
Cyrus Leung authored
Co-authored-by:Roger Wang <ywang@roblox.com>
-
Atilla Akkuş authored
-
Nick Hill authored
-
Michael Goin authored
-
- 06 Aug, 2024 7 commits
-
-
afeldman-nm authored
[Core] Subclass ModelRunner to support cross-attention & encoder sequences (towards eventual encoder/decoder model support) (#4942) Co-authored-by:
Andrew Feldman <afeld2012@gmail.com> Co-authored-by:
Nick Hill <nickhill@us.ibm.com>
-
xiaobochen123 authored
-
Luka Govedič authored
Co-authored-by:Tyler Michael Smith <tyler@neuralmagic.com>
-
Lily Liu authored
-
Robert Shaw authored
-
Cyrus Leung authored
Co-authored-by:Roger Wang <136131678+ywang96@users.noreply.github.com>
-
Jee Jee Li authored
-
- 05 Aug, 2024 13 commits
-
-
Isotr0py authored
Co-authored-by:Michael Goin <michael@neuralmagic.com>
-
Cody Yu authored
-
Jacob Schein authored
Co-authored-by:Jacob Schein <jacobschein@Jacobs-MacBook-Pro-2.local>
-
Thomas Parnell authored
Signed-off-by:Thomas Parnell <tpa@zurich.ibm.com>
-
Simon Mo authored
-
Rui Qiao authored
Signed-off-by:Rui Qiao <ruisearch42@gmail.com>
-
Aditya Paliwal authored
-
Nick Hill authored
-
Bongwon Jang authored
-
Cade Daniel authored
-
Jungho Christopher Cho authored
Co-authored-by:Roger Wang <ywang@roblox.com>
-
Cyrus Leung authored
-
Alphi authored
Co-authored-by:
hezhihui <hzh7269@modelbest.cn> Co-authored-by:
Cyrus Leung <cyrus.tl.leung@gmail.com>
-
- 04 Aug, 2024 6 commits
-
-
Jee Jee Li authored
-
youkaichao authored
-
Thomas Parnell authored
[Bugfix] [SpecDecode] Default speculative_draft_tensor_parallel_size to 1 when using MLPSpeculator (#7105) Signed-off-by:Thomas Parnell <tpa@zurich.ibm.com>
-
Jee Jee Li authored
Co-authored-by:Cyrus Leung <cyrus.tl.leung@gmail.com>
-
youkaichao authored
-
Yihuan Bu authored
Co-authored-by:Cyrus Leung <cyrus.tl.leung@gmail.com>
-
- 03 Aug, 2024 6 commits
-
-
Jeff Fialho authored
Signed-off-by:
Jefferson Fialho <jfialho@ibm.com> Co-authored-by:
Nick Hill <nickhill@us.ibm.com>
-
Jee Jee Li authored
-
Zach Zheng authored
-
Isotr0py authored
-
Cyrus Leung authored
-
Robert Shaw authored
Signed-off-by:
Joe Runde <Joseph.Runde@ibm.com> Co-authored-by:
Joe Runde <Joseph.Runde@ibm.com> Co-authored-by:
Joe Runde <joe@joerun.de> Co-authored-by:
Nick Hill <nickhill@us.ibm.com> Co-authored-by:
Simon Mo <simon.mo@hey.com>
-
- 02 Aug, 2024 2 commits
-
-
youkaichao authored
-
Rui Qiao authored
Signed-off-by:Rui Qiao <ruisearch42@gmail.com>
-