- 07 Apr, 2025 1 commit
-
-
Roger Wang authored
Signed-off-by: Roger Wang <ywang@roblox.com>
-
- 04 Apr, 2025 2 commits
-
-
Roger Wang authored
-
Cyrus Leung authored
Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
Signed-off-by: mgoin <mgoin64@gmail.com>
Signed-off-by: Roger Wang <ywang@roblox.com>
Co-authored-by: mgoin <mgoin64@gmail.com>
Co-authored-by: Roger Wang <ywang@roblox.com>
-
- 01 Apr, 2025 4 commits
-
-
Gerald authored
Signed-off-by: qscqesze <475517977@qq.com>
Co-authored-by: qingjun <qingjun@minimaxi.com>
Co-authored-by: qscqesze <475517977@qq.com>
-
Jennifer Zhao authored
Signed-off-by: Jennifer Zhao <ai.jenniferzhao@gmail.com>
Signed-off-by: Roger Wang <ywang@roblox.com>
Co-authored-by: Roger Wang <ywang@roblox.com>
-
Michael Goin authored
Signed-off-by: mgoin <mgoin64@gmail.com>
Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
Co-authored-by: DarkLight1337 <tlleungac@connect.ust.hk>
Co-authored-by: Cyrus Leung <cyrus.tl.leung@gmail.com>
-
Harry Mellor authored
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
- 31 Mar, 2025 3 commits
-
-
Cyrus Leung authored
Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
-
Harry Mellor authored
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
Naveassaf authored
Signed-off-by: Nave Assaf <nassaf@nvidia.com>
-
- 30 Mar, 2025 1 commit
-
-
yihong authored
Signed-off-by: yihong0618 <zouzou0208@gmail.com>
-
- 29 Mar, 2025 1 commit
-
-
pengyuange authored
Signed-off-by: jiacai.liu <932997367@qq.com>
Co-authored-by: jiacai.liu <932997367@qq.com>
-
- 26 Mar, 2025 1 commit
-
-
Harry Mellor authored
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
Signed-off-by: Isotr0py <2037008807@qq.com>
Co-authored-by: Isotr0py <2037008807@qq.com>
Co-authored-by: Isotr0py <mozf@mail2.sysu.edu.cn>
-
- 22 Mar, 2025 1 commit
-
-
Naitong Yu authored
Signed-off-by: Naitong Yu <ntyu@baai.ac.cn>
Signed-off-by: jiangxin <horizon94@outlook.com>
Co-authored-by: Jason Fang <jasonfang3900@gmail.com>
Co-authored-by: jiangxin <horizon94@outlook.com>
-
- 18 Mar, 2025 1 commit
-
-
yury-tokpanov authored
Signed-off-by: Yury Tokpanov <yury@zyphra.com>
Signed-off-by: Quentin Anthony <qganthony@yahoo.com>
Co-authored-by: Quentin Anthony <qganthony@yahoo.com>
Co-authored-by: Tyler Michael Smith <tysmith@redhat.com>
Co-authored-by: Cyrus Leung <cyrus.tl.leung@gmail.com>
-
- 15 Mar, 2025 1 commit
-
-
Robert Shaw authored
Signed-off-by: rshaw@neuralmagic.com <rshaw@neuralmagic.com>
Co-authored-by: rshaw@neuralmagic.com <rshaw@neuralmagic.com>
Co-authored-by: Nicolò Lucchesi <nlucches@redhat.com>
Co-authored-by: Tyler Michael Smith <tyler@neuralmagic.com>
Co-authored-by: Michael Goin <michael@neuralmagic.com>
-
- 14 Mar, 2025 1 commit
-
-
Roger Wang authored
Signed-off-by: Roger Wang <ywang@roblox.com>
-
- 12 Mar, 2025 2 commits
-
-
Woosuk Kwon authored
Signed-off-by: Woosuk Kwon <woosuk.kwon@berkeley.edu>
Signed-off-by: Roger Wang <ywang@roblox.com>
Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
Co-authored-by: Roger Wang <ywang@roblox.com>
Co-authored-by: DarkLight1337 <tlleungac@connect.ust.hk>
-
Farzad Abdolhosseini authored
Signed-off-by: Farzad Abdolhosseini <farzad@fixie.ai>
-
- 05 Mar, 2025 1 commit
-
-
Congcong Chen authored
-
- 04 Mar, 2025 1 commit
-
-
Travis Johnson authored
Signed-off-by: Travis Johnson <tsjohnso@us.ibm.com>
Co-authored-by: Cyrus Leung <cyrus.tl.leung@gmail.com>
-
- 03 Mar, 2025 1 commit
-
-
Harry Mellor authored
-
- 28 Feb, 2025 1 commit
-
-
Harry Mellor authored
-
- 27 Feb, 2025 1 commit
-
-
Isotr0py authored
-
- 26 Feb, 2025 3 commits
-
-
Cyrus Leung authored
-
Roger Wang authored
-
Michael Goin authored
Signed-off-by: mgoin <mgoin64@gmail.com>
-
- 19 Feb, 2025 2 commits
-
-
Lucia Fang authored
Signed-off-by: Lu Fang <fanglu@fb.com>
Co-authored-by: LiuXiaoxuanPKU <lilyliupku@gmail.com>
-
Kevin H. Luu authored
Signed-off-by: <>
Co-authored-by: EC2 Default User <ec2-user@ip-172-31-20-117.us-west-2.compute.internal>
-
- 17 Feb, 2025 1 commit
-
-
Tyler Michael Smith authored
Signed-off-by: Tyler Michael Smith <tyler@neuralmagic.com>
Co-authored-by: Yu Chin Fabian Lim <flim@sg.ibm.com>
-
- 14 Feb, 2025 1 commit
-
-
Harry Mellor authored
-
- 13 Feb, 2025 2 commits
-
-
Cyrus Leung authored
-
Cyrus Leung authored
-
- 12 Feb, 2025 1 commit
-
-
Christian Pinto authored
-
- 10 Feb, 2025 1 commit
-
-
Farzad Abdolhosseini authored
Signed-off-by: Farzad Abdolhosseini <farzad@fixie.ai>
-
- 06 Feb, 2025 1 commit
-
-
Yu Chin Fabian Lim authored
Signed-off-by: Yu Chin Fabian Lim <flim@sg.ibm.com>
Signed-off-by: Tyler Michael Smith <tyler@neuralmagic.com>
Co-authored-by: Tyler Michael Smith <tyler@neuralmagic.com>
-
- 05 Feb, 2025 1 commit
-
-
Roger Wang authored
-
- 04 Feb, 2025 2 commits
-
-
Cyrus Leung authored
Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
-
Thomas Parnell authored
Signed-off-by: Thomas Parnell <tpa@zurich.ibm.com>
-
- 03 Feb, 2025 1 commit
-
-
Arthur authored
# Adds support for `transformers` as a backend

Following https://github.com/huggingface/transformers/pull/35235, a number of models should already be supported, and we are ramping up support for more.

Thanks @Isotr0py for the TP support, and @hmellor for his help as well!

This includes:
- `trust_remote_code=True` support: any model on the hub can be natively supported, provided it implements attention the correct way
- tensor parallel support

---------

Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
Signed-off-by: Isotr0py <2037008807@qq.com>
Co-authored-by: Isotr0py <41363108+Isotr0py@users.noreply.github.com>
Co-authored-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
Co-authored-by: Isotr0py <2037008807@qq.com>
Co-authored-by: Cyrus Leung <cyrus.tl.leung@gmail.com>
Co-authored-by: Michael Goin <mgoin64@gmail.com>
Co-authored-by: Isotr0py <mozf@mail2.sysu.edu.cn>
-