- 20 Feb, 2025 1 commit
-
-
Jee Jee Li authored
-
- 19 Feb, 2025 5 commits
-
-
Wilson Wu authored
-
youkaichao authored
Signed-off-by:youkaichao <youkaichao@gmail.com>
-
youkaichao authored
Signed-off-by:youkaichao <youkaichao@gmail.com>
-
Roger Wang authored
-
Harry Mellor authored
Signed-off-by:Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
- 18 Feb, 2025 3 commits
-
-
youkaichao authored
Signed-off-by:youkaichao <youkaichao@gmail.com>
-
Harry Mellor authored
Signed-off-by:Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
Isotr0py authored
-
- 17 Feb, 2025 3 commits
-
-
Cyrus Leung authored
-
yankooo authored
-
shangmingc authored
Signed-off-by:Shangming Cai <caishangming@linux.alibaba.com>
-
- 16 Feb, 2025 2 commits
-
-
凌 authored
-
Roger Wang authored
Signed-off-by:Roger Wang <ywang@roblox.com>
-
- 15 Feb, 2025 2 commits
-
-
Cyrus Leung authored
-
Nicolò Lucchesi authored
-
- 13 Feb, 2025 5 commits
-
-
Nicolò Lucchesi authored
-
Cyrus Leung authored
-
Cyrus Leung authored
-
Russell Bryant authored
-
Cody Yu authored
-
- 11 Feb, 2025 1 commit
-
-
Harry Mellor authored
Signed-off-by:Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
- 10 Feb, 2025 4 commits
-
-
Farzad Abdolhosseini authored
Signed-off-by:Farzad Abdolhosseini <farzad@fixie.ai>
-
மனோஜ்குமார் பழனிச்சாமி authored
Signed-off-by:மனோஜ்குமார் பழனிச்சாமி <smartmanoj42857@gmail.com>
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
Yuan Tang authored
Signed-off-by:Yuan Tang <terrytangyuan@gmail.com>
-
- 08 Feb, 2025 3 commits
-
-
Jee Jee Li authored
Signed-off-by:Jee Jee Li <pandaleefree@gmail.com>
-
Cyrus Leung authored
-
Jun Duan authored
-
- 07 Feb, 2025 1 commit
-
-
TJian authored
[ROCm] [Feature] [Doc] [Dockerfile] [BugFix] Support Per-Token-Activation Per-Channel-Weight FP8 Quantization Inferencing (#12501)
-
- 06 Feb, 2025 3 commits
-
-
Jitse Klomp authored
-
Sumit Vij authored
-
Cyrus Leung authored
-
- 05 Feb, 2025 3 commits
-
-
Roger Wang authored
-
Russell Bryant authored
-
Michael Goin authored
-
- 04 Feb, 2025 3 commits
-
-
Isotr0py authored
Signed-off-by:
Isotr0py <2037008807@qq.com> Co-authored-by:
Cyrus Leung <cyrus.tl.leung@gmail.com>
-
Cyrus Leung authored
Signed-off-by:
DarkLight1337 <tlleungac@connect.ust.hk> Signed-off-by:
Isotr0py <2037008807@qq.com> Co-authored-by:
Isotr0py <2037008807@qq.com>
-
Thomas Parnell authored
Signed-off-by:Thomas Parnell <tpa@zurich.ibm.com>
-
- 03 Feb, 2025 1 commit
-
-
Arthur authored
# Adds support for `transformers` as a backend Following https://github.com/huggingface/transformers/pull/35235 , a bunch of models should already be supported, we are ramping up support for more models. Thanks @Isotr0py for the TP support, and @hmellor for his help as well! This includes: - `trust_remote_code=True` support: any model on the hub, if it implements attention the correct way can be natively supported!! - tensor parallel support --------- Signed-off-by:
Harry Mellor <19981378+hmellor@users.noreply.github.com> Signed-off-by:
Isotr0py <2037008807@qq.com> Co-authored-by:
Isotr0py <41363108+Isotr0py@users.noreply.github.com> Co-authored-by:
Harry Mellor <19981378+hmellor@users.noreply.github.com> Co-authored-by:
Isotr0py <2037008807@qq.com> Co-authored-by:
Cyrus Leung <cyrus.tl.leung@gmail.com> Co-authored-by:
Michael Goin <mgoin64@gmail.com> Co-authored-by:
Isotr0py <mozf@mail2.sysu.edu.cn>
-