- 28 Jan, 2025 1 commit
-
-
Harry Mellor authored
Signed-off-by:Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
- 20 Jan, 2025 1 commit
-
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
- 19 Jan, 2025 1 commit
-
-
Cyrus Leung authored
-
- 12 Jan, 2025 1 commit
-
-
Isotr0py authored
Signed-off-by:
Isotr0py <2037008807@qq.com> Co-authored-by:
Cyrus Leung <cyrus.tl.leung@gmail.com>
-
- 28 Dec, 2024 1 commit
-
-
youkaichao authored
Signed-off-by:youkaichao <youkaichao@gmail.com>
-
- 27 Nov, 2024 1 commit
-
-
shunxing12345 authored
Signed-off-by:
Isotr0py <2037008807@qq.com> Co-authored-by:
xiangw2 <xiangw2@chinatelecom.cn> Co-authored-by:
Isotr0py <2037008807@qq.com>
-
- 25 Nov, 2024 2 commits
-
-
Shane A authored
-
zhou fan authored
Signed-off-by:
xffxff <1247714429@qq.com> Co-authored-by:
Isotr0py <2037008807@qq.com>
-
- 06 Nov, 2024 1 commit
-
-
Aaron Pham authored
Signed-off-by:Aaron Pham <contact@aarnphm.xyz>
-
- 04 Nov, 2024 1 commit
-
-
shanshan wang authored
Signed-off-by:
Shanshan Wang <shanshan.wang@h2o.ai> Signed-off-by:
Roger Wang <ywang@roblox.com> Co-authored-by:
Roger Wang <ywang@roblox.com>
-
- 24 Oct, 2024 1 commit
-
-
王敏 authored
-
- 16 Oct, 2024 1 commit
-
-
Cyrus Leung authored
-
- 11 Oct, 2024 1 commit
-
-
zhuwenwen authored
-
- 07 Oct, 2024 1 commit
-
-
Cyrus Leung authored
Co-authored-by:
Roger Wang <ywang@roblox.com> Co-authored-by:
Isotr0py <mozf@mail2.sysu.edu.cn>
-
- 26 Sep, 2024 1 commit
-
-
Roger Wang authored
-
- 25 Sep, 2024 1 commit
-
-
Chen Zhang authored
Co-authored-by:
simon-mo <xmo@berkeley.edu> Co-authored-by:
Chang Su <chang.s.su@oracle.com> Co-authored-by:
Simon Mo <simon.mo@hey.com> Co-authored-by:
Roger Wang <136131678+ywang96@users.noreply.github.com> Co-authored-by:
Roger Wang <ywang@roblox.com>
-
- 18 Sep, 2024 1 commit
-
-
Geun, Lim authored
Co-authored-by:Michael Goin <michael@neuralmagic.com>
-
- 06 Sep, 2024 1 commit
-
-
Nick Hill authored
-
- 02 Sep, 2024 1 commit
-
-
Shawn Tan authored
Co-authored-by:Nick Hill <nickhill@us.ibm.com>
-
- 30 Aug, 2024 1 commit
-
-
Yohan Na authored
-
- 22 Aug, 2024 1 commit
-
-
Abhinav Goyal authored
-
- 21 Aug, 2024 1 commit
-
-
Peter Salas authored
-
- 16 Aug, 2024 1 commit
-
-
Michael Goin authored
-
- 29 Jul, 2024 1 commit
-
-
Isotr0py authored
Co-authored-by:Roger Wang <ywang@roblox.com>
-
- 26 Jul, 2024 1 commit
-
-
Michael Goin authored
-
- 23 Jul, 2024 2 commits
-
-
Roger Wang authored
-
Roger Wang authored
-
- 22 Jul, 2024 1 commit
-
-
Roger Wang authored
-
- 10 Jul, 2024 1 commit
-
-
Abhinav Goyal authored
-
- 01 Jul, 2024 1 commit
-
-
Thomas Parnell authored
Signed-off-by:
Thomas Parnell <tpa@zurich.ibm.com> Co-authored-by:
Joshua Rosenkranz <jmrosenk@us.ibm.com>
-
- 27 Jun, 2024 1 commit
-
-
Nick Hill authored
-
- 21 Jun, 2024 1 commit
-
-
Joshua Rosenkranz authored
Signed-off-by:
Thomas Parnell <tpa@zurich.ibm.com> Co-authored-by:
Thomas Parnell <tpa@zurich.ibm.com> Co-authored-by:
Nick Hill <nickhill@us.ibm.com> Co-authored-by:
Davis Wertheimer <Davis.Wertheimer@ibm.com>
-
- 18 May, 2024 1 commit
-
-
SangBin Cho authored
Currently we need to call rotary embedding kernel for each LoRA, which makes it hard to serve multiple long context length LoRA. Add batched rotary embedding kernel and pipe it through. It replaces the rotary embedding layer to the one that is aware of multiple cos-sin-cache per scaling factors. Follow up of https://github.com/vllm-project/vllm/pull/3095/files
-
- 09 May, 2024 1 commit
-
-
Hao Zhang authored
Co-authored-by:
Dash Desai <1723932+iamontheinet@users.noreply.github.com> Co-authored-by:
Aurick Qiao <qiao@aurick.net> Co-authored-by:
Aurick Qiao <aurick.qiao@snowflake.com> Co-authored-by:
Aurick Qiao <aurickq@users.noreply.github.com> Co-authored-by:
Cody Yu <hao.yu.cody@gmail.com>
-
- 26 Apr, 2024 1 commit
-
-
SangBin Cho authored
Co-authored-by:Danny Guinther <dguinther@neuralmagic.com>
-
- 23 Apr, 2024 1 commit
-
-
SangBin Cho authored
-
- 12 Apr, 2024 1 commit
-
-
Michael Feil authored
Co-authored-by:Roger Wang <136131678+ywang96@users.noreply.github.com>
-
- 27 Mar, 2024 1 commit
-
-
Megha Agarwal authored
-
- 25 Mar, 2024 1 commit
-
-
SangBin Cho authored
-
- 21 Mar, 2024 1 commit
-
-
Woosuk Kwon authored
Co-authored-by:
Roy <jasonailu87@gmail.com> Co-authored-by:
Roger Meier <r.meier@siemens.com>
-