- 23 Jan, 2025 1 commit
-
-
Gregory Shtrasberg authored
Signed-off-by:
Gregory Shtrasberg <Gregory.Shtrasberg@amd.com> Co-authored-by:
Micah Williamson <micah.williamson@amd.com>
-
- 25 Nov, 2024 1 commit
-
-
Wallas Henrique authored
Signed-off-by:
Wallas Santos <wallashss@ibm.com> Co-authored-by:
Michael Goin <michael@neuralmagic.com>
-
- 20 Nov, 2024 1 commit
-
-
Li, Jiang authored
Signed-off-by:jiang1.li <jiang1.li@intel.com>
-
- 18 Nov, 2024 1 commit
-
-
Angus Wang authored
Signed-off-by:Angus Wang <wangjadehao@gmail.com>
-
- 14 Nov, 2024 1 commit
-
-
Maximilien de Bayser authored
Signed-off-by:
Max de Bayser <mbayser@br.ibm.com> Signed-off-by:
Flavia Beo <flavia.beo@ibm.com> Co-authored-by:
Flavia Beo <flavia.beo@ibm.com>
-
- 06 Nov, 2024 2 commits
-
-
Konrad Zawora authored
Signed-off-by:
yuwenzho <yuwen.zhou@intel.com> Signed-off-by:
Chendi.Xue <chendi.xue@intel.com> Signed-off-by:
Bob Zhu <bob.zhu@intel.com> Signed-off-by:
zehao-intel <zehao.huang@intel.com> Signed-off-by:
Konrad Zawora <kzawora@habana.ai> Co-authored-by:
Kunshang Ji <kunshang.ji@intel.com> Co-authored-by:
Sanju C Sudhakaran <scsudhakaran@habana.ai> Co-authored-by:
Michal Adamczyk <madamczyk@habana.ai> Co-authored-by:
Marceli Fylcek <mfylcek@habana.ai> Co-authored-by:
Himangshu Lahkar <49579433+hlahkar@users.noreply.github.com> Co-authored-by:
Vivek Goel <vgoel@habana.ai> Co-authored-by:
yuwenzho <yuwen.zhou@intel.com> Co-authored-by:
Dominika Olszewska <dolszewska@habana.ai> Co-authored-by:
barak goldberg <149692267+bgoldberg-habana@users.noreply.github.com> Co-authored-by:
Michal Szutenberg <37601244+szutenberg@users.noreply.github.com> Co-authored-by:
Jan Kaniecki <jkaniecki@habana.ai> Co-authored-by:
Agata Dobrzyniewicz <160237065+adobrzyniewicz-habana@users.noreply.github.com> Co-authored-by:
Krzysztof Wisniewski <kwisniewski@habana.ai> Co-authored-by:
Dudi Lester <160421192+dudilester@users.noreply.github.com> Co-authored-by:
Ilia Taraban <tarabanil@gmail.com> Co-authored-by:
Chendi.Xue <chendi.xue@intel.com> Co-authored-by:
Michał Kuligowski <mkuligowski@habana.ai> Co-authored-by:
Jakub Maksymczuk <jmaksymczuk@habana.ai> Co-authored-by:
Tomasz Zielinski <85164140+tzielinski-habana@users.noreply.github.com> Co-authored-by:
Sun Choi <schoi@habana.ai> Co-authored-by:
Iryna Boiko <iboiko@habana.ai> Co-authored-by:
Bob Zhu <41610754+czhu15@users.noreply.github.com> Co-authored-by:
hlin99 <73271530+hlin99@users.noreply.github.com> Co-authored-by:
Zehao Huang <zehao.huang@intel.com> Co-authored-by:
Andrzej Kotłowski <Andrzej.Kotlowski@intel.com> Co-authored-by:
Yan Tomsinsky <73292515+Yantom1@users.noreply.github.com> Co-authored-by:
Nir David <ndavid@habana.ai> Co-authored-by:
Yu-Zhou <yu.zhou@intel.com> Co-authored-by:
Ruheena Suhani Shaik <rsshaik@habana.ai> Co-authored-by:
Karol Damaszke <kdamaszke@habana.ai> Co-authored-by:
Marcin Swiniarski <mswiniarski@habana.ai> Co-authored-by:
Woosuk Kwon <woosuk.kwon@berkeley.edu> Co-authored-by:
Jacek Czaja <jacek.czaja@intel.com> Co-authored-by:
Jacek Czaja <jczaja@habana.ai> Co-authored-by:
Yuan <yuan.zhou@outlook.com>
-
Aaron Pham authored
Signed-off-by:Aaron Pham <contact@aarnphm.xyz>
-
- 28 Oct, 2024 1 commit
-
-
wangshuai09 authored
Signed-off-by:wangshuai09 <391746016@qq.com>
-
- 22 Oct, 2024 1 commit
-
-
wangshuai09 authored
-
- 18 Sep, 2024 1 commit
-
-
Cyrus Leung authored
-
- 12 Aug, 2024 1 commit
-
-
jon-chuang authored
Co-authored-by:Cody Yu <hao.yu.cody@gmail.com>
-
- 29 Jul, 2024 1 commit
-
-
Thomas Parnell authored
Signed-off-by:Thomas Parnell <tpa@zurich.ibm.com>
-
- 27 Jul, 2024 1 commit
-
-
Joe authored
-
- 16 Jul, 2024 1 commit
-
-
Michael Goin authored
-
- 15 Jul, 2024 1 commit
-
-
Thomas Parnell authored
Signed-off-by:Thomas Parnell <tpa@zurich.ibm.com>
-
- 13 Jul, 2024 1 commit
-
-
Thomas Parnell authored
Signed-off-by:Thomas Parnell <tpa@zurich.ibm.com>
-
- 12 Jul, 2024 1 commit
-
-
Michael Goin authored
-
- 05 Jul, 2024 1 commit
-
-
JGSweets authored
-
- 03 Jul, 2024 1 commit
-
-
youkaichao authored
-
- 01 Jul, 2024 1 commit
-
-
youkaichao authored
-
- 13 Jun, 2024 1 commit
-
-
Li, Jiang authored
Co-authored-by:Jianan Gu <jianan.gu@intel.com>
-
- 31 May, 2024 1 commit
-
-
SnowDist authored
Co-authored-by:Zhuohan Li <zhuohan123@gmail.com>
-
- 28 May, 2024 1 commit
-
-
Michał Moskal authored
Co-authored-by:Ruth Evans <ruthevans@Ruths-MacBook-Pro.local>
-
- 25 May, 2024 1 commit
-
-
Eric Xihui Lin authored
Co-authored-by:
beagleski <yunanzhang@microsoft.com> Co-authored-by:
bapatra <bapatra@microsoft.com> Co-authored-by:
Barun Patra <codedecde@users.noreply.github.com> Co-authored-by:
Michael Goin <michael@neuralmagic.com>
-
- 16 May, 2024 1 commit
-
-
Hongxia Yang authored
-
- 15 May, 2024 1 commit
-
-
SangBin Cho authored
[Core][2/N] Model runner refactoring part 2. Combine prepare prefill / decode to a single API (#4681) This PR combines prepare_prompt and prepare_decode into a single API. This PR also coelsce the attn metadata for prefill/decode to a single class and allow to slice them when running attn backend. It also refactors subquery_start_loc which was not refactored in the previous PR
-
- 08 May, 2024 2 commits
-
-
youkaichao authored
-
DefTruth authored
-
- 07 May, 2024 1 commit
-
-
youkaichao authored
-
- 03 May, 2024 1 commit
-
-
SangBin Cho authored
-
- 02 May, 2024 1 commit
-
-
Michał Moskal authored
Co-authored-by:SangBin Cho <rkooo567@gmail.com>
-
- 27 Apr, 2024 1 commit
-
-
Hongxia Yang authored
-
- 18 Apr, 2024 1 commit
-
-
Michał Moskal authored
-
- 12 Apr, 2024 1 commit
-
-
Bellk17 authored
Co-authored-by:Woosuk Kwon <woosuk.kwon@berkeley.edu>
-
- 11 Apr, 2024 2 commits
-
-
Kunshang Ji authored
-
SangBin Cho authored
-
- 10 Apr, 2024 1 commit
-
-
James Whedbee authored
-
- 09 Apr, 2024 1 commit
-
-
Juan Villamizar authored
Co-authored-by:
jpvillam <jpvillam@amd.com> Co-authored-by:
Gregory Shtrasberg <Gregory.Shtrasberg@amd.com> Co-authored-by:
Woosuk Kwon <woosuk.kwon@berkeley.edu>
-
- 04 Apr, 2024 1 commit
-
-
Woosuk Kwon authored
-
- 03 Apr, 2024 1 commit
-
-
Adrian Abeyta authored
Co-authored-by:
Gregory Shtrasberg <Gregory.Shtrasberg@amd.com> Co-authored-by:
HaiShaw <hixiao@gmail.com> Co-authored-by:
AdrianAbeyta <Adrian.Abeyta@amd.com> Co-authored-by:
Matthew Wong <Matthew.Wong2@amd.com> Co-authored-by:
root <root@gt-pla-u18-08.pla.dcgpu> Co-authored-by:
mawong-amd <156021403+mawong-amd@users.noreply.github.com> Co-authored-by:
ttbachyinsda <ttbachyinsda@outlook.com> Co-authored-by:
guofangze <guofangze@kuaishou.com> Co-authored-by:
Michael Goin <mgoin64@gmail.com> Co-authored-by:
jacobthebanana <50071502+jacobthebanana@users.noreply.github.com> Co-authored-by:
Woosuk Kwon <woosuk.kwon@berkeley.edu>
-