- 13 Aug, 2025 1 commit
-
-
Woosuk Kwon authored
Signed-off-by:
Woosuk Kwon <woosuk.kwon@berkeley.edu> Signed-off-by:
Woosuk Kwon <woosuk@thinkingmachines.ai>
-
- 12 Jun, 2025 1 commit
-
-
Ning Xie authored
Signed-off-by:Andy Xie <andy.xning@gmail.com>
-
- 03 Jun, 2025 1 commit
-
-
Simon Mo authored
Signed-off-by:simon-mo <simon.mo@hey.com>
-
- 03 May, 2025 1 commit
-
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
- 02 May, 2025 1 commit
-
-
Andrew Sansom authored
Signed-off-by:
Andrew Sansom <andrew@protopia.ai> Signed-off-by:
DarkLight1337 <tlleungac@connect.ust.hk> Co-authored-by:
临景 <linjing.yx@alibaba-inc.com> Co-authored-by:
Bryce1010 <bryceyx@gmail.com> Co-authored-by:
Nan2018 <nan@protopia.ai> Co-authored-by:
Cyrus Leung <cyrus.tl.leung@gmail.com> Co-authored-by:
DarkLight1337 <tlleungac@connect.ust.hk>
-
- 15 Mar, 2025 1 commit
-
-
Robert Shaw authored
Signed-off-by:
rshaw@neuralmagic.com <rshaw@neuralmagic.com> Co-authored-by:
rshaw@neuralmagic.com <rshaw@neuralmagic.com> Co-authored-by:
Nicolò Lucchesi <nlucches@redhat.com> Co-authored-by:
Tyler Michael Smith <tyler@neuralmagic.com> Co-authored-by:
Michael Goin <michael@neuralmagic.com>
-
- 03 Mar, 2025 1 commit
-
-
Harry Mellor authored
-
- 23 Feb, 2025 1 commit
-
-
Kevin H. Luu authored
-
- 19 Feb, 2025 1 commit
-
-
Kevin H. Luu authored
Signed-off-by: <> Co-authored-by:EC2 Default User <ec2-user@ip-172-31-20-117.us-west-2.compute.internal>
-
- 04 Feb, 2025 1 commit
-
-
Hongxia Yang authored
Signed-off-by:
Hongxia Yang <hongxia.yang@amd.com> Co-authored-by:
Matthew Wong <Matthew.Wong2@amd.com>
-
- 02 Feb, 2025 1 commit
-
-
Russell Bryant authored
- **Add SPDX license headers to python source files** - **Check for SPDX headers using pre-commit** commit 9d7ef44c3cfb72ca4c32e1c677d99259d10d4745 Author: Russell Bryant <rbryant@redhat.com> Date: Fri Jan 31 14:18:24 2025 -0500 Add SPDX license headers to python source files This commit adds SPDX license headers to python source files as recommended to the project by the Linux Foundation. These headers provide a concise way that is both human and machine readable for communicating license information for each source file. It helps avoid any ambiguity about the license of the code and can also be easily used by tools to help manage license compliance. The Linux Foundation runs license scans against the codebase to help ensure we are in compliance with the licenses of the code we use, including dependencies. Having these headers in place helps that tool do its job. More information can be found on the SPDX site: - https://spdx.dev/learn/handling-license-info/ Signed-off-by:Russell Bryant <rbryant@redhat.com> commit 5a1cf1cb3b80759131c73f6a9dddebccac039dea Author: Russell Bryant <rbryant@redhat.com> Date: Fri Jan 31 14:36:32 2025 -0500 Check for SPDX headers using pre-commit Signed-off-by:
Russell Bryant <rbryant@redhat.com> --------- Signed-off-by:
Russell Bryant <rbryant@redhat.com>
-
- 23 Jan, 2025 1 commit
-
-
Gregory Shtrasberg authored
Signed-off-by:
Gregory Shtrasberg <Gregory.Shtrasberg@amd.com> Co-authored-by:
Micah Williamson <micah.williamson@amd.com>
-
- 16 Dec, 2024 1 commit
-
-
youkaichao authored
Signed-off-by:youkaichao <youkaichao@gmail.com>
-
- 13 Dec, 2024 1 commit
-
-
youkaichao authored
Signed-off-by:youkaichao <youkaichao@gmail.com>
-
- 03 Dec, 2024 1 commit
-
-
youkaichao authored
Signed-off-by:youkaichao <youkaichao@gmail.com>
-
- 01 Dec, 2024 1 commit
-
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
- 02 Nov, 2024 1 commit
-
-
youkaichao authored
Signed-off-by:
youkaichao <youkaichao@gmail.com> Co-authored-by:
Nick Hill <nhill@redhat.com>
-
- 01 Nov, 2024 1 commit
-
-
Peter Salas authored
Signed-off-by:Peter Salas <peter@fixie.ai>
-
- 22 Oct, 2024 1 commit
-
-
wangshuai09 authored
-
- 19 Oct, 2024 1 commit
-
-
Joe Runde authored
Signed-off-by:Joe Runde <Joseph.Runde@ibm.com>
-
- 18 Oct, 2024 1 commit
-
-
Joe Runde authored
Signed-off-by:Joe Runde <Joseph.Runde@ibm.com>
-
- 21 Sep, 2024 1 commit
-
-
Cyrus Leung authored
-
- 19 Sep, 2024 1 commit
-
-
sroy745 authored
[Bugfix] [Encoder-Decoder] Bugfix for encoder specific metadata construction during decode of encoder-decoder models. (#8545)
-
- 18 Sep, 2024 1 commit
-
-
Aaron Pham authored
Signed-off-by:
Aaron Pham <contact@aarnphm.xyz> Co-authored-by:
Cyrus Leung <cyrus.tl.leung@gmail.com>
-
- 17 Sep, 2024 1 commit
-
-
sroy745 authored
-
- 20 Aug, 2024 1 commit
-
-
Antoni Baum authored
-
- 19 Aug, 2024 2 commits
-
-
William Lin authored
Co-authored-by:afeldman-nm <156691304+afeldman-nm@users.noreply.github.com>
-
SangBin Cho authored
-
- 16 Aug, 2024 1 commit
-
-
jon-chuang authored
-
- 09 Aug, 2024 1 commit
-
-
Mahesh Keralapura authored
-
- 06 Aug, 2024 1 commit
-
-
afeldman-nm authored
[Core] Subclass ModelRunner to support cross-attention & encoder sequences (towards eventual encoder/decoder model support) (#4942) Co-authored-by:
Andrew Feldman <afeld2012@gmail.com> Co-authored-by:
Nick Hill <nickhill@us.ibm.com>
-
- 25 Jul, 2024 1 commit
-
-
Cody Yu authored
-
- 17 Jul, 2024 1 commit
-
-
Cody Yu authored
-
- 09 Jul, 2024 1 commit
-
-
Swapnil Parekh authored
Co-authored-by:
Swapnil Parekh <swapnilp@ibm.com> Co-authored-by:
Joe G <joseph.granados@h2o.ai> Co-authored-by:
Antoni Baum <antoni.baum@protonmail.com>
-
- 02 Jul, 2024 1 commit
-
-
Murali Andoorveedu authored
Signed-off-by:Muralidhar Andoorveedu <muralidhar.andoorveedu@centml.ai>
-
- 26 Jun, 2024 1 commit
-
-
Stephanie Wang authored
Signed-off-by:
Stephanie Wang <swang@cs.berkeley.edu> Signed-off-by:
Stephanie <swang@anyscale.com> Co-authored-by:
Stephanie <swang@anyscale.com>
-
- 15 Jun, 2024 1 commit
-
-
Cyrus Leung authored
-
- 13 Jun, 2024 1 commit
-
-
youkaichao authored
[Core][Distributed] add coordinator to reduce code duplication in tp and pp (#5293)
-
- 15 May, 2024 1 commit
-
-
SangBin Cho authored
[Core][2/N] Model runner refactoring part 2. Combine prepare prefill / decode to a single API (#4681) This PR combines prepare_prompt and prepare_decode into a single API. This PR also coelsce the attn metadata for prefill/decode to a single class and allow to slice them when running attn backend. It also refactors subquery_start_loc which was not refactored in the previous PR
-
- 13 May, 2024 1 commit
-
-
Woosuk Kwon authored
-