"vllm/model_executor/models/aquila.py" did not exist on "5c976a7e1a1bec875bf6474824b7dff39e38de18"
- 07 Aug, 2025 1 commit
-
-
Richard Huo authored
feat: DIS-323 [trtllm backend publisher] only publish kv event with the biggest window size to support kv routing with variable sliding window attention (#2241)
-
- 05 Aug, 2025 2 commits
-
-
Richard Huo authored
-
Ryan McCormick authored
-
- 31 Jul, 2025 1 commit
-
-
KrishnanPrash authored
-
- 16 Jul, 2025 1 commit
-
-
Tanmay Verma authored
-
- 15 Jul, 2025 1 commit
-
-
Tanmay Verma authored
-
- 26 Jun, 2025 1 commit
-
-
Tanmay Verma authored
Signed-off-by:Tanmay Verma <tanmayv@nvidia.com>
-
- 06 May, 2025 1 commit
-
-
hhzhang16 authored
-
- 11 Mar, 2025 1 commit
-
-
julienmancuso authored
-
- 25 Feb, 2025 1 commit
-
-
Neelay Shah authored
Signed-off-by:
Neelay Shah <neelays@nvidia.com> Co-authored-by:
Ryan McCormick <rmccormick@nvidia.com>
-
- 18 Feb, 2025 1 commit
-
-
GuanLuo authored
Signed-off-by:
Neelay Shah <neelays@nvidia.com> Co-authored-by:
aflowers <aflowers@nvidia.com> Co-authored-by:
Ryan McCormick <rmccormick@nvidia.com> Co-authored-by:
hongkuanz <hongkuanz@nvidia.com> Co-authored-by:
Neelay Shah <neelays@nvidia.com>
-
- 05 Feb, 2025 2 commits
-
-
J Wyman authored
-
Ryan Olson authored
Co-authored-by:
Ryan McCormick <rmccormick@nvidia.com> Co-authored-by:
Neelay Shah <neelays@nvidia.com>
-