- 04 Mar, 2026 1 commit
-
-
Yan Ru Pei authored
Signed-off-by:
PeaBrane <yanrpei@gmail.com> Co-authored-by:
Claude Opus 4.6 <noreply@anthropic.com>
-
- 03 Mar, 2026 3 commits
-
-
GuanLuo authored
fix: properly setup and register vLLM worker for external / hybrid load balancing. Update launch script (#6695) Signed-off-by:Guan Luo <41310872+GuanLuo@users.noreply.github.com>
-
Yan Ru Pei authored
Signed-off-by:PeaBrane <yanrpei@gmail.com>
-
Yan Ru Pei authored
Signed-off-by:PeaBrane <yanrpei@gmail.com>
-
- 02 Mar, 2026 2 commits
-
-
Yan Ru Pei authored
Signed-off-by:
PeaBrane <yanrpei@gmail.com> Co-authored-by:
Claude Opus 4.6 <noreply@anthropic.com>
-
Neal Vaidya authored
Signed-off-by:
Neal Vaidya <nealv@nvidia.com> Co-authored-by:
Claude Opus 4.6 <noreply@anthropic.com>
-
- 28 Feb, 2026 2 commits
-
-
Michael Feil authored
Signed-off-by:michaelfeil <63565275+michaelfeil@users.noreply.github.com>
-
Yan Ru Pei authored
Signed-off-by:
PeaBrane <yanrpei@gmail.com> Co-authored-by:
Claude Opus 4.6 <noreply@anthropic.com>
-
- 27 Feb, 2026 2 commits
-
-
Yan Ru Pei authored
Signed-off-by:PeaBrane <yanrpei@gmail.com>
-
ishandhanani authored
Co-authored-by:Claude <noreply@anthropic.com>
-
- 26 Feb, 2026 1 commit
-
-
Yan Ru Pei authored
Signed-off-by:
PeaBrane <yanrpei@gmail.com> Co-authored-by:
Claude Opus 4.6 <noreply@anthropic.com>
-
- 25 Feb, 2026 3 commits
-
-
Yan Ru Pei authored
Signed-off-by:PeaBrane <yanrpei@gmail.com>
-
Keiven C authored
Signed-off-by:
Keiven Chang <keivenchang@users.noreply.github.com> Co-authored-by:
Keiven Chang <keivenchang@users.noreply.github.com> Co-authored-by:
Cursor <cursoragent@cursor.com>
-
atchernych authored
Signed-off-by:Anna Tchernych <atchernych@nvidia.com>
-
- 24 Feb, 2026 9 commits
-
-
ishandhanani authored
Co-authored-by:
baihuitian <baihuitian.bht@gmail.com> Co-authored-by:
Cursor <cursoragent@cursor.com>
-
Keiven C authored
Signed-off-by:
Keiven Chang <keivenchang@users.noreply.github.com> Co-authored-by:
Keiven Chang <keivenchang@users.noreply.github.com> Co-authored-by:
Cursor <cursoragent@cursor.com>
-
Yan Ru Pei authored
Signed-off-by:PeaBrane <yanrpei@gmail.com>
-
Yan Ru Pei authored
chore: split JetStream subscriber into dedicated module and deprecate durable_kv_events [DYN-2203] (#6477) Signed-off-by:PeaBrane <yanrpei@gmail.com>
-
Biswa Panda authored
-
Yan Ru Pei authored
Signed-off-by:PeaBrane <yanrpei@gmail.com>
-
Jacky authored
Signed-off-by:Jacky <18255193+kthui@users.noreply.github.com>
-
Yan Ru Pei authored
Signed-off-by:
PeaBrane <yanrpei@gmail.com> Signed-off-by:
Yan Ru Pei <yanrpei@gmail.com> Co-authored-by:
Claude Opus 4.6 <noreply@anthropic.com>
-
Yan Ru Pei authored
Signed-off-by:
Pea Brane <peabrane@peabrane.com> Signed-off-by:
PeaBrane <yanrpei@gmail.com> Co-authored-by:
Claude Opus 4.6 <noreply@anthropic.com>
-
- 23 Feb, 2026 3 commits
-
-
Thomas Montfort authored
Signed-off-by:tmontfort <tmontfort@nvidia.com>
-
Yan Ru Pei authored
Signed-off-by:
PeaBrane <yanrpei@gmail.com> Co-authored-by:
Claude Opus 4.6 <noreply@anthropic.com>
-
Yan Ru Pei authored
Signed-off-by:PeaBrane <yanrpei@gmail.com>
-
- 19 Feb, 2026 3 commits
-
-
Yan Ru Pei authored
Signed-off-by:
PeaBrane <yanrpei@gmail.com> Co-authored-by:
Claude Opus 4.6 <noreply@anthropic.com>
-
Yan Ru Pei authored
chore: Remove ZmqKvEventListener binding and rework standalone TRT-LLM example to use native Python ZMQ (#6164) Signed-off-by:PeaBrane <yanrpei@gmail.com>
-
Yan Ru Pei authored
Signed-off-by:
PeaBrane <yanrpei@gmail.com> Co-authored-by:
Cursor <cursoragent@cursor.com>
-
- 14 Feb, 2026 1 commit
-
-
zhongdaor-nv authored
Signed-off-by:
zhongdaor <zhongdaor@nvidia.com> Signed-off-by:
zhongdaor-nv <zhongdaor@nvidia.com>
-
- 13 Feb, 2026 4 commits
-
-
Yan Ru Pei authored
Signed-off-by:PeaBrane <yanrpei@gmail.com>
-
atchernych authored
Signed-off-by:Anna Tchernych <atchernych@nvidia.com>
-
Yan Ru Pei authored
Signed-off-by:
PeaBrane <yanrpei@gmail.com> Co-authored-by:
ishandhanani <82981111+ishandhanani@users.noreply.github.com>
-
Yan Ru Pei authored
Signed-off-by:PeaBrane <yanrpei@gmail.com>
-
- 12 Feb, 2026 2 commits
-
-
Karen Chung authored
feat: use RNG when dp routing targets are tied; override no-assume-kv-reuse for decode requests (#6253) Signed-off-by:Karen Chung <karenc@nvidia.com>
-
Yan Ru Pei authored
Signed-off-by:
PeaBrane <yanrpei@gmail.com> Signed-off-by:
jthomson04 <jothomson@nvidia.com> Signed-off-by:
jthomson04 <jwillthomson19@gmail.com> Signed-off-by:
Janelle Cai <jcai18@mit.edu> Co-authored-by:
jthomson04 <jwillthomson19@gmail.com> Co-authored-by:
Janelle Cai <jcai18@mit.edu>
-
- 11 Feb, 2026 1 commit
-
-
Yan Ru Pei authored
Signed-off-by:
PeaBrane <yanrpei@gmail.com> Co-authored-by:
Cursor <cursoragent@cursor.com>
-
- 08 Feb, 2026 1 commit
-
-
Yan Ru Pei authored
chore: enable local indexers by default, and use normal event plane by default (not jetstream) (#5941) Signed-off-by:PeaBrane <yanrpei@gmail.com>
-
- 04 Feb, 2026 2 commits
-
-
Janelle Cai authored
-
Hongkuan Zhou authored
Signed-off-by:hongkuanz <hongkuanz@nvidia.com>
-