fix(planner): don't block agg decode scaling when max_num_batched_tokens is missing (#8196)
Signed-off-by: hongkuanz <hongkuanz@nvidia.com>
Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>