[Model] Extract GatedDeltaNetAttention into shared layer for Qwen3Next and Qwen3.5 (#37975)
Signed-off-by: wxsIcey <1790571317@qq.com>
Signed-off-by: Icey <1790571317@qq.com>