"megatron/legacy/model/fused_softmax.py" did not exist on "051f58f1a5a8a7450ffea5c3aadaa2ea4b3a8630"
[Feature] Qwen3-Next & FLA: Support MTP topk>1; Up to 6% faster (#11133)
Co-authored-by: Stefan He <hebiaobuaa@gmail.com>