Fix: Extend MAX_VPT from 32 to 256 to accommodate large-scale MoE models (e.g., the GLM-5-quantized model).