[Bugfix][Quantization] Fix PerTensorScale loading with tuple shard_id in...
[Bugfix][Quantization] Fix PerTensorScale loading with tuple shard_id in MergedColumnParallelLinear (#38517)
Signed-off-by:
loukang <loukang@xiaohongshu.com>
Showing
Please register or sign in to comment