Improve linear.py to load sharded weights & remove the dependency of Parameters from vllm (#2784)
Co-authored-by: SangBin Cho <rkooo567@gmail.com>