[perf][OSS] tensor views for bucketing (#300)
* min bucket size with model size * resize the bucket after all the params have been squeezed in, save a tiny bit of memory * minor, ensure that the cache is freed and improve the comments
Showing
Please register or sign in to comment