[Core][Distributed] support both cpu and device tensor in broadcast tensor dict (#4660)