    Refactor runtime compiled kernels to use the same compile_ops pipeline (#1125) · 661046c6
    Paul Fultz II authored
    This adds the infrastructure to compile all runtime compiled kernels in parallel, whereas before only pointwise kernels were compiled in parallel. This will also integrate directly with lowering and the gpu-driver. The pointwise and roialign kernels now use this infrastructure. Scatternd does not, since it requires a standard shape.
    
    This also makes it easier to add new runtime compiled kernels in the future.
compile_hip.cpp 7.94 KB