• Gyula Zakor's avatar
    Fix gpu::contiguous elimination bug · 6d59be1b
    Gyula Zakor authored
    When testing gpu::contiguous instruction replacability recursive calls to try_compute_shape can pick up gpu::contiguous instructions as well.
    These instructions always passes the try_compute_shape check because compute_shape of gpu::contiguous always returns a standard shape, thus propagating false `true` values upwards.
    For these corner cases we must demand the standard shape of the inputs as well, to avoid ellimination of needed gpu::contiguous instructions.
    6d59be1b
eliminate_contiguous.cpp 6.88 KB