• Jeremy Reizenstein's avatar
    Avoid raysampler dict · 305cf32f
    Jeremy Reizenstein authored
    Summary:
    A significant speedup (e.g. >2% of a forward pass).
    
    Move NDCMultinomialRaysampler parts of AbstractMaskRaySampler to members instead of living in a dict. The dict was hiding them from the nn.Module system so their _xy_grid members were remaining on the CPU. Therefore they were being copied to the GPU in every forward pass.
    
    (We couldn't easily use a ModuleDict here because the enum keys are not strs.)
    
    Reviewed By: shapovalov
    
    Differential Revision: D39668589
    
    fbshipit-source-id: 719b88e4a08fd7263a284e0ab38189e666bd7e3a
    305cf32f
ray_sampler.py 9.52 KB