• Chang Liu's avatar
    [Feature] Unify the cuda stream used in core library (#4480) · 1c9d2a03
    Chang Liu authored
    
    
    * Use an internal cuda stream for CopyDataFromTo
    
    * small fix white space
    
    * Fix to compile
    
    * Make stream optional in copydata for compile
    
    * fix lint issue
    
    * Update cub functions to use internal stream
    
    * Lint check
    
    * Update CopyTo/CopyFrom/CopyFromTo to use internal stream
    
    * Address comments
    
    * Fix backward CUDA stream
    
    * Avoid overloading CopyFromTo()
    
    * Minor comment update
    
    * Overload copydatafromto in cuda device api
    Co-authored-by: default avatarxiny <xiny@nvidia.com>
    1c9d2a03
unit_graph.cc 56 KB