1. 22 Sep, 2025 1 commit
    • Jeff Daily's avatar
      [ROCm] re-add support for ROCm builds · 61ec4f1a
      Jeff Daily authored
      Previously #6086 added ROCm support but after numerous rebases it lost
      critical changes. This PR restores the ROCm build.
      
      There are many source file changes but most were automated using the
      following:
      
      ```bash
      for f in `grep -rl '#ifdef USE_CUDA'`
      do
          sed -i 's@#ifdef USE_CUDA@#if defined(USE_CUDA) || defined(USE_ROCM)@g' $f
      done
      
      for f in `grep -rl '#endif  // USE_CUDA'`
      do
          sed -i 's@#endif  // USE_CUDA@#endif  // USE_CUDA || USE_ROCM@g' $f
      done
      ```
      61ec4f1a
  2. 16 Jun, 2023 1 commit
  3. 16 Mar, 2023 1 commit
  4. 01 Feb, 2023 1 commit
    • James Lamb's avatar
      [CUDA] consolidate CUDA versions (#5677) · 4f47547c
      James Lamb authored
      
      
      * [ci] speed up if-else, swig, and lint conda setup
      
      * add 'source activate'
      
      * python constraint
      
      * start removing cuda v1
      
      * comment out CI
      
      * remove more references
      
      * revert some unnecessaary changes
      
      * revert a few more mistakes
      
      * revert another change that ignored params
      
      * sigh
      
      * remove CUDATreeLearner
      
      * fix tests, docs
      
      * fix quoting in setup.py
      
      * restore all CI
      
      * Apply suggestions from code review
      Co-authored-by: default avatarshiyu1994 <shiyu_k1994@qq.com>
      
      * Apply suggestions from code review
      
      * completely remove cuda_exp, update docs
      
      ---------
      Co-authored-by: default avatarshiyu1994 <shiyu_k1994@qq.com>
      4f47547c
  5. 29 Dec, 2022 1 commit
  6. 27 Dec, 2022 1 commit
    • shiyu1994's avatar
      [CUDA] Add L2 metric for new CUDA version (#5633) · 6482b47e
      shiyu1994 authored
      * add rmse metric for new cuda version
      
      * add Init for CUDAMetricInterface
      
      * fix lint errors
      
      * fix rmse and add l2 metric for new cuda version
      
      * use CUDAL2Metric
      
      * explicit template instantiation
      
      * write result only with the first thread
      
      * pre allocate buffer for output converting
      
      * fix l2 regression with cuda metric evaluation
      
      * weighting loss in cuda metric evaluation
      
      * mark CUDATree::AsConstantTree as override
      6482b47e
  7. 02 Dec, 2022 1 commit