Unverified Commit c7102e56 authored by Nikita Titov's avatar Nikita Titov Committed by GitHub
Browse files

Minor CUDA cleanup (#5394)



* Update README.rst

* Update cuda_score_updater.cu
Co-authored-by: default avatarJames Lamb <jaylamb20@gmail.com>
parent e2dfcd69
......@@ -125,7 +125,7 @@ All requirements from `Build from Sources section <#build-from-sources>`__ apply
Recently, a new CUDA version with better efficiency is implemented as an experimental feature. To build the new CUDA version, replace ``--cuda`` with ``--cuda-exp`` in the above commands. Please note that new version requires **CUDA** 10.0 or later libraries. Note that this new version uses twice the memory, since it stores data row-wise as well as column-wise in memory to improve performance (see this `issue <https://github.com/microsoft/LightGBM/issues/5318>`__ for discussion).
To use the regular or experimental cuda versions within Python, pass ``{"device":"cuda"}`` or ``{"device":"cuda_exp"}`` respectively as parameters.
To use the regular or experimental CUDA versions within Python, pass ``{"device": "cuda"}`` or ``{"device": "cuda_exp"}`` respectively as parameters.
Build HDFS Version
~~~~~~~~~~~~~~~~~~
......
......@@ -22,7 +22,7 @@ __global__ void AddScoreConstantKernel(
void CUDAScoreUpdater::LaunchAddScoreConstantKernel(const double val, const size_t offset) {
const int num_blocks = (num_data_ + num_threads_per_block_) / num_threads_per_block_;
Log::Warning("adding init score = %f", val);
Log::Debug("Adding init score = %lf", val);
AddScoreConstantKernel<<<num_blocks, num_threads_per_block_>>>(val, offset, num_data_, cuda_score_);
}
......
Markdown is supported
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment