1. 06 May, 2023 1 commit
  2. 01 Feb, 2023 1 commit
    • [CUDA] consolidate CUDA versions (#5677) · 4f47547c
      James Lamb authored
      
      
      * [ci] speed up if-else, swig, and lint conda setup
      
      * add 'source activate'
      
      * python constraint
      
      * start removing cuda v1
      
      * comment out CI
      
      * remove more references
      
      * revert some unnecessary changes
      
      * revert a few more mistakes
      
      * revert another change that ignored params
      
      * sigh
      
      * remove CUDATreeLearner
      
      * fix tests, docs
      
      * fix quoting in setup.py
      
      * restore all CI
      
      * Apply suggestions from code review
      Co-authored-by: shiyu1994 <shiyu_k1994@qq.com>
      
      * Apply suggestions from code review
      
      * completely remove cuda_exp, update docs
      
      ---------
      Co-authored-by: shiyu1994 <shiyu_k1994@qq.com>
  3. 02 Dec, 2022 2 commits
  4. 27 Nov, 2022 1 commit
  5. 02 Nov, 2022 1 commit
  6. 31 Aug, 2022 1 commit
    • [CUDA] Add binary objective for cuda_exp (#5425) · 2b8fe8b4
      shiyu1994 authored
      * add binary objective for cuda_exp
      
      * include <string> and <vector>
      
      * exchange include ordering
      
      * fix length of score to copy in evaluation
      
      * fix EvalOneMetric
      
      * fix cuda binary objective and prediction when boosting on gpu
      
      * Add white space
      
      * fix BoostFromScore for CUDABinaryLogloss
      
      update log in test_register_logger
      
      * include <algorithm>
      
      * simplify shared memory buffer
  7. 10 Aug, 2022 1 commit
    • feature: Add true streaming APIs to reduce client-side memory usage (#5299) · 0a5c5838
      Scott Votaw authored
      * Extract streaming to own PR
      
      * small merge fixes and cleanup
      
      * linting fixes
      
      * fix cast warning
      
      * Fix accidental deletion during branch transfer
      
      * responded to initial triage comments
      
      * Added more tests to use create-from-samples APIs
      
      * added mutex and adjusted nclasses logic
      
      * Fix thread-safety for pushing data to sparse bins through Push APIs
      
      * lint and doc fixes
      
      * Small SWIG fix
      
      * nit fix
      
      * Responded to StrikerRUS comments
      
      * fix breaking change after merge with master
      
      * Extract streaming to own PR
      
      * small merge fixes and cleanup
      
      * Fix accidental deletion during branch transfer
      
      * responded to initial triage comments
      
      * Added more tests to use create-from-samples APIs
      
      * Fix rstcheck call in ci
      
      * remove TODOs
      
      * Extract streaming to own PR
      
      * small merge fixes and cleanup
      
      * Fix accidental deletion during branch transfer
      
      * responded to initial triage comments
      
      * Added more tests to use create-from-samples APIs
      
      * Small SWIG fix
      
      * remove ci change
      
      * responded to shiyu1994 comments
      
      * responded to StrikerRUS comments
      
      * Fixes from StrikerRUS comments
  8. 29 Jul, 2022 2 commits
  9. 23 May, 2022 1 commit
  10. 23 Mar, 2022 1 commit
    • [CUDA] New CUDA version Part 1 (#4630) · 6b56a90c
      shiyu1994 authored
      
      
      * new cuda framework
      
      * add histogram construction kernel
      
      * before removing multi-gpu
      
      * new cuda framework
      
      * tree learner cuda kernels
      
      * single tree framework ready
      
      * single tree training framework
      
      * remove comments
      
      * boosting with cuda
      
      * optimize for best split find
      
      * data split
      
      * move boosting into cuda
      
      * parallel synchronize best split point
      
      * merge split data kernels
      
      * before code refactor
      
      * use tasks instead of features as units for split finding
      
      * refactor cuda best split finder
      
      * fix configuration error with small leaves in data split
      
      * skip histogram construction of too small leaf
      
      * skip split finding of invalid leaves
      
      stop when no leaf to split
      
      * support row wise with CUDA
      
      * copy data for split by column
      
      * copy data from host to CPU by column for data partition
      
      * add synchronize best splits for one leaf from multiple blocks
      
      * partition dense row data
      
      * fix sync best split from task blocks
      
      * add support for sparse row wise for CUDA
      
      * remove useless code
      
      * add l2 regression objective
      
      * sparse multi value bin enabled for CUDA
      
      * fix cuda ranking objective
      
      * support for number of items <= 2048 per query
      
      * speedup histogram construction by interleaving global memory access
      
      * split optimization
      
      * add cuda tree predictor
      
      * remove comma
      
      * refactor objective and score updater
      
      * before use struct
      
      * use structure for split information
      
      * use structure for leaf splits
      
      * return CUDASplitInfo directly after finding best split
      
      * split with CUDATree directly
      
      * use cuda row data in cuda histogram constructor
      
      * clean src/treelearner/cuda
      
      * gather shared cuda device functions
      
      * put shared CUDA functions into header file
      
      * change smaller leaf from <= back to < for consistent result with CPU
      
      * add tree predictor
      
      * remove useless cuda_tree_predictor
      
      * predict on CUDA with pipeline
      
      * add global sort algorithms
      
      * add global argsort for queries with many items in ranking tasks
      
      * remove limitation of maximum number of items per query in ranking
      
      * add cuda metrics
      
      * fix CUDA AUC
      
      * remove debug code
      
      * add regression metrics
      
      * remove useless file
      
      * don't use mask in shuffle reduce
      
      * add more regression objectives
      
      * fix cuda mape loss
      
      add cuda xentropy loss
      
      * use template for different versions of BitonicArgSortDevice
      
      * add multiclass metrics
      
      * add ndcg metric
      
      * fix cross entropy objectives and metrics
      
      * fix cross entropy and ndcg metrics
      
      * add support for customized objective in CUDA
      
      * complete multiclass ova for CUDA
      
      * separate cuda tree learner
      
      * use shuffle based prefix sum
      
      * clean up cuda_algorithms.hpp
      
      * add copy subset on CUDA
      
      * add bagging for CUDA
      
      * clean up code
      
      * copy gradients from host to device
      
      * support bagging without using subset
      
      * add support of bagging with subset for CUDAColumnData
      
      * add support of bagging with subset for dense CUDARowData
      
      * refactor copy sparse subrow
      
      * use copy subset for column subset
      
      * add reset train data and reset config for CUDA tree learner
      
      add destructors for cuda tree learner
      
      * add USE_CUDA ifdef to cuda tree learner files
      
      * check that dataset doesn't contain CUDA tree learner
      
      * remove printf debug information
      
      * use full new cuda tree learner only when using single GPU
      
      * disable all CUDA code when using CPU version
      
      * recover main.cpp
      
      * add cpp files for multi value bins
      
      * update LightGBM.vcxproj
      
      * update LightGBM.vcxproj
      
      fix lint errors
      
      * fix lint errors
      
      * fix lint errors
      
      * update Makevars
      
      fix lint errors
      
      * fix the case with 0 feature and 0 bin
      
      fix split finding for invalid leaves
      
      create cuda column data when loaded from bin file
      
      * fix lint errors
      
      hide GetRowWiseData when cuda is not used
      
      * recover default device type to cpu
      
      * fix na_as_missing case
      
      fix cuda feature meta information
      
      * fix UpdateDataIndexToLeafIndexKernel
      
      * create CUDA trees when needed in CUDADataPartition::UpdateTrainScore
      
      * add refit by tree for cuda tree learner
      
      * fix test_refit in test_engine.py
      
      * create set of large bin partitions in CUDARowData
      
      * add histogram construction for columns with a large number of bins
      
      * add find best split for categorical features on CUDA
      
      * add bitvectors for categorical split
      
      * cuda data partition split for categorical features
      
      * fix split tree with categorical feature
      
      * fix categorical feature splits
      
      * refactor cuda_data_partition.cu with multi-level templates
      
      * refactor CUDABestSplitFinder by grouping task information into struct
      
      * pre-allocate space for vector split_find_tasks_ in CUDABestSplitFinder
      
      * fix misuse of reference
      
      * remove useless changes
      
      * add support for path smoothing
      
      * virtual destructor for LightGBM::Tree
      
      * fix overlapped cat threshold in best split infos
      
      * reset histogram pointers in data partition and split finder in ResetConfig
      
      * comment useless parameter
      
      * fix reverse case when na is missing and default bin is zero
      
      * fix mfb_is_na and mfb_is_zero and is_single_feature_column
      
      * remove debug log
      
      * fix cat_l2 when one-hot
      
      fix gradient copy when data subset is used
      
      * switch shared histogram size according to CUDA version
      
      * gpu_use_dp=true when cuda test
      
      * revert modification in config.h
      
      * fix setting of gpu_use_dp=true in .ci/test.sh
      
      * fix linter errors
      
      * fix linter error
      
      remove useless change
      
      * recover main.cpp
      
      * separate cuda_exp and cuda
      
      * fix ci bash scripts
      
      add description for cuda_exp
      
      * add USE_CUDA_EXP flag
      
      * switch off USE_CUDA_EXP
      
      * revert changes in python-packages
      
      * more careful separation for USE_CUDA_EXP
      
      * fix CUDARowData::DivideCUDAFeatureGroups
      
      fix set fields for cuda metadata
      
      * revert config.h
      
      * fix test settings for cuda experimental version
      
      * skip some tests due to unsupported features or differences in implementation details for CUDA Experimental version
      
      * fix lint issue by adding a blank line
      
      * fix lint errors by resorting imports
      
      * fix lint errors by resorting imports
      
      * fix lint errors by resorting imports
      
      * merge cuda.yml and cuda_exp.yml
      
      * update python version in cuda.yml
      
      * remove cuda_exp.yml
      
      * remove unrelated changes
      
      * fix compilation warnings
      
      fix cuda exp ci task name
      
      * recover task
      
      * use multi-level template in histogram construction
      
      check split only in debug mode
      
      * ignore NVCC related lines in parameter_generator.py
      
      * update job name for CUDA tests
      
      * apply review suggestions
      
      * Update .github/workflows/cuda.yml
      Co-authored-by: Nikita Titov <nekit94-08@mail.ru>
      
      * Update .github/workflows/cuda.yml
      Co-authored-by: Nikita Titov <nekit94-08@mail.ru>
      
      * update header
      
      * remove useless TODOs
      
      * remove [TODO(shiyu1994): constrain the split with min_data_in_group] and record in #5062
      
      * #include <LightGBM/utils/log.h> for USE_CUDA_EXP only
      
      * fix include order
      
      * fix include order
      
      * remove extra space
      
      * address review comments
      
      * add warning when cuda_exp is used together with deterministic
      
      * add comment about gpu_use_dp in .ci/test.sh
      
      * revert changing order of included headers
      Co-authored-by: Yu Shi <shiyu1994@qq.com>
      Co-authored-by: Nikita Titov <nekit94-08@mail.ru>
  11. 14 Nov, 2021 2 commits
  12. 13 Nov, 2021 2 commits
  13. 10 Nov, 2021 1 commit
    • cmake: use object library to avoid duplicate compilation. (#4489) · 15a63696
      Chen Yufei authored
      
      
      * cmake: use object library to avoid duplicate compilation.
      
      * debug: verbose make log for building r package.
      
      * Include /usr/local/include for AppleClang.
      
      * Revert "debug: verbose make log for building r package."
      
      * update cmake comment and fix indentation
      
      * debug cmake USE_DEBUG.
      
      * Revert "debug cmake USE_DEBUG."
      
      * Add -fPIC for building shared library.
      
      * Always set -fPIC for non MSVC compiler.
      
      * debug: print exception in setup.py
      
      * debug: print cmake output for vs build.
      
      * debug: set opencl related target_xxx on lightgbm_objs.
      
      * Define compile definitions, link libraries on lightgbm_objs.
      
      * Add PUBLIC to target_link_libraries to expose library dependency.
      
      * Use target_link_libraries on object library.
      
      This should propagate usage requirements.
      
      * Fix CUDA linking.
      
      Linking object library (lightgbm_objs) to object library (histograms)
      does not link objects.
      
      * Use PUBLIC link for lightgbm lib.
      
      * Set cuda related properties on final targets.
      
      * Remove debugging changes.
      
      Revert "debug: print exception in setup.py"
      Revert "debug: print cmake output for vs build."
      etc.
      
      * Remove -D_lightgbm_EXPORTS.
      
      * Revert to add -fPIC only for NOT USE_DEBUG.
      
      * Enable PIC for shared lib.
      
      * Fix enable PIC.
      
      * Use -fPIC for shared lib.
      
      * testlightgbm depends only on object files.
      
      * tweak build for R.
      
      * Try to remove OpenMP related include dir settings.
      
      * link with openmp for capi object library.
      
      * Use PUBLIC for _lightgbm target_link_libraries.
      
      * Try removing exports definition.
      
      * fix typo
      Co-authored-by: Nikita Titov <nekit94-08@mail.ru>
      
      * fix typo
      Co-authored-by: Nikita Titov <nekit94-08@mail.ru>
      
      * Add some comments for cmake code.
      
      * Try to fix cmake warnings CUDA.
      
      * revert accidentally committed R-package path change.
      
      * Try to fix cmake CUDA warnings, set for _lightgbm target.
      
      * Try to fix cmake CUDA warnings, set for lightgbm target.
      
      * empty commit to trigger ci
      Co-authored-by: Nikita Titov <nekit94-08@mail.ru>
  14. 16 Oct, 2021 1 commit
  15. 05 Oct, 2021 1 commit
  16. 22 Sep, 2021 1 commit
  17. 09 Sep, 2021 1 commit
  18. 10 Jul, 2021 1 commit
  19. 26 Jun, 2021 1 commit
  20. 16 May, 2021 1 commit
  21. 15 May, 2021 1 commit
  22. 16 Apr, 2021 1 commit
  23. 21 Mar, 2021 1 commit
  24. 12 Mar, 2021 1 commit
  25. 19 Feb, 2021 1 commit
  26. 01 Feb, 2021 2 commits
  27. 31 Jan, 2021 1 commit
    • [CUDA] Support CUDA 9 and test different CUDA versions at CI (#3880) · 0c42cb40
      Nikita Titov authored
      * test different CUDA versions
      
      * Update cuda.yml
      
      * Update cuda.yml
      
      * Update setup.sh
      
      * Update setup.sh
      
      * Update cuda.yml
      
      * Update cuda.yml
      
      * Update setup.sh
      
      * Update setup.sh
      
      * Update setup.sh
      
      * Update setup.sh
      
      * Update cuda.yml
      
      * Update cuda.yml
      
      * Update cuda.yml
      
      * Update CMakeLists.txt
      
      * Update CMakeLists.txt
      
      * Update cuda.yml
      
      * Update cuda.yml
      
      * Update cuda.yml
      
      * Update CMakeLists.txt
      
      * specify CUDA version in workflow job name
  28. 22 Jan, 2021 1 commit
    • Move compute and eigen libraries to external_libs folder (#3809) · 6bb6164e
      Nikita Titov authored
      * move all submodules to external_libs folder
      
      * Update .Rbuildignore
      
      * Update MANIFEST.in
      
      * Update .appveyor.yml
      
      * Update CMakeLists.txt
      
      * Update build_r.R
      
      * Update test.sh
      
      * Update setup.py
      
      * Update CMakeLists.txt
      
      * Update test.sh
      
      * Update setup.py
      
      * Update conf.py
      
      * Update MANIFEST.in
      
      * Update LightGBM.vcxproj
      
      * continue
      
      * test
      
      * test
      
      * Update setup.py
      
      * hotfix
      
      * revert CI tests
  29. 18 Jan, 2021 1 commit
  30. 29 Dec, 2020 1 commit
  31. 28 Dec, 2020 1 commit
    • small code and docs refactoring (#3681) · 5a460846
      Nikita Titov authored
      * small code and docs refactoring
      
      * Update CMakeLists.txt
      
      * Update .vsts-ci.yml
      
      * Update test.sh
      
      * continue
      
      * continue
      
      * revert stable sort for all-unique values
  32. 24 Dec, 2020 1 commit
    • Trees with linear models at leaves (#3299) · fcfd4132
      Belinda Trotta authored
      * Add Eigen library.
      
      * Working for simple test.
      
      * Apply changes to config params.
      
      * Handle nan data.
      
      * Update docs.
      
      * Add test.
      
      * Only load raw data if boosting=gbdt_linear
      
      * Remove unneeded code.
      
      * Minor updates.
      
      * Update to work with sk-learn interface.
      
      * Update to work with chunked datasets.
      
      * Throw error if we try to create a Booster with an already-constructed dataset having incompatible parameters.
      
      * Save raw data in binary dataset file.
      
      * Update docs and fix parameter checking.
      
      * Fix dataset loading.
      
      * Add test for regularization.
      
      * Fix bugs when saving and loading tree.
      
      * Add test for load/save linear model.
      
      * Remove unneeded code.
      
      * Fix case where not enough leaf data for linear model.
      
      * Simplify code.
      
      * Speed up code.
      
      * Speed up code.
      
      * Simplify code.
      
      * Speed up code.
      
      * Fix bugs.
      
      * Working version.
      
      * Store feature data column-wise (not fully working yet).
      
      * Fix bugs.
      
      * Speed up.
      
      * Speed up.
      
      * Remove unneeded code.
      
      * Small speedup.
      
      * Speed up.
      
      * Minor updates.
      
      * Remove unneeded code.
      
      * Fix bug.
      
      * Fix bug.
      
      * Speed up.
      
      * Speed up.
      
      * Simplify code.
      
      * Remove unneeded code.
      
      * Fix bug, add more tests.
      
      * Fix bug and add test.
      
      * Only store numerical features
      
      * Fix bug and speed up using templates.
      
      * Speed up prediction.
      
      * Fix bug with regularisation
      
      * Visual studio files.
      
      * Working version
      
      * Only check nans if necessary
      
      * Store coeff matrix as an array.
      
      * Align cache lines
      
      * Align cache lines
      
      * Preallocation coefficient calculation matrices
      
      * Small speedups
      
      * Small speedup
      
      * Reverse cache alignment changes
      
      * Change to dynamic schedule
      
      * Update docs.
      
      * Refactor so that linear tree learner is not a separate class.
      
      * Add refit capability.
      
      * Speed up
      
      * Small speedups.
      
      * Speed up add prediction to score.
      
      * Fix bug
      
      * Fix bug and speed up.
      
      * Speed up dataload.
      
      * Speed up dataload
      
      * Use vectors instead of pointers
      
      * Fix bug
      
      * Add OMP exception handling.
      
      * Change return type of LGBM_BoosterGetLinear to bool
      
      * Change return type of LGBM_BoosterGetLinear back to int, only parameter type needed to change
      
      * Remove unused internal_parent_ property of tree
      
      * Remove unused parameter to CreateTreeLearner
      
      * Remove reference to LinearTreeLearner
      
      * Minor style issues
      
      * Remove unneeded check
      
      * Reverse temporary testing change
      
      * Fix Visual Studio project files
      
      * Restore LightGBM.vcxproj.filters
      
      * Speed up
      
      * Speed up
      
      * Simplify code
      
      * Update docs
      
      * Simplify code
      
      * Initialise storage space for max num threads
      
      * Move Eigen to include directory and delete unused files
      
      * Remove old files.
      
      * Fix so it compiles with mingw
      
      * Fix gpu tree learner
      
      * Change AddPredictionToScore back to const
      
      * Fix python lint error
      
      * Fix C++ lint errors
      
      * Change eigen to a submodule
      
      * Update comment
      
      * Add the eigen folder
      
      * Try to fix build issues with eigen
      
      * Remove eigen files
      
      * Add eigen as submodule
      
      * Fix include paths
      
      * Exclude eigen files from Python linter
      
      * Ignore eigen folders for pydocstyle
      
      * Fix C++ linting errors
      
      * Fix docs
      
      * Fix docs
      
      * Exclude eigen directories from doxygen
      
      * Update manifest to include eigen
      
      * Update build_r to include eigen files
      
      * Fix compiler warnings
      
      * Store raw feature data as float
      
      * Use float for calculating linear coefficients
      
      * Remove eigen directory from GLOB
      
      * Don't compile linear model code when building R package
      
      * Fix doxygen issue
      
      * Fix lint issue
      
      * Fix lint issue
      
      * Remove unneeded code
      
      * Restore deleted lines
      
      * Restore deleted lines
      
      * Change return type of has_raw to bool
      
      * Update docs
      
      * Rename some variables and functions for readability
      
      * Make tree_learner parameter const in AddScore
      
      * Fix style issues
      
      * Pass vectors as const reference when setting tree properties
      
      * Make temporary storage of serial_tree_learner mutable so we can make the object's methods const
      
      * Remove get_raw_size, use num_numeric_features instead
      
      * Fix typo
      
      * Make contains_nan_ and any_nan_ properties immutable again
      
      * Remove data_has_nan_ property of tree
      
      * Remove temporary test code
      
      * Make linear_tree a dataset param
      
      * Fix lint error
      
      * Make LinearTreeLearner a separate class
      
      * Fix lint errors
      
      * Fix lint error
      
      * Add linear_tree_learner.o
      
      * Simulate omp_get_max_threads if openmp is not available
      
      * Update PushOneData to also store raw data.
      
      * Cast size to int
      
      * Fix bug in ReshapeRaw
      
      * Speed up code with multithreading
      
      * Use OMP_NUM_THREADS
      
      * Speed up with multithreading
      
      * Update to use ArrayToString
      
      * Fix tests
      
      * Fix test
      
      * Fix bug introduced in merge
      
      * Minor updates
      
      * Update docs
  33. 21 Nov, 2020 1 commit
  34. 09 Nov, 2020 2 commits