Commits · b88cf8afaba4c69aff90e968c38fdb6cc7fbfa01 · tianlh / LightGBM-DCU

28 Dec, 2022 1 commit

Decouple Boosting Types (fixes #3128) (#4827) · fffd066c

Yifei Liu authored Dec 28, 2022



* add parameter data_sample_strategy

* abstract GOSS as a sample strategy (GOSS1), together with original GOSS (Normal Bagging has not been abstracted, so do NOT use it now)

* abstract Bagging as a subclass (BAGGING), but original Bagging members in GBDT are still kept

* fix some variables

* remove GOSS(as boost) and Bagging logic in GBDT

* rename GOSS1 to GOSS(as sample strategy)

* add warning about using GOSS as boosting_type

* a little ; bug

* remove CHECK when "gradients != nullptr"

* rename DataSampleStrategy to avoid confusion

* remove and add some comments, following convention

* fix bug about GBDT::ResetConfig (ObjectiveFunction inconsistency bet…

* add std::ignore to avoid compiler warnings (and potential failures)

* update Makevars and vcxproj

* handle constant hessian

move resize of gradient vectors out of sample strategy

* mark override for IsHessianChange

* fix lint errors

* rerun parameter_generator.py

* update config_auto.cpp

* delete redundant blank line

* update num_data_ when train_data_ is updated

set gradients and hessians when GOSS

* check bagging_freq is not zero

* reset config_ value

merge ResetBaggingConfig and ResetGOSS

* remove useless check

* add tests in test_engine.py

* remove whitespace in blank line

* remove arguments verbose_eval and evals_result

* Update tests/python_package_test/test_engine.py

reduce num_boost_round
Co-authored-by: James Lamb <jaylamb20@gmail.com>

* Update src/boosting/sample_strategy.cpp

modify warning about setting goss as `boosting_type`
Co-authored-by: James Lamb <jaylamb20@gmail.com>

* Update tests/python_package_test/test_engine.py

replace load_boston() with make_regression()

remove value checks of mean_squared_error in test_sample_strategy_with_boosting()

* Update tests/python_package_test/test_engine.py

add value checks of mean_squared_error in test_sample_strategy_with_boosting()

* Modify warning about using goss as boosting type

* Update tests/python_package_test/test_engine.py

add random_state=42 for make_regression()

reduce the threshold of mean_square_error

* Update src/boosting/sample_strategy.cpp
Co-authored-by: James Lamb <jaylamb20@gmail.com>

* remove goss from boosting types in documentation

* Update src/boosting/bagging.hpp
Co-authored-by: Nikita Titov <nekit94-08@mail.ru>

* Update src/boosting/bagging.hpp
Co-authored-by: Nikita Titov <nekit94-08@mail.ru>

* Update src/boosting/goss.hpp
Co-authored-by: Nikita Titov <nekit94-08@mail.ru>

* Update src/boosting/goss.hpp
Co-authored-by: Nikita Titov <nekit94-08@mail.ru>

* rename GOSS with GOSSStrategy

* update doc

* address comments

* fix table in doc

* Update include/LightGBM/config.h
Co-authored-by: Nikita Titov <nekit94-08@mail.ru>

* update documentation

* update test case

* revert useless change in test_engine.py

* add tests for evaluation results in test_sample_strategy_with_boosting

* include <string>

* change to assert_allclose in test_goss_boosting_and_strategy_equivalent

* more tolerance in result checking, due to minor difference in results of gpu versions

* change == to np.testing.assert_allclose

* fix test case

* set gpu_use_dp to true

* change --report to --report-level for rstcheck

* use gpu_use_dp=true in test_goss_boosting_and_strategy_equivalent

* revert unexpected changes of non-ascii characters

* remove useless changes

* allocate gradients_pointer_ and hessians_pointer when necessary

* add spaces

* remove redundant virtual

* include <LightGBM/utils/log.h> for USE_CUDA

* check for identity in test_goss_boosting_and_strategy_equivalent

* check for identity in test_sample_strategy_with_boosting

* remove cuda  option in test_sample_strategy_with_boosting

* Update tests/python_package_test/test_engine.py
Co-authored-by: Nikita Titov <nekit94-08@mail.ru>

* Update tests/python_package_test/test_engine.py
Co-authored-by: James Lamb <jaylamb20@gmail.com>

* ResetGradientBuffers after ResetSampleConfig

* ResetGradientBuffers after bagging

* remove useless code

* check objective_function_ instead of gradients

* enable rf with goss

simplify params in test cases

* remove useless changes

* allow rf with feature subsampling alone

* change position of ResetGradientBuffers

* check for dask

* add parameter types for data_sample_strategy
Co-authored-by: Guangda Liu <v-guangdaliu@microsoft.com>
Co-authored-by: Yu Shi <shiyu_k1994@qq.com>
Co-authored-by: GuangdaLiu <90019144+GuangdaLiu@users.noreply.github.com>
Co-authored-by: James Lamb <jaylamb20@gmail.com>
Co-authored-by: Nikita Titov <nekit94-08@mail.ru>

fffd066c
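
For readers unfamiliar with GOSS, the sampling step that the commit above moves behind `data_sample_strategy` can be sketched roughly as follows. This is a minimal pure-Python illustration of the general GOSS idea (keep the rows with the largest gradients, randomly sample some of the rest, and up-weight the sampled rows), not the C++ code from this commit. The helper name `goss_sample` is hypothetical; its `top_rate` and `other_rate` arguments are only named after the corresponding LightGBM parameters.

```python
import random


def goss_sample(gradients, top_rate=0.2, other_rate=0.1, seed=0):
    """Hypothetical helper: return (indices, weights) picked GOSS-style."""
    n = len(gradients)
    top_k = int(n * top_rate)
    other_k = int(n * other_rate)

    # Rank rows by absolute gradient, largest first.
    order = sorted(range(n), key=lambda i: abs(gradients[i]), reverse=True)
    top, rest = order[:top_k], order[top_k:]

    # Randomly keep a subset of the small-gradient rows.
    rng = random.Random(seed)
    sampled = rng.sample(rest, min(other_k, len(rest)))

    # Up-weight the sampled small-gradient rows to compensate for the
    # rows that were dropped.
    amplify = (1.0 - top_rate) / other_rate if other_rate > 0 else 0.0

    weights = {i: 1.0 for i in top}
    weights.update({i: amplify for i in sampled})
    indices = sorted(weights)
    return indices, [weights[i] for i in indices]
```

After this change, GOSS would presumably be requested through parameters along the lines of `{"boosting": "gbdt", "data_sample_strategy": "goss"}` rather than by setting `goss` as the boosting type, which is what the warning added in this commit is about.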

27 Dec, 2022 1 commit

[CUDA] Add L2 metric for new CUDA version (#5633) · 6482b47e

shiyu1994 authored Dec 27, 2022

* add rmse metric for new cuda version

* add Init for CUDAMetricInterface

* fix lint errors

* fix rmse and add l2 metric for new cuda version

* use CUDAL2Metric

* explicit template instantiation

* write result only with the first thread

* pre allocate buffer for output converting

* fix l2 regression with cuda metric evaluation

* weighting loss in cuda metric evaluation

* mark CUDATree::AsConstantTree as override

6482b47e
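
As a reference point for what the metrics in the commit above compute, here is a small CPU-side sketch of a weighted L2 metric and the RMSE derived from it. It assumes the usual definitions (weighted mean of squared errors, and its square root); the function names `l2_metric` and `rmse_metric` are illustrative and are not the CUDA classes named in the commit.

```python
import math


def l2_metric(labels, preds, weights=None):
    """Weighted mean squared error; unit weights when none are given."""
    if weights is None:
        weights = [1.0] * len(labels)
    total = sum(w * (y - p) ** 2 for y, p, w in zip(labels, preds, weights))
    return total / sum(weights)


def rmse_metric(labels, preds, weights=None):
    """Square root of the weighted L2 metric."""
    return math.sqrt(l2_metric(labels, preds, weights))
```

A sketch like this can serve as a slow reference when checking a GPU metric against a CPU result within a tolerance.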

02 Dec, 2022 1 commit
- [CUDA] Add rmse metric for new CUDA version (#5611) · f0cfbff6
  shiyu1994 authored Dec 02, 2022
```
* add rmse metric for new cuda version

* add Init for CUDAMetricInterface

* fix lint errors
```
  f0cfbff6
29 Nov, 2022 1 commit
- Fix OpenMP thread allocation in Linux (#5551) · 4c5d0fbb
  Scott Votaw authored Nov 29, 2022
  
  4c5d0fbb
27 Nov, 2022 1 commit

[CUDA] Add Poisson regression objective for cuda_exp and refactor objective... · 24af9fa5

shiyu1994 authored Nov 27, 2022


[CUDA] Add Poisson regression objective for cuda_exp and refactor objective functions for cuda_exp (#5486)

* add poisson regression objective for cuda_exp

* enable Poisson regression for cuda_exp

* refactor cuda objective functions

* remove useless changes

* fix linter errors

* remove redundant buffer in cuda poisson regression objective

* fix log of cuda_exp binary objective

* fix threshold of poisson objective result

* remove useless changes

* fix compilation errors

* add cuda quantile regression objective

* remove cuda quantile regression objective
Co-authored-by: James Lamb <jaylamb20@gmail.com>

24af9fa5

06 Nov, 2022 1 commit
- [CUDA] Add multiclass_ova objective for cuda_exp (#5491) · f1d3181c
  shiyu1994 authored Nov 06, 2022
  
  f1d3181c
11 Oct, 2022 4 commits
- renamed cur_cat => cur_cat_idx and added some comments (#5522) · c35ecfbf
  Zhuyi Xue authored Oct 11, 2022
  
  c35ecfbf
- [python-package][R-package] load parameters from model file (fixes #2613) (#5424) · 8b720844
  José Morales authored Oct 11, 2022
  
  8b720844
- suppress alias warnings with verbosity<0 (fixes #4518) (#5253) · 46427128
  José Morales authored Oct 10, 2022
  
  46427128
- renamed tmp_num_sample_values to non_na_cnt (#5521) · c5391c97
  Zhuyi Xue authored Oct 10, 2022
  
  c5391c97
11 Sep, 2022 1 commit
- Remove redundant whitespaces (#5480) · 952458a9
  Ilya Chernov authored Sep 11, 2022
```
remove redundant whitespaces
```
  952458a9
09 Sep, 2022 1 commit

[CUDA] Add multiclass objective for cuda_exp (#5473) · 3d4e08e1

shiyu1994 authored Sep 09, 2022

* add multiclass objective for cuda_exp

* remove debug code

* add includes requested by lint checks

* fix compilation failure for cuda with cuda-9.0

* clean code

3d4e08e1

07 Sep, 2022 3 commits
- [CUDA] Add feature interaction constraint for cuda_exp (fix #4785) (#5474) · 1444a748
  shiyu1994 authored Sep 07, 2022
```
* add feature interaction constraint for cuda_exp

* test feature interaction constraints for cuda_exp

* remove useless check

* update comment
```
  1444a748
- [CUDA] Add rank_xendcg objective for cuda_exp (#5472) · a46c68fe
  shiyu1994 authored Sep 07, 2022
  
  a46c68fe
- [CUDA] Add fair regression objective for cuda_exp (#5469) · 7c503ba3
  shiyu1994 authored Sep 07, 2022
```
* change percentile in CUDARegressionL1loss::BoostFromScore to 0.5
```
  7c503ba3
06 Sep, 2022 1 commit
- fix references to 'object function' (#5468) · f0e32962
  James Lamb authored Sep 05, 2022
  
  f0e32962
05 Sep, 2022 2 commits

[CUDA] Add lambdarank objective for cuda_exp (#5453) · 1d5f46f6

shiyu1994 authored Sep 05, 2022



* add lambdarank for cuda_exp

* support unlimited number of ranks in labels

* fix lint errors

* remove warning for lambdarank with cuda_exp

* Update src/objective/cuda/cuda_rank_objective.hpp
Co-authored-by: Nikita Titov <nekit94-08@mail.ru>

* Update src/objective/cuda/cuda_rank_objective.hpp
Co-authored-by: Nikita Titov <nekit94-08@mail.ru>
Co-authored-by: Nikita Titov <nekit94-08@mail.ru>

1d5f46f6

Fix CUDA `#ifndef` guards (#5466) · c9a3b479

Nikita Titov authored Sep 05, 2022

* Update cuda_column_data.hpp

* Update cuda_metadata.hpp

* Update cuda_objective_function.hpp

* Update cuda_row_data.hpp

* Update cuda_regression_objective.hpp

c9a3b479

02 Sep, 2022 2 commits

[CUDA] Add Huber regression objective for cuda_exp (#5462) · 45c53f78

shiyu1994 authored Sep 02, 2022

* add huber regression for cuda_exp

* renew tree output on GPU

add test cases for regression objectives

* remove useless changes

* add white space

* fix test_regression

45c53f78

Rename Metadata num_classes to be more clear (#5461) · 7d1276ad
Scott Votaw authored Sep 01, 2022
```
Rename num_classes to be more clear
```
7d1276ad

01 Sep, 2022 1 commit

[CUDA] Add L1 regression objective for cuda_exp (#5457) · d78b6bc2

shiyu1994 authored Sep 01, 2022

* add (l1) regression objective for cuda_exp

* remove RenewTreeOutputCUDA from CUDARegressionL2loss

* remove mutable and use CUDAVector

* remove white spaces

* remove TODO and document in (#5459)

d78b6bc2

31 Aug, 2022 2 commits

[CUDA] L2 regression objective for cuda_exp (#5452) · 9e89ee7f
shiyu1994 authored Aug 31, 2022
```
* add (l2) regression objective for cuda_exp

* fix lint errors

* correct time tag
```
9e89ee7f

[CUDA] Add binary objective for cuda_exp (#5425) · 2b8fe8b4

shiyu1994 authored Aug 31, 2022

* add binary objective for cuda_exp

* include <string> and <vector>

* exchange include ordering

* fix length of score to copy in evaluation

* fix EvalOneMetric

* fix cuda binary objective and prediction when boosting on gpu

* Add white space

* fix BoostFromScore for CUDABinaryLogloss

update log in test_register_logger

* include <algorithm>

* simplify shared memory buffer

2b8fe8b4

29 Aug, 2022 1 commit

[ci][fix] Fix cuda_exp ci (#5438) · be7f3213

shiyu1994 authored Aug 29, 2022



* fix cuda_exp ci

* fix ci failures introduced by #5279

* cleanup cuda.yml

* fix test.sh

* clean up test.sh

* clean up test.sh

* skip lines by cuda_exp in test_register_logger

* Update tests/python_package_test/test_utilities.py
Co-authored-by: Nikita Titov <nekit94-08@mail.ru>
Co-authored-by: Nikita Titov <nekit94-08@mail.ru>

be7f3213

28 Aug, 2022 1 commit
- include parameters from reference dataset on subset (fixes #5402) (#5416) · 5079de4a
  José Morales authored Aug 28, 2022
```
* include parameters from reference dataset on copy

* lint

* set non-default parameters
```
  5079de4a
25 Aug, 2022 1 commit
- update tree to if-else (#5422) · 39eb041f
  José Morales authored Aug 25, 2022
```
* update tree to if-else

* add missing )

* fix case

* trigger ci
```
  39eb041f
20 Aug, 2022 1 commit
- [fix] change the destructor of ScoreUpdater to virtual (fixes #5400) (#5403) · 865c126a
  shiyu1994 authored Aug 20, 2022
```
change the destructor of ScoreUpdater to virtual
```
  865c126a
16 Aug, 2022 1 commit

Add default definition for GetColWiseData and GetColWiseData (#5413) · 9489f878

shiyu1994 authored Aug 16, 2022

* add default definition for GetColWiseData and GetColWiseData

* fix warnings of template instantiation

* remove files in Makevars and LightGBM.vcxproj

9489f878

10 Aug, 2022 1 commit

feature: Add true streaming APIs to reduce client-side memory usage (#5299) · 0a5c5838

Scott Votaw authored Aug 10, 2022

* Extract streaming to own PR

* small merge fixes and cleanup

* linting fixes

* fix cast warning

* Fix accidental deletion during branch transfer

* responded to initial triage comments

* Added more tests to use create-from-samples APIs

* added mutex and adjusted nclasses logic

* Fix thread-safety for pushing data to sparse bins through Push APIs

* lint and doc fixes

* Small SWIG fix

* nit fix

* Responded to StrikerRUS comments

* fix breaking change after merge with master

* Extract streaming to own PR

* small merge fixes and cleanup

* Fix accidental deletion during branch transfer

* responded to initial triage comments

* Added more tests to use create-from-samples APIs

* Fix rstcheck call in ci

* remove TODOs

* Extract streaming to own PR

* small merge fixes and cleanup

* Fix accidental deletion during branch transfer

* responded to initial triage comments

* Added more tests to use create-from-samples APIs

* Small SWIG fix

* remove ci change

* responded to shiyu1994 comments

* responded to StrikerRUS comments

* Fixes from StrikerRUS comments

0a5c5838

03 Aug, 2022 2 commits

Minor CUDA cleanup (#5394) · c7102e56

Nikita Titov authored Aug 03, 2022



* Update README.rst

* Update cuda_score_updater.cu
Co-authored-by: James Lamb <jaylamb20@gmail.com>

c7102e56

Fix potential overflow in linear trees (#5395) · e2dfcd69

Nikita Titov authored Aug 03, 2022



* Fix potential overflow in linear trees

* simplify
Co-authored-by: James Lamb <jaylamb20@gmail.com>

e2dfcd69

30 Jul, 2022 1 commit

reproducible parameter alias resolution for wrappers (fixes #5304) (#5338) · 83627ff0

José Morales authored Jul 30, 2022

* dump sorted parameter aliases

* update lgb.check.wrapper_param

* update _choose_param_value to look like lgb.check.wrapper_param

* apply suggestions from review

* reduce diff

* move DumpAliases to config

* remove unnecessary check

* restore parameter check

83627ff0
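
The idea behind reproducible alias resolution can be sketched as below. This is an assumption-laden illustration rather than the code from the commit: the helper `choose_param_value` and its signature are hypothetical, and the alias set passed in the usage example is only a small illustrative subset. The point is that scanning aliases in sorted order makes the winner independent of dictionary or set ordering.

```python
from copy import deepcopy


def choose_param_value(main_name, aliases, params, default):
    """Hypothetical helper: collapse aliases of one parameter into main_name."""
    params = deepcopy(params)
    # Sorting makes the choice deterministic when several aliases are given.
    candidates = sorted(a for a in aliases if a != main_name)

    if main_name not in params:
        for alias in candidates:
            if alias in params:
                params[main_name] = params[alias]
                break
        else:
            # No alias was supplied either, so fall back to the default.
            params[main_name] = default

    # Drop every alias so only the main name remains.
    for alias in candidates:
        params.pop(alias, None)
    return params
```

For example, `choose_param_value("num_iterations", {"num_boost_round", "n_estimators", "num_trees"}, {"num_boost_round": 20, "n_estimators": 50}, 100)` picks `n_estimators` under this sketch, because it sorts first among the supplied aliases.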

29 Jul, 2022 2 commits
- Use double precision in threaded calculation of linear tree coefficients (fixes #5226) (#5368) · 44d37184
  Belinda Trotta authored Jul 30, 2022
  
  44d37184
- [CUDA] Initial work for boosting and evaluation with CUDA (#5279) · e0af160a
  shiyu1994 authored Jul 29, 2022
```
* initial work for boosting and evaluation with CUDA

* fix compatibility with CPU code

* fix creating objective without USE_CUDA_EXP

* fix static analysis errors

* fix static analysis errors
```
  e0af160a
21 Jul, 2022 1 commit

fix: Adjust LGBM_DatasetCreateFromSampledColumn to handle distributed data (#5344) · f94050a4

Scott Votaw authored Jul 21, 2022

* Adjust LGBM_DatasetCreateFromSampledColumn to handle distributed data better

* linting fix

* switch to 1 API with breaking change

* Fix Python native call

* more python test fixes

f94050a4

27 Jun, 2022 1 commit

[python-package] check feature names in predict with dataframe (fixes #812) (#4909) · bdb02e05

José Morales authored Jun 27, 2022



* check feature names and order in predict with dataframe

* slice df in predict to remove the target

* scramble features

* handle int column names

* only change column order when needed

* include validate_features param in booster and sklearn estimators

* document validate_features argument

* use all_close in preds checks and check for assertion error to compare different arrays

* perform remapping and checks in cpp

* remove extra logs

* fixes

* revert cpp

* proposal

* remove extra arg

* lint

* restore _data_from_pandas arguments

* Apply suggestions from code review
Co-authored-by: Nikita Titov <nekit94-08@mail.ru>

* move data conversion to Predictor.predict

* use Vector2Ptr
Co-authored-by: Nikita Titov <nekit94-08@mail.ru>

bdb02e05
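
The kind of check described in the commit above can be illustrated with a short sketch. It is a simplified assumption of the behavior, not the package's implementation: the helper name `validate_feature_names` is hypothetical, and it only compares the column names of the prediction data with the feature names stored in the model, raising an error on any mismatch.

```python
def validate_feature_names(column_names, model_feature_names):
    """Hypothetical helper: raise ValueError if names or order differ."""
    # Column labels may be integers, so compare their string forms.
    columns = [str(c) for c in column_names]
    expected = [str(f) for f in model_feature_names]

    if len(columns) != len(expected):
        raise ValueError(
            f"expected {len(expected)} features, got {len(columns)}"
        )
    for position, (got, want) in enumerate(zip(columns, expected)):
        if got != want:
            raise ValueError(
                f"feature at position {position} is '{got}', expected '{want}'"
            )
```

In the package itself this behavior is tied to the `validate_features` argument mentioned in the commit message.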

08 Jun, 2022 1 commit

Clear split info buffer in cost efficient gradient boosting before every... · f1328d5c

shiyu1994 authored Jun 08, 2022

Clear split info buffer in cost efficient gradient boosting before every iteration (fix partially #3679) (#5164)

* clear split info buffer in cegb_ before every iteration

* check nullable of cegb_ in serial_tree_learner.cpp

* add a test case for checking the split buffer in CEGB

* switch to Threading::For instead of raw OpenMP

* apply review suggestions

* apply review comments

* remove device cpu

f1328d5c

02 Jun, 2022 1 commit
- [c++][fix] check nullable of bin mappers in dataset_loader.cpp (fix #5221) (#5258) · fa9e4527
  shiyu1994 authored Jun 02, 2022
```
check nullable of bin mappers
```
  fa9e4527
29 May, 2022 1 commit
- Remove leftovers after the drop of Solaris support (#5248) · fb37e507
  Nikita Titov authored May 29, 2022
```
* Update tree.cpp

* Update common.h

* Update common.h
```
  fb37e507
23 May, 2022 1 commit

Check existence of inet_pton for win32 in CMakeLists.txt (fixes #5019) (#5159) · 17bfe1a1

shiyu1994 authored May 24, 2022



* check existence of inet_pton for win32 in CMakeLists.txt (fix #5019)

* remove extra spaces

* add check for inet_pton in configure.win

* Update CMakeLists.txt
Co-authored-by: Nikita Titov <nekit94-08@mail.ru>

* Update src/network/socket_wrapper.hpp
Co-authored-by: Nikita Titov <nekit94-08@mail.ru>

* Update src/network/socket_wrapper.hpp
Co-authored-by: Nikita Titov <nekit94-08@mail.ru>

* Update CMakeLists.txt
Co-authored-by: Nikita Titov <nekit94-08@mail.ru>

* Update R-package/configure.win
Co-authored-by: Nikita Titov <nekit94-08@mail.ru>

* Update CMakeLists.txt
Co-authored-by: James Lamb <jaylamb20@gmail.com>

* fix comments in CMakeLists.txt

* include check for WIN64

* remove WIN64 flag checks

* fix #ifdef

* Update R-package/configure.win
Co-authored-by: Nikita Titov <nekit94-08@mail.ru>
Co-authored-by: Nikita Titov <nekit94-08@mail.ru>
Co-authored-by: James Lamb <jaylamb20@gmail.com>

17bfe1a1