Commits · d78b6bc2fdf96c87e3cb61f2d497a962e3270c91 · tianlh / LightGBM-DCU

10 Aug, 2022 1 commit

feature: Add true streaming APIs to reduce client-side memory usage (#5299) · 0a5c5838

Scott Votaw authored Aug 10, 2022

* Extract streaming to own PR

* small merge fixes and cleanup

* linting fixes

* fix cast warning

* Fix accidental deletion during branch transfer

* responded to initial triage comments

* Added more tests to use create-from-samples APIs

* added mutex and adjusted nclasses logic

* Fix thread-safety for pushing data to sparse bins through Push APIs

* lint and doc fixes

* Small SWIG fix

* nit fix

* Responded to StrikerRUS comments

* fix breaking change after merge with master

* Extract streaming to own PR

* small merge fixes and cleanup

* Fix accidental deletion during branch transfer

* responded to initial triage comments

* Added more tests to use create-from-samples APIs

* Fix rstcheck call in ci

* remove TODOs

* Extract streaming to own PR

* small merge fixes and cleanup

* Fix accidental deletion during branch transfer

* responded to initial triage comments

* Added more tests to use create-from-samples APIs

* Small SWIG fix

* remove ci change

* responded to shiyu1994 comments

* responded to StrikerRUS comments

* Fixes from StrikerRUS comments

0a5c5838

26 Mar, 2022 1 commit
- Load initial scores with binary data files in CLI version (#4807) · 17d4e007
  shiyu1994 authored Mar 27, 2022
  
  17d4e007
23 Mar, 2022 1 commit

[CUDA] New CUDA version Part 1 (#4630) · 6b56a90c

shiyu1994 authored Mar 23, 2022



* new cuda framework

* add histogram construction kernel

* before removing multi-gpu

* new cuda framework

* tree learner cuda kernels

* single tree framework ready

* single tree training framework

* remove comments

* boosting with cuda

* optimize for best split find

* data split

* move boosting into cuda

* parallel synchronize best split point

* merge split data kernels

* before code refactor

* use tasks instead of features as units for split finding

* refactor cuda best split finder

* fix configuration error with small leaves in data split

* skip histogram construction of too small leaf

* skip split finding of invalid leaves

stop when no leaf to split

* support row wise with CUDA

* copy data for split by column

* copy data from host to CPU by column for data partition

* add synchronize best splits for one leaf from multiple blocks

* partition dense row data

* fix sync best split from task blocks

* add support for sparse row wise for CUDA

* remove useless code

* add l2 regression objective

* sparse multi value bin enabled for CUDA

* fix cuda ranking objective

* support for number of items <= 2048 per query

* speedup histogram construction by interleaving global memory access

* split optimization

* add cuda tree predictor

* remove comma

* refactor objective and score updater

* before use struct

* use structure for split information

* use structure for leaf splits

* return CUDASplitInfo directly after finding best split

* split with CUDATree directly

* use cuda row data in cuda histogram constructor

* clean src/treelearner/cuda

* gather shared cuda device functions

* put shared CUDA functions into header file

* change smaller leaf from <= back to < for consistent result with CPU

* add tree predictor

* remove useless cuda_tree_predictor

* predict on CUDA with pipeline

* add global sort algorithms

* add global argsort for queries with many items in ranking tasks

* remove limitation of maximum number of items per query in ranking

* add cuda metrics

* fix CUDA AUC

* remove debug code

* add regression metrics

* remove useless file

* don't use mask in shuffle reduce

* add more regression objectives

* fix cuda mape loss

add cuda xentropy loss

* use template for different versions of BitonicArgSortDevice

* add multiclass metrics

* add ndcg metric

* fix cross entropy objectives and metrics

* fix cross entropy and ndcg metrics

* add support for customized objective in CUDA

* complete multiclass ova for CUDA

* separate cuda tree learner

* use shuffle based prefix sum

* clean up cuda_algorithms.hpp

* add copy subset on CUDA

* add bagging for CUDA

* clean up code

* copy gradients from host to device

* support bagging without using subset

* add support of bagging with subset for CUDAColumnData

* add support of bagging with subset for dense CUDARowData

* refactor copy sparse subrow

* use copy subset for column subset

* add reset train data and reset config for CUDA tree learner

add deconstructors for cuda tree learner

* add USE_CUDA ifdef to cuda tree learner files

* check that dataset doesn't contain CUDA tree learner

* remove printf debug information

* use full new cuda tree learner only when using single GPU

* disable all CUDA code when using CPU version

* recover main.cpp

* add cpp files for multi value bins

* update LightGBM.vcxproj

* update LightGBM.vcxproj

fix lint errors

* fix lint errors

* fix lint errors

* update Makevars

fix lint errors

* fix the case with 0 feature and 0 bin

fix split finding for invalid leaves

create cuda column data when loaded from bin file

* fix lint errors

hide GetRowWiseData when cuda is not used

* recover default device type to cpu

* fix na_as_missing case

fix cuda feature meta information

* fix UpdateDataIndexToLeafIndexKernel

* create CUDA trees when needed in CUDADataPartition::UpdateTrainScore

* add refit by tree for cuda tree learner

* fix test_refit in test_engine.py

* create set of large bin partitions in CUDARowData

* add histogram construction for columns with a large number of bins

* add find best split for categorical features on CUDA

* add bitvectors for categorical split

* cuda data partition split for categorical features

* fix split tree with categorical feature

* fix categorical feature splits

* refactor cuda_data_partition.cu with multi-level templates

* refactor CUDABestSplitFinder by grouping task information into struct

* pre-allocate space for vector split_find_tasks_ in CUDABestSplitFinder

* fix misuse of reference

* remove useless changes

* add support for path smoothing

* virtual destructor for LightGBM::Tree

* fix overlapped cat threshold in best split infos

* reset histogram pointers in data partition and spllit finder in ResetConfig

* comment useless parameter

* fix reverse case when na is missing and default bin is zero

* fix mfb_is_na and mfb_is_zero and is_single_feature_column

* remove debug log

* fix cat_l2 when one-hot

fix gradient copy when data subset is used

* switch shared histogram size according to CUDA version

* gpu_use_dp=true when cuda test

* revert modification in config.h

* fix setting of gpu_use_dp=true in .ci/test.sh

* fix linter errors

* fix linter error

remove useless change

* recover main.cpp

* separate cuda_exp and cuda

* fix ci bash scripts

add description for cuda_exp

* add USE_CUDA_EXP flag

* switch off USE_CUDA_EXP

* revert changes in python-packages

* more careful separation for USE_CUDA_EXP

* fix CUDARowData::DivideCUDAFeatureGroups

fix set fields for cuda metadata

* revert config.h

* fix test settings for cuda experimental version

* skip some tests due to unsupported features or differences in implementation details for CUDA Experimental version

* fix lint issue by adding a blank line

* fix lint errors by resorting imports

* fix lint errors by resorting imports

* fix lint errors by resorting imports

* merge cuda.yml and cuda_exp.yml

* update python version in cuda.yml

* remove cuda_exp.yml

* remove unrelated changes

* fix compilation warnings

fix cuda exp ci task name

* recover task

* use multi-level template in histogram construction

check split only in debug mode

* ignore NVCC related lines in parameter_generator.py

* update job name for CUDA tests

* apply review suggestions

* Update .github/workflows/cuda.yml
Co-authored-by: Nikita Titov <nekit94-08@mail.ru>

* Update .github/workflows/cuda.yml
Co-authored-by: Nikita Titov <nekit94-08@mail.ru>

* update header

* remove useless TODOs

* remove [TODO(shiyu1994): constrain the split with min_data_in_group] and record in #5062

* #include <LightGBM/utils/log.h> for USE_CUDA_EXP only

* fix include order

* fix include order

* remove extra space

* address review comments

* add warning when cuda_exp is used together with deterministic

* add comment about gpu_use_dp in .ci/test.sh

* revert changing order of included headers
Co-authored-by: Yu Shi <shiyu1994@qq.com>
Co-authored-by: Nikita Titov <nekit94-08@mail.ru>

6b56a90c

17 Mar, 2021 1 commit

Range check for DCG position discount lookup (#4069) · 4580393f

ashok-ponnuswami-msft authored Mar 17, 2021

* Add check to prevent out of index lookup in the position discount table. Add debug logging to report number of queries found in the data.

* Change debug logging location so that we can print the data file name as well.

* Revert "Change debug logging location so that we can print the data file name as well."

This reverts commit 3981b34bd6e0530f89c4733e78e6b6603bf50d48.

* Add data file name to debug logging.

* Move log line to a place where it is output even when query IDs are read from a separate file.

* Also add the out-of-range check to rank metrics.

* Perform check after number of queries is initialized.

* Update

4580393f

19 Feb, 2021 1 commit
- [docs] Change some 'parallel learning' references to 'distributed learning' (#4000) · 7880b79f
  James Lamb authored Feb 19, 2021
```
* [docs] Change some 'parallel learning' references to 'distributed learning'

* found a few more

* one more reference
```
  7880b79f
30 Sep, 2020 1 commit

fix address alignment, required by cran (#3415) · f30dbe87

Guolin Ke authored Sep 30, 2020

* fix dataset binary file alignment

* many fixes

* fix warnings

* fix bug

* Update file_io.cpp

* Update file_io.cpp

* simplify code

* Apply suggestions from code review

* general

* remove unneeded alignment

* Update file_io.h

* int32 to byte8 alignment

* Apply suggestions from code review

* Apply suggestions from code review

f30dbe87

05 Jun, 2020 1 commit
- Revert "re-order includes (fixes #3132) (#3133)" (#3153) · ac5f5e56
  Nikita Titov authored Jun 05, 2020
```
This reverts commit 656d2676.
```
  ac5f5e56
01 Jun, 2020 1 commit
- re-order includes (fixes #3132) (#3133) · 656d2676
  James Lamb authored Jun 01, 2020
  
  656d2676
22 Feb, 2020 1 commit

some code refactoring (#2769) · 3e80df7e

Guolin Ke authored Feb 22, 2020

* some refines

* more omp refactoring

* format define

* fix merge bug

* some fixes

* fix some warnings

* Apply suggestions from code review

* Apply suggestions from code review

* remove dup codes

3e80df7e

20 Feb, 2020 1 commit

remove init-score parameter (#2776) · 3c394c8d

Guolin Ke authored Feb 20, 2020



* remove related cpp codes

* removed more mentiones of init_score_filename params
Co-authored-by: Nikita Titov <nekit94-08@mail.ru>

3c394c8d

08 Feb, 2020 1 commit

various minor style, docs and cpplint improvements (#2747) · 1c1a2765

Nikita Titov authored Feb 09, 2020

* various minor style, docs and cpplint improvements

* fixed typo in warning

* fix recently added cpplint errors

* move note for params upper in description for consistency

1c1a2765

14 Jan, 2020 1 commit

[python] [R-package] Use the same address when updated label/weight/query (#2662) · 82886ba6

Guolin Ke authored Jan 14, 2020

* Update metadata.cpp

* add version for training set, for efficiently update label/weight/... during training.

* Update lgb.Booster.R

82886ba6

29 Dec, 2019 1 commit

warning for init_score in save_binary (#2649) · 7b411bdd

Guolin Ke authored Dec 29, 2019



* warning for init_score in save_binary

fix #2639

* Update metadata.cpp

* added info into docs
Co-authored-by: Nikita Titov <nekit94-08@mail.ru>

7b411bdd

07 Sep, 2019 1 commit
- avoid nan and inf in weight/label/init_score (#2377) · 33d0378f
  Guolin Ke authored Sep 08, 2019
```
* avoid nan and inf in weight/label/init_score

* use prefix increment
```
  33d0378f
13 Apr, 2019 1 commit
- added copyright message in files (#2101) · 32ef7603
  Nikita Titov authored Apr 13, 2019
  
  32ef7603
11 Apr, 2019 1 commit

reworked includes in source files (#2066) · 50ce01b5

Nikita Titov authored Apr 12, 2019

* added all necessary includes - fixed build/include_what_you_use error

* fixed the order of includes (build/include_order)

50ce01b5

02 Feb, 2019 1 commit
- cpplint whitespaces and new lines (#1986) · 90127b52
  Nikita Titov authored Feb 02, 2019
  
  90127b52
11 May, 2018 1 commit
- [python] decode error description (#1362) · 899151fc
  Nikita Titov authored May 11, 2018
```
* decode error description

* added break line char in log massages
```
  899151fc
27 Feb, 2018 1 commit

Experimental support for HDFS (#1243) · 7e186a57

ebernhardson authored Feb 26, 2018

* Read and write datsets from hdfs.
* Only enabled when cmake is run with -DUSE_HDFS:BOOL=TRUE
* Introduces VirtualFile(Reader|Writer) to asbtract VFS differences

7e186a57

17 Dec, 2017 1 commit
- support label as double type (#1120) · aa78a6b9
  Guolin Ke authored Dec 17, 2017
  
  aa78a6b9
18 Aug, 2017 1 commit
- support specific path of initial scores. · 3b161919
  Guolin Ke authored Aug 11, 2017
  
  3b161919
25 Jan, 2017 1 commit
- fix partition error when set weight_colunm · 1765b2e3
  Guolin Ke authored Jan 25, 2017
  
  1765b2e3
10 Jan, 2017 2 commits
- change init_score to double type · 3ef3a489
  Guolin Ke authored Jan 10, 2017
  
  3ef3a489
- change inner prediction score to double type. · 12a96334
  Guolin Ke authored Jan 10, 2017
  
  12a96334
08 Jan, 2017 1 commit

R package (#168) · 551d59ca

Guolin Ke authored Jan 08, 2017

* finish R's c_api

* clean code

* fix sizeof pointer in 32bit system.

* add predictor class

* add Dataset class

* format code

* add booster

* add type check for expose function

* add a simple callback

* add all callbacks

* finish the basic training logic

* update docs

* add an simple training interface

* add basic test

* adapt the changes in c_api

* add test for Dataset

* add test for custom obj/eval functions

* fix python test

* fix bug in metadata init

* fix R CMD check

551d59ca

28 Dec, 2016 1 commit
- decouple num_class in Dataset class · 292f972e
  Guolin Ke authored Dec 28, 2016
  
  292f972e
18 Dec, 2016 1 commit
- fix bug in set_query · cbc56c74
  Guolin Ke authored Dec 18, 2016
  
  cbc56c74
05 Dec, 2016 1 commit
- use ".empty()" to check container · 866a2f91
  Guolin Ke authored Dec 05, 2016
  
  866a2f91
02 Dec, 2016 1 commit

Squash into one commit: · eba6d200

wxchan authored Dec 02, 2016

1. merge python-package
2. add dump model to json
3. fix bugs
4. clean code with pylint
5. update python examples

eba6d200

26 Nov, 2016 2 commits
- some bugs fixed · 83a14174
  Guolin Ke authored Nov 26, 2016
  
  83a14174
- thread-safe for set field of dataset · 6e0b58ba
  Guolin Ke authored Nov 26, 2016
  
  6e0b58ba
24 Nov, 2016 1 commit
- support set/get dataset field with nullptr · 5b4ee9db
  Guolin Ke authored Nov 24, 2016
  
  5b4ee9db
22 Nov, 2016 1 commit
- change some c_api interfaces for better compatibility · a178b75b
  Guolin Ke authored Nov 22, 2016
  
  a178b75b
18 Nov, 2016 1 commit

Refactor for RAII (#86) · 5442ed78

Guolin Ke authored Nov 18, 2016

* RAII for utils, application and c_api(partical)

* raii for class in include folder

* raii for application and boosting

* raii for dataset and dataset loader

* raii for dense bin and parser

* RAII refactor for almost all classes

* RAII for c_api

* clean code

* refine repeated code

* Decouple the "sigmoid" between objective and boosting.

* change std::vector<bool> back to std::vector<char> due to concurrence problem

* slight reduce some memory cost

5442ed78

05 Nov, 2016 1 commit
- add load data from mat · d41c78f9
  Guolin Ke authored Nov 05, 2016
  
  d41c78f9
04 Nov, 2016 1 commit
- use dataset_loader to load data · 1c08e71e
  Guolin Ke authored Nov 04, 2016
  
  1c08e71e
03 Nov, 2016 2 commits
- Improved consistency and wording of user-facing logs and documentation · 8497af62
  Allardvm authored Nov 03, 2016
```
Packages that parse LightGBM’s logs will require minor changes to
parsing logic to work correctly.
```
  8497af62
- support init_score for multiclass classification (#62) · 01ed04df
  wxchan authored Nov 03, 2016
```
support init_score for multiclass classification (#62)
```
  01ed04df
02 Nov, 2016 2 commits
- fixed some IO bugs · 24f1f9cd
  Guolin Ke authored Nov 02, 2016
  
  24f1f9cd
- change float to double back, except for score_t · c96ae6af
  Guolin Ke authored Nov 02, 2016
  
  c96ae6af