Commits · 3670e476b1918e72e72ccf00bca69d398827e24f · tianlh / LightGBM-DCU

10 Feb, 2020 1 commit

Refactoring monotone constraints (linked to #2305) (#2717) · 3670e476

CharlesAuguste authored Feb 10, 2020



* Move monotone constraints to the monotone_constraints files.

* Add checks for debug mode.

* Refactored FindBestSplitsFromHistograms.

* Add headers.

* fix

* Update data_parallel_tree_learner.cpp

* simplify ComputeBestSplitForFeature

* Fix min / max issue.

* Remove duplicated check.
Co-authored-by: Guolin Ke <guolin.ke@outlook.com>

3670e476

08 Feb, 2020 1 commit

various minor style, docs and cpplint improvements (#2747) · 1c1a2765

Nikita Titov authored Feb 09, 2020

* various minor style, docs and cpplint improvements

* fixed typo in warning

* fix recently added cpplint errors

* move note for params upper in description for consistency

1c1a2765

02 Feb, 2020 1 commit

Support both row-wise and col-wise multi-threading (#2699) · 509c2e50

Guolin Ke authored Feb 02, 2020



* commit

* fix a bug

* fix bug

* reset to track changes

* refine the auto choose logic

* sort the time stats output

* fix include

* change  multi_val_bin_sparse_threshold

* add cmake

* add _mm_malloc and _mm_free for cross platform

* fix cmake bug

* timer for split

* try to fix cmake

* fix tests

* refactor DataPartition::Split

* fix test

* typo

* formating

* Revert "formating"

This reverts commit 5b8de4f7fb9d975ee23701d276a66d40ee6d4222.

* add document

* [R-package] Added tests on use of force_col_wise and force_row_wise in training (#2719)

* naming

* fix gpu code

* Update include/LightGBM/bin.h
Co-Authored-By: James Lamb <jaylamb20@gmail.com>

* Update src/treelearner/ocl/histogram16.cl

* test: swap compilers for CI

* fix omp

* not avx2

* no aligned for feature histogram

* Revert "refactor DataPartition::Split"

This reverts commit 256e6d9641ade966a1f54da1752e998a1149b6f8.

* slightly refactor data partition

* reduce the memory cost
Co-authored-by: James Lamb <jaylamb20@gmail.com>
Co-authored-by: Nikita Titov <nekit94-08@mail.ru>

509c2e50

20 Jan, 2020 1 commit
- fix #2696 (#2700) · c7ae833e
  Vladimir authored Jan 20, 2020
```
Use an OpenMP critical section for clEnqueueWriteBuffer
```
  c7ae833e
22 Sep, 2019 1 commit

fix many cpp lint errors (#2426) · f1a14869

Guolin Ke authored Sep 22, 2019

* fix many cpp lint errors

* indent

* fix bug

* fix more

* fix gpu

* more fixes

f1a14869

30 Apr, 2019 1 commit
- Fix build of GPU debugging code (#2071) · 57ed8794
  remcob-gr authored Apr 30, 2019
  
  57ed8794
13 Apr, 2019 1 commit
- added copyright message in files (#2101) · 32ef7603
  Nikita Titov authored Apr 13, 2019
  
  32ef7603
11 Apr, 2019 1 commit

reworked includes in source files (#2066) · 50ce01b5

Nikita Titov authored Apr 12, 2019

* added all necessary includes - fixed build/include_what_you_use error

* fixed the order of includes (build/include_order)

50ce01b5

01 Apr, 2019 1 commit
- addressed cpplint error about C-style cast (#2064) · 2027f6b4
  Nikita Titov authored Apr 01, 2019
  
  2027f6b4
26 Mar, 2019 1 commit
- fixed cpplint error about spaces and newlines (#2068) · 3c999be3
  Nikita Titov authored Mar 26, 2019
  
  3c999be3
02 Feb, 2019 1 commit
- cpplint whitespaces and new lines (#1986) · 90127b52
  Nikita Titov authored Feb 02, 2019
  
  90127b52
22 Jan, 2019 1 commit
- [docs] fixed minor typos in documentation (#1959) · f3080967
  James Lamb authored Jan 22, 2019
```
* fixed minor typos in documentation

* fixed typo in gpu_tree_learner.cpp

* Update .gitignore
```
  f3080967
20 Dec, 2018 1 commit
- fix trival typo (#1915) · 92e95e62
  Lingyi Hu authored Dec 20, 2018
  
  92e95e62
29 Aug, 2018 1 commit

[R-package] CRAN fixes (#1499) · eded794e

James Lamb authored Aug 28, 2018

* Fixed typos in docs

* Fixed inconsistencies in documentation

* Updated strategy for registering routines

* Fixed issues caused by smashing multiple functions into one Rd

* Fixed issues with documentation

* Removed VignetteBuilder and updated Rbuildignore

* Added R build artefacts to gitignore

* Added namespacing on data.table set function. Updated handling of CMakeLists file to get around CRAN check.

* Updated build instructions

* Added R build script

* Removed build_r.sh script and updated R-package install instructions

eded794e

20 May, 2018 1 commit

Refine config object (#1381) · dc699574

Guolin Ke authored May 20, 2018

* [WIP] refine config

* [wip] ready for the auto code generate

* auto generate config codes

* use with to open file

* fix bug

* fix pylint

* fix bug

* fix pylint

* fix bugs.

* tmp for failed test.

* fix tests.

* added nthreads alias

* added new aliases from new config.h

* fixed duplicated alias

* refactored parameter_generator.py

* added new aliases from config.h and removed remaining old names

* fix bugs & some miss alias

* added aliases

* add more descriptions.

* add comment.

dc699574

11 May, 2018 1 commit
- [python] decode error description (#1362) · 899151fc
  Nikita Titov authored May 11, 2018
```
* decode error description

* added break line char in log massages
```
  899151fc
24 Apr, 2018 1 commit
- add force_split functionality (#1310) · 84fef715
  Jerry Liu authored Apr 24, 2018
  
  84fef715
18 Apr, 2018 1 commit
- Monotone Constraint (#1314) · e005cdb0
  Guolin Ke authored Apr 18, 2018
  
  e005cdb0
30 Jun, 2017 1 commit
- clean code for tree learner. · 82e273ba
  Guolin Ke authored Jun 30, 2017
  
  82e273ba
24 Apr, 2017 2 commits
- Better debugging info when OpenCL compilation fails (#449) · dbfa16c3
  Huan Zhang authored Apr 23, 2017
```
On some OpenCL platforms the GPU code may fail to compile.
This patch fixes the previously non-working exception handler
under this failure case and prints a nice build log.
```
  dbfa16c3
- Temporary fix for Intel OpenCL installation (#448) · 608a47b2
  Laurae authored Apr 24, 2017
  
  608a47b2
23 Apr, 2017 1 commit

Fix compilation problems with MSVC (#443) · 8487d0a2

Huan Zhang authored Apr 23, 2017

* Fix warnings when compiled with -pedantic

* add -DBOOST_ALL_NO_LIB for windows build

* fix some more MSVC warnings

* Break OpenCL string literal to smaller pieces to avoid error C2026 of MSVC

The string was longer than the limit of 16380 single-byte characters.

This affects Visual Studio 2005 - 2015. Untested on VS 2017.

8487d0a2

17 Apr, 2017 1 commit

Revert "[WIP]faster histogram sum up" (#422) · 062bfa79

Guolin Ke authored Apr 17, 2017

* Revert "python-package: support valid_names in scikit-learn API (#420)"

This reverts commit de39dbcf.

* Revert "faster histogram sum up (#418)"

This reverts commit 98c7c2a3.

062bfa79

16 Apr, 2017 1 commit

faster histogram sum up (#418) · 98c7c2a3

Guolin Ke authored Apr 16, 2017

* some refactor.

* two stage sum up to reduce sum up error.

* add more two-stage sumup.

* some refactor.

* add alignment.

* change name to aligned_allocator.

* remove some useless sumup.

* fix a warning.

* add -march=native .

* remove the padding of gradients.

* no alignment.

* fix test.

* change KNumSumupGroup to 32768.

* change gcc flags.

98c7c2a3

09 Apr, 2017 1 commit

Initial GPU acceleration support for LightGBM (#368) · 0bb4a825

Huan Zhang authored Apr 09, 2017

* add dummy gpu solver code

* initial GPU code

* fix crash bug

* first working version

* use asynchronous copy

* use a better kernel for root

* parallel read histogram

* sparse features now works, but no acceleration, compute on CPU

* compute sparse feature on CPU simultaneously

* fix big bug; add gpu selection; add kernel selection

* better debugging

* clean up

* add feature scatter

* Add sparse_threshold control

* fix a bug in feature scatter

* clean up debug

* temporarily add OpenCL kernels for k=64,256

* fix up CMakeList and definition USE_GPU

* add OpenCL kernels as string literals

* Add boost.compute as a submodule

* add boost dependency into CMakeList

* fix opencl pragma

* use pinned memory for histogram

* use pinned buffer for gradients and hessians

* better debugging message

* add double precision support on GPU

* fix boost version in CMakeList

* Add a README

* reconstruct GPU initialization code for ResetTrainingData

* move data to GPU in parallel

* fix a bug during feature copy

* update gpu kernels

* update gpu code

* initial port to LightGBM v2

* speedup GPU data loading process

* Add 4-bit bin support to GPU

* re-add sparse_threshold parameter

* remove kMaxNumWorkgroups and allows an unlimited number of features

* add feature mask support for skipping unused features

* enable kernel cache

* use GPU kernels withoug feature masks when all features are used

* REAdme.

* update README

* fix typos (#349)

* change compile to gcc on Apple as default

* clean vscode related file

* refine api of constructing from sampling data.

* fix bug in the last commit.

* more efficient algorithm to sample k from n.

* fix bug in filter bin

* change to boost from average output.

* fix tests.

* only stop training when all classes are finshed in multi-class.

* limit the max tree output. change hessian in multi-class objective.

* robust tree model loading.

* fix test.

* convert the probabilities to raw score in boost_from_average of classification.

* fix the average label for binary classification.

* Add boost_from_average to docs (#354)

* don't use "ConvertToRawScore" for self-defined objective function.

* boost_from_average seems doesn't work well in binary classification. remove it.

* For a better jump link (#355)

* Update Python-API.md

* for a better jump in page

A space is needed between `#` and the headers content according to Github's markdown format [guideline](https://guides.github.com/features/mastering-markdown/)

After adding the spaces, we can jump to the exact position in page by click the link.

* fixed something mentioned by @wxchan

* Update Python-API.md

* add FitByExistingTree.

* adapt GPU tree learner for FitByExistingTree

* avoid NaN output.

* update boost.compute

* fix typos (#361)

* fix broken links (#359)

* update README

* disable GPU acceleration by default

* fix image url

* cleanup debug macro

* remove old README

* do not save sparse_threshold_ in FeatureGroup

* add details for new GPU settings

* ignore submodule when doing pep8 check

* allocate workspace for at least one thread during builing Feature4

* move sparse_threshold to class Dataset

* remove duplicated code in GPUTreeLearner::Split

* Remove duplicated code in FindBestThresholds and BeforeFindBestSplit

* do not rebuild ordered gradients and hessians for sparse features

* support feature groups in GPUTreeLearner

* Initial parallel learners with GPU support

* add option device, cleanup code

* clean up FindBestThresholds; add some omp parallel

* constant hessian optimization for GPU

* Fix GPUTreeLearner crash when there is zero feature

* use np.testing.assert_almost_equal() to compare lists of floats in tests

* travis for GPU

0bb4a825