- 11 Apr, 2019 1 commit
-
-
Nikita Titov authored
* added all necessary includes - fixed build/include_what_you_use error
* fixed the order of includes (build/include_order)
-
- 04 Apr, 2019 1 commit
-
-
remcob-gr authored
* Add configuration parameters for CEGB.
* Add skeleton CEGB tree learner
  Like the original CEGB version, this inherits from SerialTreeLearner. Currently, it changes nothing from the original.
* Track features used in CEGB tree learner.
* Pull CEGB tradeoff and coupled feature penalty from config.
* Implement finding best splits for CEGB
  This is heavily based on the serial version, but just adds using the coupled penalties.
* Set proper defaults for cegb parameters.
* Ensure sanity checks don't switch off CEGB.
* Implement per-data-point feature penalties in CEGB.
* Implement split penalty and remove unused parameters.
* Merge changes from CEGB tree learner into serial tree learner
* Represent features_used_in_data by a bitset, to reduce the memory overhead of CEGB, and add sanity checks for the lengths of the penalty vectors.
* Fix bug where CEGB would incorrectly penalise a previously used feature
  The tree learner did not update the gains of previously computed leaf splits when splitting a leaf elsewhere in the tree. This caused it to prefer new features due to incorrectly penalising splitting on previously used features.
* Document CEGB parameters and add them to the appropriate section.
* Remove leftover reference to cegb tree learner.
* Remove outdated diff.
* Fix warnings
* Fix minor issues identified by @StrikerRUS.
* Add docs section on CEGB, including citation.
* Fix link.
* Fix CI failure.
* Add some unit tests
* Fix pylint issues.
* Fix remaining pylint issue
-
- 06 Feb, 2019 1 commit
-
-
Nikita Titov authored
-
- 02 Feb, 2019 1 commit
-
-
Nikita Titov authored
-
- 17 Dec, 2018 1 commit
-
-
Guolin Ke authored
* fix RF's bugs
* fix tests
* rollback num_iterations
* fix a bug and reduce memory costs
* reduce memory cost
-
- 16 Aug, 2018 1 commit
-
-
Guolin Ke authored
* fix include
* reduce dependency on header file
* fix build
-
- 20 May, 2018 1 commit
-
-
Guolin Ke authored
* [WIP] refine config
* [wip] ready for the auto code generate
* auto generate config codes
* use with to open file
* fix bug
* fix pylint
* fix bug
* fix pylint
* fix bugs.
* tmp for failed test.
* fix tests.
* added nthreads alias
* added new aliases from new config.h
* fixed duplicated alias
* refactored parameter_generator.py
* added new aliases from config.h and removed remaining old names
* fix bugs & some missing aliases
* added aliases
* add more descriptions.
* add comment.
-
- 24 Apr, 2018 1 commit
-
-
Jerry Liu authored
-
- 16 Jan, 2018 1 commit
-
-
Guolin Ke authored
-
- 28 Dec, 2017 1 commit
-
-
Luke Gallagher authored
-
- 19 Dec, 2017 1 commit
-
-
Guolin Ke authored
* add code for refit tree
* add implementation.
* update documents.
* clean code
* fix a type
-
- 04 Dec, 2017 1 commit
-
-
Guolin Ke authored
* fix feature fraction
* fix bugs.
-
- 30 Jun, 2017 1 commit
-
-
Guolin Ke authored
-
- 17 Apr, 2017 1 commit
-
- 16 Apr, 2017 1 commit
-
-
Guolin Ke authored
* some refactor.
* two stage sum up to reduce sum up error.
* add more two-stage sumup.
* some refactor.
* add alignment.
* change name to aligned_allocator.
* remove some useless sumup.
* fix a warning.
* add -march=native.
* remove the padding of gradients.
* no alignment.
* fix test.
* change KNumSumupGroup to 32768.
* change gcc flags.
-
- 09 Apr, 2017 1 commit
-
-
Huan Zhang authored
* add dummy gpu solver code
* initial GPU code
* fix crash bug
* first working version
* use asynchronous copy
* use a better kernel for root
* parallel read histogram
* sparse features now work, but no acceleration, compute on CPU
* compute sparse feature on CPU simultaneously
* fix big bug; add gpu selection; add kernel selection
* better debugging
* clean up
* add feature scatter
* Add sparse_threshold control
* fix a bug in feature scatter
* clean up debug
* temporarily add OpenCL kernels for k=64,256
* fix up CMakeList and definition USE_GPU
* add OpenCL kernels as string literals
* Add boost.compute as a submodule
* add boost dependency into CMakeList
* fix opencl pragma
* use pinned memory for histogram
* use pinned buffer for gradients and hessians
* better debugging message
* add double precision support on GPU
* fix boost version in CMakeList
* Add a README
* reconstruct GPU initialization code for ResetTrainingData
* move data to GPU in parallel
* fix a bug during feature copy
* update gpu kernels
* update gpu code
* initial port to LightGBM v2
* speedup GPU data loading process
* Add 4-bit bin support to GPU
* re-add sparse_threshold parameter
* remove kMaxNumWorkgroups and allows an unlimited number of features
* add feature mask support for skipping unused features
* enable kernel cache
* use GPU kernels without feature masks when all features are used
* REAdme.
* REAdme.
* update README
* fix typos (#349)
* change compile to gcc on Apple as default
* clean vscode related file
* refine api of constructing from sampling data.
* fix bug in the last commit.
* more efficient algorithm to sample k from n.
* fix bug in filter bin
* change to boost from average output.
* fix tests.
* only stop training when all classes are finished in multi-class.
* limit the max tree output. change hessian in multi-class objective.
* robust tree model loading.
* fix test.
* convert the probabilities to raw score in boost_from_average of classification.
* fix the average label for binary classification.
* Add boost_from_average to docs (#354)
* don't use "ConvertToRawScore" for self-defined objective function.
* boost_from_average doesn't seem to work well in binary classification. remove it.
* For a better jump link (#355)
* Update Python-API.md
* for a better jump in page
  A space is needed between `#` and the headers content according to Github's markdown format [guideline](https://guides.github.com/features/mastering-markdown/)
  After adding the spaces, we can jump to the exact position in page by clicking the link.
* fixed something mentioned by @wxchan
* Update Python-API.md
* add FitByExistingTree.
* adapt GPU tree learner for FitByExistingTree
* avoid NaN output.
* update boost.compute
* fix typos (#361)
* fix broken links (#359)
* update README
* disable GPU acceleration by default
* fix image url
* cleanup debug macro
* remove old README
* do not save sparse_threshold_ in FeatureGroup
* add details for new GPU settings
* ignore submodule when doing pep8 check
* allocate workspace for at least one thread during building Feature4
* move sparse_threshold to class Dataset
* remove duplicated code in GPUTreeLearner::Split
* Remove duplicated code in FindBestThresholds and BeforeFindBestSplit
* do not rebuild ordered gradients and hessians for sparse features
* support feature groups in GPUTreeLearner
* Initial parallel learners with GPU support
* add option device, cleanup code
* clean up FindBestThresholds; add some omp parallel
* constant hessian optimization for GPU
* Fix GPUTreeLearner crash when there is zero feature
* use np.testing.assert_almost_equal() to compare lists of floats in tests
* travis for GPU
-
- 05 Apr, 2017 2 commits
- 30 Mar, 2017 1 commit
-
-
Guolin Ke authored
-
- 25 Mar, 2017 1 commit
-
-
Guolin Ke authored
-
- 13 Mar, 2017 1 commit
-
-
Guolin Ke authored
-
- 07 Mar, 2017 1 commit
-
-
Guolin Ke authored
-
- 01 Mar, 2017 2 commits
-
-
Guolin Ke authored
commit 70f88c54f6820d4c3824d97b34042c617fb21635
Author: Guolin Ke <i@yumumu.me>
Date: Wed Feb 22 19:40:10 2017 +0800
    further reduce guided
commit f54620b807c68f5ac6cef521d787ca2750efe464
Author: Guolin Ke <i@yumumu.me>
Date: Wed Feb 22 19:32:36 2017 +0800
    avoid to use guided
-
Guolin Ke authored
-
- 22 Jan, 2017 1 commit
-
-
Guolin Ke authored
-
- 10 Jan, 2017 1 commit
-
-
Guolin Ke authored
-
- 18 Dec, 2016 1 commit
-
-
Guolin Ke authored
-
- 05 Dec, 2016 1 commit
-
-
Guolin Ke authored
Categorical feature support (#108)
-
- 18 Nov, 2016 1 commit
-
-
Guolin Ke authored
* RAII for utils, application and c_api (partial)
* raii for class in include folder
* raii for application and boosting
* raii for dataset and dataset loader
* raii for dense bin and parser
* RAII refactor for almost all classes
* RAII for c_api
* clean code
* refine repeated code
* Decouple the "sigmoid" between objective and boosting.
* change std::vector<bool> back to std::vector<char> due to concurrency problem
* slightly reduce some memory cost
-
- 15 Nov, 2016 1 commit
-
-
Allard van Mossel authored
-
- 02 Nov, 2016 1 commit
-
-
Guolin Ke authored
-
- 01 Nov, 2016 1 commit
-
-
Guolin Ke authored
-
- 31 Oct, 2016 1 commit
-
-
Guolin Ke authored
-
- 25 Oct, 2016 1 commit
-
-
Hui Xue authored
-
- 24 Oct, 2016 1 commit
-
-
Guolin Ke authored
-
- 22 Oct, 2016 1 commit
-
-
Guolin Ke authored
-
- 08 Aug, 2016 1 commit
-
-
Guolin Ke authored
-
- 05 Aug, 2016 1 commit
-
-
Guolin Ke authored
-