- 07 Oct, 2019 1 commit
-
-
James Lamb authored
* fixed miscellaneous typos in documentation * fix typo introduced in typo-fixing PR
-
- 22 Sep, 2019 1 commit
-
-
Guolin Ke authored
* fix many cpp lint errors * indent * fix bug * fix more * fix gpu * more fixes
-
- 08 Sep, 2019 1 commit
-
-
CharlesAuguste authored
* Some basic changes to the plot of the trees to make them readable. * Squeezed the information in the nodes. * Added colouring when a dictionnary mapping the features to the constraints is passed. * Fix spaces. * Added data percentage as an option in the nodes. * Squeezed the information in the leaves. * Important information is now in bold. * Added a legend for the color of monotone splits. * Changed "split_gain" to "gain" and "internal_value" to "value". * Sqeezed leaves a bit more. * Changed description in the legend. * Revert "Sqeezed leaves a bit more." This reverts commit dd8bf14a3ba604b0dfae3b7bb1c64b6784d15e03. * Increased the readability for the gain. * Tidied up the legend. * Added the data percentage in the leaves. * Added the monotone constraints to the dumped model. * Monotone constraints are now specified automatically when plotting trees. * Raise an exception instead of the bug that was here before. * Removed operators on the branches for a clearer design. * Small cleaning of the code. * Setting a monotone constraint on a categorical feature now returns an exception instead of doing nothing. * Fix bug when monotone constraints are empty. * Fix another bug when monotone constraints are empty. * Variable name change. * Added is / isn't on every edge of the trees. * Fix test "tree_create_digraph". * Add new test for plotting trees with monotone constraints. * Typo. * Update documentation of categorical features. * Typo. * Information in nodes more explicit. * Used regular strings instead of raw strings. * Small refactoring. * Some cleaning. * Added future statement. * Changed output for consistency. * Updated documentation. * Added comments for colors. * Changed text on edges for more clarity. * Small refactoring. * Modified text in leaves for consistency with nodes. * Updated default values and documentaton for consistency. * Replaced CHECK with Log::Fatal for user-friendliness. * Updated tests. * Typo. * Simplify imports. * Swapped count and weight to improve readibility of the leaves in the plotted trees. * Thresholds in bold. * Made information in nodes written in a specific order. * Added information to clarify legend. * Code cleaning.
-
- 14 Aug, 2019 1 commit
-
-
Guolin Ke authored
* fix nan in tree model * fix
-
- 18 Jul, 2019 1 commit
-
-
Guolin Ke authored
* throw error when meet non ascii * check ascii for config strings.
-
- 13 Apr, 2019 2 commits
-
-
Nikita Titov authored
-
Nikita Titov authored
-
- 11 Apr, 2019 1 commit
-
-
Nikita Titov authored
* added all necessary includes - fixed build/include_what_you_use error * fixed the order of includes (build/include_order)
-
- 04 Apr, 2019 1 commit
-
-
remcob-gr authored
* Add configuration parameters for CEGB. * Add skeleton CEGB tree learner Like the original CEGB version, this inherits from SerialTreeLearner. Currently, it changes nothing from the original. * Track features used in CEGB tree learner. * Pull CEGB tradeoff and coupled feature penalty from config. * Implement finding best splits for CEGB This is heavily based on the serial version, but just adds using the coupled penalties. * Set proper defaults for cegb parameters. * Ensure sanity checks don't switch off CEGB. * Implement per-data-point feature penalties in CEGB. * Implement split penalty and remove unused parameters. * Merge changes from CEGB tree learner into serial tree learner * Represent features_used_in_data by a bitset, to reduce the memory overhead of CEGB, and add sanity checks for the lengths of the penalty vectors. * Fix bug where CEGB would incorrectly penalise a previously used feature The tree learner did not update the gains of previously computed leaf splits when splitting a leaf elsewhere in the tree. This caused it to prefer new features due to incorrectly penalising splitting on previously used features. * Document CEGB parameters and add them to the appropriate section. * Remove leftover reference to cegb tree learner. * Remove outdated diff. * Fix warnings * Fix minor issues identified by @StrikerRUS. * Add docs section on CEGB, including citation. * Fix link. * Fix CI failure. * Add some unit tests * Fix pylint issues. * Fix remaining pylint issue
-
- 01 Apr, 2019 1 commit
-
-
Nikita Titov authored
-
- 26 Mar, 2019 1 commit
-
-
Nikita Titov authored
-
- 06 Feb, 2019 1 commit
-
-
Nikita Titov authored
-
- 02 Feb, 2019 1 commit
-
-
Nikita Titov authored
-
- 30 Jan, 2019 1 commit
-
-
Guolin Ke authored
* always save the score of the first round in early stopping fix #1971 * avoid using std::log on non-positive numbers * remove unnecessary changes * add tests * Update test_sklearn.py * enhanced tests
-
- 01 Nov, 2018 1 commit
-
-
Guolin Ke authored
-
- 10 Oct, 2018 1 commit
-
-
Guolin Ke authored
* fix ndcg consistency. * more stable sorts * Update gbdt_model_text.cpp * Update dataset.cpp * Update gbdt_model_text.cpp
-
- 21 Aug, 2018 1 commit
-
-
Qiwei Ye authored
* remove unnecessary std::move * remove unused-lambda-capture * remove unused-lambda-capture * fix unused parameter * minor fix * invalid capture of lambda function
-
- 16 Aug, 2018 1 commit
-
-
Guolin Ke authored
* fix include * reduce dependency on header file * fix build
-
- 14 Jun, 2018 1 commit
-
-
Guolin Ke authored
* add per-feature-penalites * fix comment
-
- 11 May, 2018 1 commit
-
-
Nikita Titov authored
* decode error description * added break line char in log massages
-
- 20 Apr, 2018 1 commit
-
-
Adilet Zhaxybay authored
-
- 18 Apr, 2018 1 commit
-
-
Guolin Ke authored
-
- 27 Feb, 2018 1 commit
-
-
ebernhardson authored
* Read and write datsets from hdfs. * Only enabled when cmake is run with -DUSE_HDFS:BOOL=TRUE * Introduces VirtualFile(Reader|Writer) to asbtract VFS differences
-
- 16 Jan, 2018 1 commit
-
-
Guolin Ke authored
-
- 15 Jan, 2018 2 commits
- 12 Jan, 2018 1 commit
-
-
Guolin Ke authored
* fix a bug in `ArgMaxAtK`
-
- 11 Dec, 2017 1 commit
-
-
Guolin Ke authored
-
- 26 Nov, 2017 1 commit
-
-
Guolin Ke authored
* remove protobuf * add version number * remove pmml script * use float for split gain * fix warnings * refine the read model logic of gbdt * fix compile error * improve decode speed * fix some bugs * fix double accuracy problem * fix bug * multi-thread save model * speed up save model to string * parallel save/load model * fix some warnings. * fix warnings. * fix a bug * remove debug output * fix doc * fix max_bin warning in tests. * fix max_bin warning * fix pylint * clean code for stringToArray * clean code for TToString * remove max_bin * replace "class" with typename
-
- 16 Nov, 2017 1 commit
-
-
zhangjin authored
add 'null' to missing value
-
- 14 Nov, 2017 1 commit
-
-
wxchan authored
* refine common.h * fix typo * specify captured variables
-
- 09 Nov, 2017 1 commit
-
-
wxchan authored
* add init_score & test cpp and python result consistency * try fix common.h * Fix tests (#3) * update atof * fix bug * fix tests. * fix bug * fix dtypes * fix categorical feature override * fix protobuf on vs build (#1004) * [optional] support protobuf * fix windows/LightGBM.vcxproj * add doc * fix doc * fix vs support (#2) * fix vs support * fix cmake * fix #1012 * [python] add network config api (#1019) * add network * update doc * add float tolerance in bin finder. * fix a bug * update tests * add double torelance on tree model * fix tests * simplify the double comparison * fix lightsvm zero base * move double tolerance to the bin finder. * fix pylint * clean test.sh * add sklearn test * remove underline * clean codes * set random_state=None * add last line * fix doc * rename file * try fix test
-
- 12 Oct, 2017 1 commit
-
-
Guolin Ke authored
* add network apis. * support parallel loading dataset in c api. * fix bug * fix bug
-
- 11 Oct, 2017 1 commit
-
-
Guolin Ke authored
* thread local for the network interface. * fix bug.
-
- 08 Oct, 2017 1 commit
-
-
Tony-Y authored
* Fix coding style (#969) Function names must be in the "Pascal Case" style. * check_elements_interval_closed to CheckElementsIntervalClosed * obtain_min_max_sum to ObtainMinMaxSum * Change functions in common.h into template functions (#969) * CheckElementsIntervalClosed * ObtainMinMaxSum These two functions were changed into template functions. * Remove an unpreferable overload * remove an overload of the function ObtainMinMaxSum * Use stringstream to format T type
-
- 07 Oct, 2017 1 commit
-
-
Tony-Y authored
Function names must be in the "Pascal Case" style. * check_elements_interval_closed to CheckElementsIntervalClosed * obtain_min_max_sum to ObtainMinMaxSum
-
- 29 Sep, 2017 1 commit
-
-
olofer authored
* Remove signed/unsigned integer compare warning with g++
-
- 28 Sep, 2017 1 commit
-
-
Guolin Ke authored
-
- 02 Sep, 2017 1 commit
-
-
Guolin Ke authored
-
- 28 Aug, 2017 1 commit
-
-
Guolin Ke authored
-