- 05 May, 2020 1 commit
-
-
Nikita Titov authored
-
- 10 Apr, 2020 1 commit
-
-
Nikita Titov authored
* Revert "specify the last supported version of scikit-learn (#2637)" This reverts commit d1002776. * ban scikit-learn 0.22.0 and skip broken test * fix updated test * fix lint test * Revert "fix lint test" This reverts commit 8b4db0805fe7a9e7f7eb0be3eac231f85026d196.
-
- 20 Mar, 2020 1 commit
-
-
Lukas Pfannschmidt authored
* Add handling of RandomState object, which is standard for sklearn methods. LightGBM expects an integer seed instead of an object. If passed object is RandomState, we choose random integer based on its state to seed the underlying low level code. While chosen random integer is only in the range between 1 and 1e10 I expect it to have enough entropy (?) to not matter in practice. * Add RandomState object to random_state docstring. * remove blank line * Use property to handle setting random_state. This enables setting cloned estimators with the set_params method in sklearn. * Add docstring to attribute. * Fix and simplify docstring. * Add test case. * Use maximal int for datatype in seed derivation. * Replace random_state property with interfacing in fit method. Derives int seed for C code only when fitting and keeps RandomState object as param. * Adapt unit test to property change. * Extended test case and docstring Co-Authored-By:
Nikita Titov <nekit94-08@mail.ru> * Add more equality checks (feature importance, best iteration/score). * Add equality comparison of boosters represented by strings. Remove useless best_iteration_ comparison (we do not use early_stopping). * fix whitespace * Test if two subsequent fits produce different models * Apply suggestions from code review Co-Authored-By:
Nikita Titov <nekit94-08@mail.ru> Co-authored-by:
Nikita Titov <nekit94-08@mail.ru>
-
- 15 Feb, 2020 1 commit
-
-
Nikita Titov authored
-
- 06 Feb, 2020 1 commit
-
-
zhangqibot authored
* add property: feature_name_ in lgb sklearn api * modify the comments * fix linting errors and add info about new attribute: feature_name_
-
- 19 Dec, 2019 1 commit
-
-
Nikita Titov authored
-
- 09 Dec, 2019 1 commit
-
-
Nikita Titov authored
* clean code * clean code * do not modify args in fit function * added test
-
- 05 Dec, 2019 2 commits
-
-
aaiyer authored
* allow python sklearn interface's fit() to pass init_model to train() * Fix whitespace issues, and change ordering of parameters to be backward compatible * Formatting fixes * allow python sklearn interface's fit() to pass init_model to train() * Fix whitespace issues, and change ordering of parameters to be backward compatible * Formatting fixes * Recognize LGBModel objects for init_model * simplified condition * updated docstring * added test
-
Nikita Titov authored
-
- 27 Oct, 2019 1 commit
-
-
Nikita Titov authored
-
- 22 Oct, 2019 1 commit
-
-
Nikita Titov authored
* handle aliases centralized * convert aliases dict to class
-
- 26 Sep, 2019 1 commit
-
-
Nikita Titov authored
-
- 15 Sep, 2019 1 commit
-
-
kenmatsu4 authored
* Bug fix for first_metric_only if the first metric is train metric. * Update bug fix for feval issue. * Disable feval for first_metric_only. * Additional test items. * Fix wrong assertEqual settings & formating. * Change dataset of test. * Fix random seed for test. * Modiry assumed test result due to different sklearn verion between CI and local. * Remove f-string * Applying variable assumed test result for test. * Fix flake8 error. * Modifying in accordance with review comments. * Modifying for pylint. * simplified tests * Deleting error criteria `if eval_metric is None`. * Delete test items of classification. * Simplifying if condition. * Applying first_metric_only for sklearn wrapper. * Modifying test_sklearn for comforming to python 2.x * Fix flake8 error. * Additional fix for sklearn and add tests. * Bug fix and add test cases. * some refactor * fixed lint * fixed lint * Fix duplicated metrics scores to pass the test. * Fix the case first_metric_only not in params. * Converting metrics aliases. * Add comment. * Modify comment for pylint. * Modify comment for pydocstyle. * Using split test set for two eval_set. * added test case for metric aliases and length checks * minor style fixes * fixed rmse name and alias position * Fix the case metric=[] * Fix using env.model._train_data_name * Fix wrong test condition. * Move initial process to _init() func. * Modify test setting for test_sklearn & training data matching on callback.py * test_sklearn.py -> A test case for training is wrong, so fixed. * callback.py -> A condition of if statement for detecting test dataset is wrong, so fixed. * Support composite name metrics. * Remove metric check process & reduce redundant test cases. For #2273 fixed not only the order of metrics in cpp, removing metric check process at callback.py * Revised according to the matters pointed out on a review. * increased code readability * Fix the issue of order of validation set. * Changing to OrderdDict from default dict for score result. * added missed check in cv function for first_metric_only and feval co-occurrence * keep order only for metrics but not for datasets in best_score * move OrderedDict initialization to init phase * fixed minor printing issues * move first metric detection to init phase and split can be performed without checks * split only once during callback * removed excess code * fixed typo in variable name and squashed ifs * use setdefault * hotfix * fixed failing test * refined tests * refined sklearn test * Making "feval" effective on early stopping. * allow feval and first_metric_only for cv * removed unused code * added tests for feval * fixed printing * add note about whitespaces in feval name * Modifying final iteration process in case valid set is training data.
-
- 08 Sep, 2019 1 commit
-
-
CharlesAuguste authored
* Some basic changes to the plot of the trees to make them readable. * Squeezed the information in the nodes. * Added colouring when a dictionnary mapping the features to the constraints is passed. * Fix spaces. * Added data percentage as an option in the nodes. * Squeezed the information in the leaves. * Important information is now in bold. * Added a legend for the color of monotone splits. * Changed "split_gain" to "gain" and "internal_value" to "value". * Sqeezed leaves a bit more. * Changed description in the legend. * Revert "Sqeezed leaves a bit more." This reverts commit dd8bf14a3ba604b0dfae3b7bb1c64b6784d15e03. * Increased the readability for the gain. * Tidied up the legend. * Added the data percentage in the leaves. * Added the monotone constraints to the dumped model. * Monotone constraints are now specified automatically when plotting trees. * Raise an exception instead of the bug that was here before. * Removed operators on the branches for a clearer design. * Small cleaning of the code. * Setting a monotone constraint on a categorical feature now returns an exception instead of doing nothing. * Fix bug when monotone constraints are empty. * Fix another bug when monotone constraints are empty. * Variable name change. * Added is / isn't on every edge of the trees. * Fix test "tree_create_digraph". * Add new test for plotting trees with monotone constraints. * Typo. * Update documentation of categorical features. * Typo. * Information in nodes more explicit. * Used regular strings instead of raw strings. * Small refactoring. * Some cleaning. * Added future statement. * Changed output for consistency. * Updated documentation. * Added comments for colors. * Changed text on edges for more clarity. * Small refactoring. * Modified text in leaves for consistency with nodes. * Updated default values and documentaton for consistency. * Replaced CHECK with Log::Fatal for user-friendliness. * Updated tests. * Typo. * Simplify imports. * Swapped count and weight to improve readibility of the leaves in the plotted trees. * Thresholds in bold. * Made information in nodes written in a specific order. * Added information to clarify legend. * Code cleaning.
-
- 04 Jun, 2019 2 commits
-
-
Nikita Titov authored
* fixed class_weight * fixed lint * added test * hotfix
-
Nikita Titov authored
* Update sklearn.py * Update parameter_generator.py
-
- 27 May, 2019 1 commit
-
-
Nikita Titov authored
[python] fixed picklability of sklearn models with custom obj and updated docstings for custom obj (#2191) * refactored joblib test * fixed picklability of sklearn models with custom obj and updated docstings for custom obj * pickled model should be able to predict without refitting
-
- 15 May, 2019 2 commits
-
-
Laurae authored
* PR #1879 * Update docs with parameter_generator.py * Update wrapper doc for sklearn
-
Nikita Titov authored
* added ability to pass first_metric_only in params * simplified tests * fixed test * fixed punctuation
-
- 06 May, 2019 1 commit
-
-
Nikita Titov authored
-
- 28 Apr, 2019 1 commit
-
-
Nikita Titov authored
-
- 19 Apr, 2019 2 commits
-
-
Nikita Titov authored
* ignore pandas ordered categorical columns by default * fix tests * fix tests * added comments
-
Scott Lundberg authored
* Update doc string for pred_contrib See comments at the end of #1969 * Update basic.py * Update basic.py * update doc strings * update equals sign in doc string * strip whitespace and gen rst * strip whitespace
-
- 18 Apr, 2019 1 commit
-
-
Nikita Titov authored
-
- 25 Mar, 2019 1 commit
-
-
kenmatsu4 authored
* Use first_metric_only flag for early_stopping function. In order to apply early stopping with only first metric, applying first_metric_only flag for early_stopping function. * upcate comment * Revert "upcate comment" This reverts commit 1e75a1a415cc16cfbe795181e148ebfe91469be4. * added test * fixed docstring * cut comment and save one line * document new feature
-
- 04 Feb, 2019 1 commit
-
-
Guolin Ke authored
* convert datatable to numpy directly * fix according to comments * updated more docstrings * simplified isinstance check * Update compat.py
-
- 20 Dec, 2018 1 commit
-
-
Tsukasa OMOTO authored
* [python] fix creating train_set in fit https://github.com/Microsoft/LightGBM/blob/cc99f0d36ae929eb02b22a072823ab7c6d3155ab/python-package/lightgbm/sklearn.py#L519 may False even if valid_data[0] is X and valid_data[1] is y actually, because `check_X_y` might return copy of X and y. https://scikit-learn.org/0.20/modules/generated/sklearn.utils.check_X_y.html cf. https://github.com/Microsoft/LightGBM/pull/451 * use assertIn
-
- 25 Nov, 2018 1 commit
-
-
Nikita Titov authored
-
- 25 Oct, 2018 1 commit
-
-
Nikita Titov authored
-
- 16 Oct, 2018 1 commit
-
-
Nikita Titov authored
* added docstring style test and fixed errors in existing docstrings * hotfix * hotfix * fix grammar * hotfix
-
- 09 Oct, 2018 1 commit
-
-
Zafarullah Mahmood authored
* Fixed some typos in Python API Docs * FixTypo changed validation set -> sets
-
- 03 Oct, 2018 1 commit
-
-
Nikita Titov authored
-
- 28 Sep, 2018 1 commit
-
-
Nikita Titov authored
* fixed FutureWarning about cv default value * fixed according to new check_estimator API * fixed joblib warning
-
- 25 Sep, 2018 1 commit
-
-
Nikita Titov authored
* break extremely large lines in basic.py * break extremely large lines in callback.py * break extremely large lines in engine.py * break extremely large lines in sklearn.py * hotfixes
-
- 20 Sep, 2018 1 commit
-
-
Nikita Titov authored
-
- 19 Sep, 2018 1 commit
-
-
Chi Su authored
-
- 11 Sep, 2018 1 commit
-
-
dmitryikh authored
* warning on categorical feature with sparse values * [docs] categorical features note
-
- 06 Sep, 2018 1 commit
-
-
Nikita Titov authored
* pass params to _InnerPredictor in train and cv * fixed verbosity param description * treat silent param as Fatal log level * create Dataset in refit method silently * do not overwrite verbose param by silent argument
-
- 29 Aug, 2018 1 commit
-
-
Nikita Titov authored
-
- 27 Aug, 2018 1 commit
-
-
Nikita Titov authored
* bring consistency and clearness into early_stopping_rounds desc, metric desc and implementation * hotfix * hotfix * used NDCG as default metric for lambdarank task * fixed missed methods at ReadTheDocs and changed default eval_metric * leaved only unique metrics * fixed comment
-