- 10 Apr, 2020 1 commit
-
-
Nikita Titov authored
* Revert "specify the last supported version of scikit-learn (#2637)" This reverts commit d1002776. * ban scikit-learn 0.22.0 and skip broken test * fix updated test * fix lint test * Revert "fix lint test" This reverts commit 8b4db0805fe7a9e7f7eb0be3eac231f85026d196.
-
- 20 Mar, 2020 1 commit
-
-
Lukas Pfannschmidt authored
* Add handling of RandomState object, which is standard for sklearn methods. LightGBM expects an integer seed instead of an object. If the passed object is a RandomState, we choose a random integer based on its state to seed the underlying low-level code. While the chosen random integer is only in the range between 1 and 1e10, I expect it to have enough entropy (?) to not matter in practice. * Add RandomState object to random_state docstring. * remove blank line * Use property to handle setting random_state. This enables setting cloned estimators with the set_params method in sklearn. * Add docstring to attribute. * Fix and simplify docstring. * Add test case. * Use maximal int for datatype in seed derivation. * Replace random_state property with interfacing in fit method. Derives int seed for C code only when fitting and keeps RandomState object as param. * Adapt unit test to property change. * Extended test case and docstring Co-Authored-By:
Nikita Titov <nekit94-08@mail.ru> * Add more equality checks (feature importance, best iteration/score). * Add equality comparison of boosters represented by strings. Remove useless best_iteration_ comparison (we do not use early_stopping). * fix whitespace * Test if two subsequent fits produce different models * Apply suggestions from code review Co-Authored-By:
Nikita Titov <nekit94-08@mail.ru> Co-authored-by:
Nikita Titov <nekit94-08@mail.ru>
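The seed derivation described in this commit can be sketched roughly as follows. This is a minimal illustration, not the code from the commit: the helper name `derive_int_seed` is hypothetical, and using the largest int32 value as the upper bound is an assumption based on the "maximal int for datatype" bullet.

```python
import numpy as np


def derive_int_seed(random_state):
    """Return an integer seed for code that cannot accept a RandomState object.

    Hypothetical helper for illustration; the name is not taken from LightGBM.
    """
    if isinstance(random_state, np.random.RandomState):
        # Draw one integer from the generator. The int32 maximum as the
        # (exclusive) upper bound is an assumption of this sketch.
        return int(random_state.randint(np.iinfo(np.int32).max))
    # Plain integers and None pass through unchanged.
    return random_state


# Two generators in the same state yield the same derived seed.
seed_a = derive_int_seed(np.random.RandomState(42))
seed_b = derive_int_seed(np.random.RandomState(42))
```

Because each draw advances the generator's state, deriving a seed twice from the same RandomState object will usually give two different integers, which is consistent with the test bullet about two subsequent fits producing different models.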
-
- 26 Feb, 2020 1 commit
-
-
Guolin Ke authored
* code refactoring * update vcproject * refine * fix test * Update tests/python_package_test/test_sklearn.py * fix test
-
- 25 Feb, 2020 1 commit
-
-
Nikita Titov authored
* fixed pandas deprecation warning in tests * support old versions of pandas
-
- 03 Feb, 2020 1 commit
-
-
Nikita Titov authored
* Update test_engine.py * Update test_sklearn.py
-
- 02 Feb, 2020 1 commit
-
-
Guolin Ke authored
* commit * fix a bug * fix bug * reset to track changes * refine the auto choose logic * sort the time stats output * fix include * change multi_val_bin_sparse_threshold * add cmake * add _mm_malloc and _mm_free for cross platform * fix cmake bug * timer for split * try to fix cmake * fix tests * refactor DataPartition::Split * fix test * typo * formating * Revert "formating" This reverts commit 5b8de4f7fb9d975ee23701d276a66d40ee6d4222. * add document * [R-package] Added tests on use of force_col_wise and force_row_wise in training (#2719) * naming * fix gpu code * Update include/LightGBM/bin.h Co-Authored-By:
James Lamb <jaylamb20@gmail.com> * Update src/treelearner/ocl/histogram16.cl * test: swap compilers for CI * fix omp * not avx2 * no aligned for feature histogram * Revert "refactor DataPartition::Split" This reverts commit 256e6d9641ade966a1f54da1752e998a1149b6f8. * slightly refactor data partition * reduce the memory cost Co-authored-by:
James Lamb <jaylamb20@gmail.com> Co-authored-by:
Nikita Titov <nekit94-08@mail.ru>
-
- 30 Jan, 2020 1 commit
-
-
sbruch authored
* Implementation of XE_NDCG loss function for ranking. * Add citation * Check in example usage for xe_ndcg loss. * Seed the generator when a seed is provided in the config. Add unit-tests for xe_ndcg * Update documentation * Fix indentation * Address issues raised by reviewers. * Clean up include statements. * Fix issues raised by reviewers. * Regenerate parameters.rst * Add a note to explain that reproducing xe_ndcg results requires num_threads to be one. * Introduce objective_seed and use that in rank_xendcg instead of directly using seed * Change default value of objective_seed
-
- 09 Dec, 2019 1 commit
-
-
Nikita Titov authored
* clean code * clean code * do not modify args in fit function * added test
-
- 05 Dec, 2019 2 commits
-
-
aaiyer authored
* allow python sklearn interface's fit() to pass init_model to train() * Fix whitespace issues, and change ordering of parameters to be backward compatible * Formatting fixes * allow python sklearn interface's fit() to pass init_model to train() * Fix whitespace issues, and change ordering of parameters to be backward compatible * Formatting fixes * Recognize LGBModel objects for init_model * simplified condition * updated docstring * added test
-
Nikita Titov authored
-
- 27 Oct, 2019 2 commits
-
-
Nikita Titov authored
* speed up tests * more updates * fixed pylint * updated tests * Update test_sklearn.py * test that indices are sorted internally
-
Nikita Titov authored
-
- 15 Sep, 2019 1 commit
-
-
kenmatsu4 authored
* Bug fix for first_metric_only if the first metric is train metric. * Update bug fix for feval issue. * Disable feval for first_metric_only. * Additional test items. * Fix wrong assertEqual settings & formatting. * Change dataset of test. * Fix random seed for test. * Modify assumed test result due to different sklearn version between CI and local. * Remove f-string * Applying variable assumed test result for test. * Fix flake8 error. * Modifying in accordance with review comments. * Modifying for pylint. * simplified tests * Deleting error criteria `if eval_metric is None`. * Delete test items of classification. * Simplifying if condition. * Applying first_metric_only for sklearn wrapper. * Modifying test_sklearn for conforming to python 2.x * Fix flake8 error. * Additional fix for sklearn and add tests. * Bug fix and add test cases. * some refactor * fixed lint * fixed lint * Fix duplicated metrics scores to pass the test. * Fix the case first_metric_only not in params. * Converting metrics aliases. * Add comment. * Modify comment for pylint. * Modify comment for pydocstyle. * Using split test set for two eval_set. * added test case for metric aliases and length checks * minor style fixes * fixed rmse name and alias position * Fix the case metric=[] * Fix using env.model._train_data_name * Fix wrong test condition. * Move initial process to _init() func. * Modify test setting for test_sklearn & training data matching on callback.py * test_sklearn.py -> A test case for training is wrong, so fixed. * callback.py -> A condition of if statement for detecting test dataset is wrong, so fixed. * Support composite name metrics. * Remove metric check process & reduce redundant test cases. For #2273 fixed not only the order of metrics in cpp, removing metric check process at callback.py * Revised according to the matters pointed out on a review. * increased code readability * Fix the issue of order of validation set.
* Changing to OrderedDict from default dict for score result. * added missed check in cv function for first_metric_only and feval co-occurrence * keep order only for metrics but not for datasets in best_score * move OrderedDict initialization to init phase * fixed minor printing issues * move first metric detection to init phase and split can be performed without checks * split only once during callback * removed excess code * fixed typo in variable name and squashed ifs * use setdefault * hotfix * fixed failing test * refined tests * refined sklearn test * Making "feval" effective on early stopping. * allow feval and first_metric_only for cv * removed unused code * added tests for feval * fixed printing * add note about whitespaces in feval name * Modifying final iteration process in case valid set is training data.
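The idea behind first_metric_only can be illustrated with a small, self-contained sketch. It is not the callback code from this commit: the function `early_stop_round`, its arguments, and the "lower is better" convention are all assumptions made for the example. It also shows why metric order matters, which is why the commit keeps scores in an ordered mapping.

```python
def early_stop_round(history, stopping_rounds, first_metric_only=False):
    """Return the best iteration at which training would stop, or None.

    history: one dict per iteration mapping metric name -> score, in metric
    order. Lower scores are assumed to be better (an assumption of this sketch).
    """
    best = {}  # metric name -> (best score, iteration where it was reached)
    for i, scores in enumerate(history):
        names = list(scores)
        if first_metric_only:
            # Only the first metric is allowed to trigger early stopping.
            names = names[:1]
        for name in names:
            score = scores[name]
            if name not in best or score < best[name][0]:
                best[name] = (score, i)
            elif i - best[name][1] >= stopping_rounds:
                return best[name][1]
    return None


# "l2" keeps improving while "l1" gets worse after the first round.
history = [
    {"l2": 1.0, "l1": 0.5},
    {"l2": 0.9, "l1": 0.6},
    {"l2": 0.8, "l1": 0.7},
    {"l2": 0.7, "l1": 0.8},
]
stop_any = early_stop_round(history, stopping_rounds=2)
stop_first = early_stop_round(history, stopping_rounds=2, first_metric_only=True)
```

With all metrics considered, the stagnating second metric stops training and iteration 0 is reported as best; with only the first metric considered, training is never stopped on this history.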
-
- 03 Sep, 2019 1 commit
-
-
Nikita Titov authored
-
- 24 Aug, 2019 1 commit
-
-
Guolin Ke authored
* norm the lambda scores * change default to false * update doc * typo * Update Parameters.rst * Update config.h * Update test_sklearn.py * Update test_sklearn.py * Update test_sklearn.py * Update test_sklearn.py * Update test_sklearn.py * Update rank_objective.hpp * Update Parameters.rst * Update config.h * Update test_sklearn.py * Update test_sklearn.py * Update test_sklearn.py
-
- 17 Aug, 2019 1 commit
-
-
sbruch authored
* Lambdas and hessians need to factor sigmoid_ into the computation. Additionally, the sigmoid function has an arbitrary factor of 2 in the exponent; it is not just non-standard but the gradients are not computed correctly anyway. * Update unit test * Also remove a heuristic that normalizes the gradient by the difference in scores. * Also fix unit test after removing the heuristic
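To make the point about the sigmoid factor concrete, here is a sketch of the gradient and Hessian of a plain pairwise logistic loss with a scale parameter. It is an illustration under stated assumptions, not the code in rank_objective.hpp: the function name is hypothetical, the parameter `sigmoid` stands in for `sigmoid_`, and the NDCG-based pair weighting used by lambdarank is deliberately left out.

```python
import math


def pairwise_logistic_grad_hess(score_hi, score_lo, sigmoid=1.0):
    """Gradient and Hessian of log(1 + exp(-sigmoid * (score_hi - score_lo)))
    with respect to score_hi, where score_hi belongs to the more relevant item.
    """
    delta = score_hi - score_lo
    # p is the modelled probability that the pair is ordered incorrectly.
    p = 1.0 / (1.0 + math.exp(sigmoid * delta))
    grad = -sigmoid * p                      # first derivative carries one factor of sigmoid
    hess = sigmoid * sigmoid * p * (1.0 - p)  # second derivative carries sigmoid squared
    return grad, hess


g1, h1 = pairwise_logistic_grad_hess(0.0, 0.0, sigmoid=1.0)
g2, h2 = pairwise_logistic_grad_hess(0.0, 0.0, sigmoid=2.0)
```

The exponent uses `sigmoid` exactly once, with no additional constant, so the gradient scales linearly and the Hessian quadratically with that parameter.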
-
- 13 Aug, 2019 1 commit
-
-
Nikita Titov authored
* reworked pandas dtypes mapper * added tests * added sparsity support for new version of pandas * fixed tests for old pandas * check pd.Series for bad dtypes as well * enhanced tests * fixed pylint
-
- 20 Jun, 2019 1 commit
-
-
Nikita Titov authored
* Update test.py * Update test_consistency.py * Update test_basic.py * Update test_sklearn.py * Update test_sklearn.py * Update test_engine.py * more replacements
-
- 04 Jun, 2019 1 commit
-
-
Nikita Titov authored
* fixed class_weight * fixed lint * added test * hotfix
-
- 27 May, 2019 1 commit
-
-
Nikita Titov authored
[python] fixed picklability of sklearn models with custom obj and updated docstrings for custom obj (#2191) * refactored joblib test * fixed picklability of sklearn models with custom obj and updated docstrings for custom obj * pickled model should be able to predict without refitting
-
- 08 May, 2019 1 commit
-
-
Guolin Ke authored
* fix travis badge * updated GitHub Microsoft URL
-
- 22 Apr, 2019 1 commit
-
-
Nikita Titov authored
* disable default pandas cat features if cat features were explicitly provided * added assertion for cat features
-
- 19 Apr, 2019 1 commit
-
-
Nikita Titov authored
* ignore pandas ordered categorical columns by default * fix tests * fix tests * added comments
-
- 02 Feb, 2019 1 commit
-
-
Nikita Titov authored
-
- 30 Jan, 2019 1 commit
-
-
Guolin Ke authored
* always save the score of the first round in early stopping fix #1971 * avoid using std::log on non-positive numbers * remove unnecessary changes * add tests * Update test_sklearn.py * enhanced tests
-
- 27 Jan, 2019 1 commit
-
-
Nikita Titov authored
[tests][python] added tests for metrics' behavior and fixed case for multiclass task with custom objective (#1954) * added metrics test for standard interface * simplified code * less trees * less trees * use dummy custom objective and metric * added tests for multiclass metrics aliases * fixed bug in case of custom obj and num_class > 1 * added metric test for sklearn wrapper
-
- 20 Dec, 2018 1 commit
-
-
Tsukasa OMOTO authored
* [python] fix creating train_set in fit https://github.com/Microsoft/LightGBM/blob/cc99f0d36ae929eb02b22a072823ab7c6d3155ab/python-package/lightgbm/sklearn.py#L519 may be False even if valid_data[0] is X and valid_data[1] is y actually, because `check_X_y` might return a copy of X and y. https://scikit-learn.org/0.20/modules/generated/sklearn.utils.check_X_y.html cf. https://github.com/Microsoft/LightGBM/pull/451 * use assertIn
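The pitfall described here is an identity check performed after validation has replaced the inputs with copies. The sketch below reproduces it with plain lists; `validate` is a hypothetical stand-in for a validator that may return a copy, and the variable names are made up for the example.

```python
def validate(data):
    # Hypothetical stand-in for a validator that may return a copy of its input.
    return [list(row) for row in data]


X = [[1.0, 2.0], [3.0, 4.0]]
eval_X = X  # the caller passes the training data as the evaluation set

# Comparing before validation detects that both names refer to the same object.
same_before = eval_X is X

# After validation the training data is a new object, so the identity check fails
# even though the contents are equal.
X_checked = validate(X)
same_after = eval_X is X_checked
equal_after = eval_X == X_checked
```

One way to avoid the problem is to perform the identity comparison against the original, unvalidated inputs; whether the commit does exactly that is not stated in the message.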
-
- 11 Oct, 2018 1 commit
-
-
Nikita Titov authored
* break huge lines in sklearn tests * break huge line in plotting tests * break huge lines in basic tests * multiple enhancements in engine tests * multiple enhancements in sklearn tests * hotfixes * break huge lines and use with statement in C API test * make NDCG test more strict
-
- 10 Oct, 2018 1 commit
-
-
Guolin Ke authored
* fix ndcg consistency. * more stable sorts * Update gbdt_model_text.cpp * Update dataset.cpp * Update gbdt_model_text.cpp
-
- 28 Sep, 2018 1 commit
-
-
Nikita Titov authored
* fixed FutureWarning about cv default value * fixed according to new check_estimator API * fixed joblib warning
-
- 25 Jul, 2018 1 commit
-
-
Nikita Titov authored
* added new aliases for params * run helper/parameter_generator.py * removed useless test
-
- 11 Jul, 2018 1 commit
-
-
Misha Lisovyi authored
* ignore vim temporary files * add importance_type arg to sklearn API * update documentation info * remove a trailing space * remove trailing space (again :)) * add instructions on importance choices to sklearn API * drop mention of constructor in the feature type setting * adding a test for different feature types * remove trailing spaces, make shorter assert in feature importance type handling test * fixing style issue introduced with the new test
-
- 20 Jun, 2018 1 commit
-
-
Nikita Titov authored
* removed excess import * added tests for plotting trees in Python * refined module_INSTALLED mechanism * added note about that create_tree_digraph is better than plot_tree
-
- 09 Jun, 2018 1 commit
-
-
Nikita Titov authored
* fixed grammar * fixed params description in graph plotting functions * clarified types of attributes in their descriptions * increased readability of graphs by adding spaces * added precision parameter to plot tree functions
-
- 10 May, 2018 1 commit
-
-
Nikita Titov authored
* fixed docs * reworked predict method of sklearn wrapper * fixed encapsulation * added test * fixed consistency between docstring and params docs * fixed verbose * replaced predict_proba with predict in test * fixed verbose again * fixed fraction params descriptions * added description of skip_drop and drop_rate constraints * fixed subsample_freq consistency with C++ default value * fixed nice look of params list * made force splits json file example clickable * fixed nice look of metrics list and added comma * reduced warning in test about same param specified twice * replaced pred_parameter with **kwargs in predict method * added test for **kwargs in predict method * fixed warnings * fixed pylint
-
- 19 Sep, 2017 1 commit
-
-
Nikita Titov authored
* added test for sklearn handle categorical features * use raw X, y in sklearn wrapper in case of pandas.DataFrame * fixed probs
-
- 08 Sep, 2017 1 commit
-
-
Nikita Titov authored
* disabled logs from compilers; fixed #874 * fixed safe clear_fplder * added windows folder to manifest.in * added windows folder to build * added library path * added compilation with MSBuild from .sln-file * fixed unknown PlatformToolset returns exitcode 0 * hotfix * updated Readme * removed return * added installation with mingw test to appveyor * let's test appveyor with both VS 2015 and VS 2017; but MinGW isn't installed on VS 2017 image * fixed built-in name 'file' * simplified appveyor * removed excess data_files * fixed unreadable paths * separated exceptions for cmake and mingw * refactored silent_call * don't create artifacts with VS 2015 and mingw * be more precise with python versioning in Travis * removed unnecessary if statement * added classifiers for PyPI and python versions badge * changed python version in travis * added support of scikit-learn 0.18.x * added more python versions to Travis * added more python versions to Appveyor * reduced number of tests in Travis * Travis trick is not needed anymore * attempt to fix according to https://github.com/Microsoft/LightGBM/pull/880#discussion_r137438856
-
- 05 Sep, 2017 2 commits
-
-
Nikita Titov authored
* fixed sklearn test on python 2.7 * commit to show that problem has been solved * come back to python 3.6 * removed warnings check
-
Nikita Titov authored
* improved sklearn interface; added sklearns' tests * moved best_score into the if statement * improved docstrings; simplified LGBMCheckConsistentLength * fixed typo * pylint * updated example * fixed Ranker interface * added missed boosting_type * fixed more comfortable autocomplete without unused objects * removed check for None of eval_at * fixed according to review * fixed typo * added description of fit return type * dictionary->dict for short * markdown cleanup
-
- 23 Aug, 2017 1 commit
-
-
Nikita Titov authored
* updated scikit-learn interface * fixed better description * updated set_params() * removed backward compatibility * removed excess lines * replaced pop with setdefault * added deprecated warnings * added tests
-