Commits · 83627ff0d6baf49e3b18991f47b12d9fa5724cbc · tianlh / LightGBM-DCU

30 Jul, 2022 1 commit

reproducible parameter alias resolution for wrappers (fixes #5304) (#5338) · 83627ff0

José Morales authored Jul 30, 2022

* dump sorted parameter aliases

* update lgb.check.wrapper_param

* update _choose_param_value to look like lgb.check.wrapper_param

* apply suggestions from review

* reduce diff

* move DumpAliases to config

* remove unnecessary check

* restore parameter check

83627ff0

29 Jul, 2022 1 commit
- Use double precision in threaded calculation of linear tree coefficients (fixes #5226) (#5368) · 44d37184
  Belinda Trotta authored Jul 30, 2022
  
  44d37184
03 Jul, 2022 1 commit
- [python-package] add `validate_features` argument to `refit()` (#5331) · 25e32e94
  José Morales authored Jul 03, 2022
```
add validate_features to refit
```
  25e32e94
27 Jun, 2022 2 commits

[python-package] allow custom weighing in fobj for scikit-learn API (closes #5027) (#5211) · b6deb9a8
José Morales authored Jun 27, 2022
```
* allow custom weighing in sklearn api

* add suggestions from review
Co-authored-by: Nikita Titov <nekit94-08@mail.ru>
```
b6deb9a8

[python-package] check feature names in predict with dataframe (fixes #812) (#4909) · bdb02e05

José Morales authored Jun 27, 2022



* check feature names and order in predict with dataframe

* slice df in predict to remove the target

* scramble features

* handle int column names

* only change column order when needed

* include validate_features param in booster and sklearn estimators

* document validate_features argument

* use all_close in preds checks and check for assertion error to compare different arrays

* perform remapping and checks in cpp

* remove extra logs

* fixes

* revert cpp

* proposal

* remove extra arg

* lint

* restore _data_from_pandas arguments

* Apply suggestions from code review
Co-authored-by: Nikita Titov <nekit94-08@mail.ru>

* move data conversion to Predictor.predict

* use Vector2Ptr
Co-authored-by: Nikita Titov <nekit94-08@mail.ru>

bdb02e05

19 Jun, 2022 2 commits

[python] preserve None in `_choose_param_value()` (#5289) · 70654048

James Lamb authored Jun 19, 2022



* [python] preserve None in _choose_param_value()

* Update python-package/lightgbm/basic.py
Co-authored-by: Nikita Titov <nekit94-08@mail.ru>
Co-authored-by: Nikita Titov <nekit94-08@mail.ru>

70654048

[python-package] Use scikit-learn interpretation of negative `n_jobs` and... · f3ea1ad7

david-cortes authored Jun 19, 2022


[python-package] Use scikit-learn interpretation of negative `n_jobs` and change default to number of cores (#5105)

* use joblib formula for negative n_jobs

* correction for n_jobs calculation

* use more robust cpu_count from joblib

* change default n_jobs to number of cores

* fix detection of num_threads under parameters

* better handling of n_jobs at prediction time

* fix incorrect usage of list.pop

* correct pop/remove yet again

* Update python-package/lightgbm/sklearn.py
Co-authored-by: Nikita Titov <nekit94-08@mail.ru>

* Update tests/python_package_test/test_sklearn.py
Co-authored-by: Nikita Titov <nekit94-08@mail.ru>

* Update tests/python_package_test/test_sklearn.py
Co-authored-by: Nikita Titov <nekit94-08@mail.ru>

* add comments clarifying negative n_jobs

* fix CI (code taken from PR comment)

* change default to n_jobs=None in dask interface

* corrections for handling of n_jobs

* linter

* corrections for predict-time n_jobs

* linter

* add more comments about n_jobs values

* linter

* more corrections

* linter

* linter

* linter

* Update python-package/lightgbm/compat.py
Co-authored-by: Nikita Titov <nekit94-08@mail.ru>

* Update python-package/lightgbm/sklearn.py
Co-authored-by: Nikita Titov <nekit94-08@mail.ru>

* Update python-package/lightgbm/sklearn.py
Co-authored-by: Nikita Titov <nekit94-08@mail.ru>

* Update python-package/lightgbm/sklearn.py
Co-authored-by: Nikita Titov <nekit94-08@mail.ru>

* Update python-package/lightgbm/sklearn.py
Co-authored-by: Nikita Titov <nekit94-08@mail.ru>

* workaround for passing test about outputs with multiple threads

* Update tests/python_package_test/test_sklearn.py
Co-authored-by: Nikita Titov <nekit94-08@mail.ru>

* Update tests/python_package_test/test_sklearn.py
Co-authored-by: Nikita Titov <nekit94-08@mail.ru>
Co-authored-by: Nikita Titov <nekit94-08@mail.ru>

f3ea1ad7

12 Jun, 2022 1 commit
- [python-package] remove `Booster.set_attr()` and `Booster.attr()` (#5272) · 11110c54
  James Lamb authored Jun 12, 2022
  
  11110c54
08 Jun, 2022 1 commit

Clear split info buffer in cost efficient gradient boosting before every... · f1328d5c

shiyu1994 authored Jun 08, 2022

Clear split info buffer in cost efficient gradient boosting before every iteration (fix partially #3679) (#5164)

* clear split info buffer in cegb_ before every iteration

* check nullable of cegb_ in serial_tree_learner.cpp

* add a test case for checking the split buffer in CEGB

* swith to Threading::For instead of raw OpenMP

* apply review suggestions

* apply review comments

* remove device cpu

f1328d5c

05 Jun, 2022 2 commits
- [tests][python] Make test that checks original pandas data isn't modified more strict (#5267) · 27d9ad2e
  Nikita Titov authored Jun 06, 2022
```
* Update test_basic.py

* Address review comment
```
  27d9ad2e
- [python-package] make a shallow copy on dataframe rename (fixes #4596) (#5254) · 65b3db1c
  José Morales authored Jun 04, 2022
```
* dont copy dataframe on rename

* test with feature_name and 'auto'
```
  65b3db1c
24 May, 2022 1 commit
- [python] Fix training on subset constructed without params (#5213) · a4478f7e
  Nikita Titov authored May 24, 2022
```
* Update basic.py

* Update test_engine.py

* Add return type annotation
```
  a4478f7e
22 May, 2022 1 commit
- [python-package] make a shallow copy when replacing categorical features with... · c000b8cc
  José Morales authored May 21, 2022
```
[python-package] make a shallow copy when replacing categorical features with codes (fixes #4596) (#5225)
```
  c000b8cc
17 May, 2022 1 commit

[python-package][R-package] allow using feature names when retrieving number of bins (#5116) · 5b664b67

José Morales authored May 16, 2022

* allow using feature names when retrieving number of bins

* unname vector

* use default feature names when not defined

* lint

* apply suggestions

* remove extra comma

* add test with categorical feature

* make feature names sync more transparent

5b664b67

10 May, 2022 1 commit

Fix potential overflow "Multiplication result converted to larger type" (#5189) · 6de9bafa

Nikita Titov authored May 10, 2022



* Update dataset_loader.cpp

* Update gbdt.h

* Update regression_objective.hpp

* Update linker_topo.cpp

* Update xentropy_objective.hpp

* Update regression_objective.hpp

* investigate inf test failure

* avoid overflow in regression objective

* remove `test_inf_handle` test
Co-authored-by: Guolin Ke <guolin.ke@outlook.com>

6de9bafa

30 Apr, 2022 1 commit
- [c-api] check number of features when retrieving number of bins (#5183) · f53fa691
  José Morales authored Apr 30, 2022
```
* check number of features when retrieving number of bins

* check for negative values

* lint
```
  f53fa691
24 Apr, 2022 1 commit
- [tests] replace `fobj` with `custom objective` in test comments and make tests stricter (#5173) · 56ccea42
  Nikita Titov authored Apr 24, 2022
  
  56ccea42
22 Apr, 2022 1 commit

[python-package] remove 'fobj' in favor of passing custom objective function... · 416ecd5a

Miguel Trejo Marrufo authored Apr 21, 2022


[python-package] remove 'fobj' in favor of passing custom objective function in params (fixes #3244) (#5052)

* feat: support custom metrics in params

* feat: support objective in params

* test: custom objective and metric

* fix: imports are incorrectly sorted

* feat: convert eval metrics str and set to list

* feat: convert single callable eval_metric to list

* test: single callable objective in params
Signed-off-by: Miguel Trejo <armando.trejo.marrufo@gmail.com>

* feat: callable fobj in basic cv function
Signed-off-by: Miguel Trejo <armando.trejo.marrufo@gmail.com>

* test: cv support objective callable
Signed-off-by: Miguel Trejo <armando.trejo.marrufo@gmail.com>

* fix: assert in cv_res
Signed-off-by: Miguel Trejo <armando.trejo.marrufo@gmail.com>

* docs: objective callable in params
Signed-off-by: Miguel Trejo <armando.trejo.marrufo@gmail.com>

* recover test_boost_from_average_with_single_leaf_trees
Signed-off-by: Miguel Trejo <armando.trejo.marrufo@gmail.com>

* linters fail
Signed-off-by: Miguel Trejo <armando.trejo.marrufo@gmail.com>

* remove metrics helper functions
Signed-off-by: Miguel Trejo <armando.trejo.marrufo@gmail.com>

* feat: choose objective through _choose_param_values
Signed-off-by: Miguel Trejo <armando.trejo.marrufo@gmail.com>

* test: test objective through _choose_param_values
Signed-off-by: Miguel Trejo <armando.trejo.marrufo@gmail.com>

* test: test objective is callabe in train
Signed-off-by: Miguel Trejo <armando.trejo.marrufo@gmail.com>

* test: parametrize choose_param_value with objective aliases
Signed-off-by: Miguel Trejo <armando.trejo.marrufo@gmail.com>

* test: cv booster metric is none
Signed-off-by: Miguel Trejo <armando.trejo.marrufo@gmail.com>

* fix: if string and callable choose callable
Signed-off-by: Miguel Trejo <armando.trejo.marrufo@gmail.com>

* test train uses custom objective metrics
Signed-off-by: Miguel Trejo <armando.trejo.marrufo@gmail.com>

* test: cv uses custom objective metrics
Signed-off-by: Miguel Trejo <armando.trejo.marrufo@gmail.com>

* refactor: remove fobj parameter in train and cv
Signed-off-by: Miguel Trejo <armando.trejo.marrufo@gmail.com>

* refactor: objective through params in sklearn API
Signed-off-by: Miguel Trejo <armando.trejo.marrufo@gmail.com>

* custom objective function in advanced_example
Signed-off-by: Miguel Trejo <armando.trejo.marrufo@gmail.com>

* fix whitespackes lint

* objective is none not a particular case for predict method
Signed-off-by: Miguel Trejo <armando.trejo.marrufo@gmail.com>

* replace scipy.expit with custom implementation
Signed-off-by: Miguel Trejo <armando.trejo.marrufo@gmail.com>

* test: set num_boost_round value to 20
Signed-off-by: Miguel Trejo <armando.trejo.marrufo@gmail.com>

* fix: custom objective default_value is none
Signed-off-by: Miguel Trejo <armando.trejo.marrufo@gmail.com>

* refactor: remove self._fobj
Signed-off-by: Miguel Trejo <armando.trejo.marrufo@gmail.com>

* custom_objective default value is None
Signed-off-by: Miguel Trejo <armando.trejo.marrufo@gmail.com>

* refactor: variables name reference dummy_obj
Signed-off-by: Miguel Trejo <armando.trejo.marrufo@gmail.com>

* linter errors

* fix: process objective parameter when calling predict
Signed-off-by: Miguel Trejo <armando.trejo.marrufo@gmail.com>

* linter errors

* fix: objective is None during predict call
Signed-off-by: Miguel Trejo <armando.trejo.marrufo@gmail.com>

416ecd5a

31 Mar, 2022 1 commit
- [python] make `reset_parameter` callback pickleable (#5109) · 4ae3d138
  Nikita Titov authored Mar 31, 2022
  
  4ae3d138
30 Mar, 2022 2 commits
- [python] make `record_evaluation` callback pickleable (#5107) · 60244e4a
  Nikita Titov authored Mar 31, 2022
```
* make `log_evaluation` callback pickleable

* make callback tests stricter

* make `record_evaluation` callback picklable
```
  60244e4a
- [python] make `log_evaluation` callback pickleable (#5101) · 8b33e776
  Nikita Titov authored Mar 30, 2022
```
* make `log_evaluation` callback pickleable

* make callback tests stricter
```
  8b33e776
28 Mar, 2022 1 commit

[python] allow to register any custom logger (fixes #4783) (#4880) · 60e72d5f

RustingSword authored Mar 29, 2022



* [python] allow to register any custom logger

* allow customizable logging method name; add unit test

* [python] allow to register any custom logger

* allow customizable logging method name; add unit test

* update tests

* fix lint error

* remove unused method

* fix docstring style
Co-authored-by: gongxudong <gongxudong@kuaishou.com>

60e72d5f

23 Mar, 2022 1 commit

[CUDA] New CUDA version Part 1 (#4630) · 6b56a90c

shiyu1994 authored Mar 23, 2022



* new cuda framework

* add histogram construction kernel

* before removing multi-gpu

* new cuda framework

* tree learner cuda kernels

* single tree framework ready

* single tree training framework

* remove comments

* boosting with cuda

* optimize for best split find

* data split

* move boosting into cuda

* parallel synchronize best split point

* merge split data kernels

* before code refactor

* use tasks instead of features as units for split finding

* refactor cuda best split finder

* fix configuration error with small leaves in data split

* skip histogram construction of too small leaf

* skip split finding of invalid leaves

stop when no leaf to split

* support row wise with CUDA

* copy data for split by column

* copy data from host to CPU by column for data partition

* add synchronize best splits for one leaf from multiple blocks

* partition dense row data

* fix sync best split from task blocks

* add support for sparse row wise for CUDA

* remove useless code

* add l2 regression objective

* sparse multi value bin enabled for CUDA

* fix cuda ranking objective

* support for number of items <= 2048 per query

* speedup histogram construction by interleaving global memory access

* split optimization

* add cuda tree predictor

* remove comma

* refactor objective and score updater

* before use struct

* use structure for split information

* use structure for leaf splits

* return CUDASplitInfo directly after finding best split

* split with CUDATree directly

* use cuda row data in cuda histogram constructor

* clean src/treelearner/cuda

* gather shared cuda device functions

* put shared CUDA functions into header file

* change smaller leaf from <= back to < for consistent result with CPU

* add tree predictor

* remove useless cuda_tree_predictor

* predict on CUDA with pipeline

* add global sort algorithms

* add global argsort for queries with many items in ranking tasks

* remove limitation of maximum number of items per query in ranking

* add cuda metrics

* fix CUDA AUC

* remove debug code

* add regression metrics

* remove useless file

* don't use mask in shuffle reduce

* add more regression objectives

* fix cuda mape loss

add cuda xentropy loss

* use template for different versions of BitonicArgSortDevice

* add multiclass metrics

* add ndcg metric

* fix cross entropy objectives and metrics

* fix cross entropy and ndcg metrics

* add support for customized objective in CUDA

* complete multiclass ova for CUDA

* separate cuda tree learner

* use shuffle based prefix sum

* clean up cuda_algorithms.hpp

* add copy subset on CUDA

* add bagging for CUDA

* clean up code

* copy gradients from host to device

* support bagging without using subset

* add support of bagging with subset for CUDAColumnData

* add support of bagging with subset for dense CUDARowData

* refactor copy sparse subrow

* use copy subset for column subset

* add reset train data and reset config for CUDA tree learner

add deconstructors for cuda tree learner

* add USE_CUDA ifdef to cuda tree learner files

* check that dataset doesn't contain CUDA tree learner

* remove printf debug information

* use full new cuda tree learner only when using single GPU

* disable all CUDA code when using CPU version

* recover main.cpp

* add cpp files for multi value bins

* update LightGBM.vcxproj

* update LightGBM.vcxproj

fix lint errors

* fix lint errors

* fix lint errors

* update Makevars

fix lint errors

* fix the case with 0 feature and 0 bin

fix split finding for invalid leaves

create cuda column data when loaded from bin file

* fix lint errors

hide GetRowWiseData when cuda is not used

* recover default device type to cpu

* fix na_as_missing case

fix cuda feature meta information

* fix UpdateDataIndexToLeafIndexKernel

* create CUDA trees when needed in CUDADataPartition::UpdateTrainScore

* add refit by tree for cuda tree learner

* fix test_refit in test_engine.py

* create set of large bin partitions in CUDARowData

* add histogram construction for columns with a large number of bins

* add find best split for categorical features on CUDA

* add bitvectors for categorical split

* cuda data partition split for categorical features

* fix split tree with categorical feature

* fix categorical feature splits

* refactor cuda_data_partition.cu with multi-level templates

* refactor CUDABestSplitFinder by grouping task information into struct

* pre-allocate space for vector split_find_tasks_ in CUDABestSplitFinder

* fix misuse of reference

* remove useless changes

* add support for path smoothing

* virtual destructor for LightGBM::Tree

* fix overlapped cat threshold in best split infos

* reset histogram pointers in data partition and spllit finder in ResetConfig

* comment useless parameter

* fix reverse case when na is missing and default bin is zero

* fix mfb_is_na and mfb_is_zero and is_single_feature_column

* remove debug log

* fix cat_l2 when one-hot

fix gradient copy when data subset is used

* switch shared histogram size according to CUDA version

* gpu_use_dp=true when cuda test

* revert modification in config.h

* fix setting of gpu_use_dp=true in .ci/test.sh

* fix linter errors

* fix linter error

remove useless change

* recover main.cpp

* separate cuda_exp and cuda

* fix ci bash scripts

add description for cuda_exp

* add USE_CUDA_EXP flag

* switch off USE_CUDA_EXP

* revert changes in python-packages

* more careful separation for USE_CUDA_EXP

* fix CUDARowData::DivideCUDAFeatureGroups

fix set fields for cuda metadata

* revert config.h

* fix test settings for cuda experimental version

* skip some tests due to unsupported features or differences in implementation details for CUDA Experimental version

* fix lint issue by adding a blank line

* fix lint errors by resorting imports

* fix lint errors by resorting imports

* fix lint errors by resorting imports

* merge cuda.yml and cuda_exp.yml

* update python version in cuda.yml

* remove cuda_exp.yml

* remove unrelated changes

* fix compilation warnings

fix cuda exp ci task name

* recover task

* use multi-level template in histogram construction

check split only in debug mode

* ignore NVCC related lines in parameter_generator.py

* update job name for CUDA tests

* apply review suggestions

* Update .github/workflows/cuda.yml
Co-authored-by: Nikita Titov <nekit94-08@mail.ru>

* Update .github/workflows/cuda.yml
Co-authored-by: Nikita Titov <nekit94-08@mail.ru>

* update header

* remove useless TODOs

* remove [TODO(shiyu1994): constrain the split with min_data_in_group] and record in #5062

* #include <LightGBM/utils/log.h> for USE_CUDA_EXP only

* fix include order

* fix include order

* remove extra space

* address review comments

* add warning when cuda_exp is used together with deterministic

* add comment about gpu_use_dp in .ci/test.sh

* revert changing order of included headers
Co-authored-by: Yu Shi <shiyu1994@qq.com>
Co-authored-by: Nikita Titov <nekit94-08@mail.ru>

6b56a90c

22 Mar, 2022 1 commit
- clarify no-meaningful-features warning in Dataset construction (fixes #5081) (#5083) · b857ee10
  James Lamb authored Mar 22, 2022
```
* clarify no-meaningful-features warning in Dataset construction (fixes #5081)

* update tests
```
  b857ee10
17 Mar, 2022 1 commit

[python] make `early_stopping` callback pickleable (#5012) · f77e0adf

Antoni Baum authored Mar 17, 2022

* Turn `early_stopping` into a Callable class

* Fix

* Lint

* Remove print

* Fix order

* Revert "Lint"

This reverts commit 7ca8b557572446888cf793c0082d9a7efd1e29a7.

* Apply suggestion from code review

* Nit

* Lint

* Move callable class outside the func for pickling

* Move _pickle and _unpickle to tests utils

* Add early stopping callback picklability test

* Nit

* Fix

* Lint

* Improve type hint

* Lint

* Lint

* Add cloudpickle to test_windows

* Update tests/python_package_test/test_engine.py

* Fix

* Apply suggestions from code review

f77e0adf

15 Mar, 2022 1 commit

[c-api][python-package][R-package] expose feature num bin (#5048) · d10372e2

José Morales authored Mar 14, 2022



* expose FeatureNumBin in C api

* parametrize min_data_in_bin and add test with max_bin_by_feature

* include feature_num_bin in R package

* add suggestion from review
Co-authored-by: Nikita Titov <nekit94-08@mail.ru>

* update error message and lint

* lint

* add call method

* minor improvements in tests

* add suggestions from review

* lint

* rename argument to feature in python and r packages
Co-authored-by: Nikita Titov <nekit94-08@mail.ru>

d10372e2

12 Mar, 2022 1 commit
- [python-package] [R-package] propagate the best iteration of cvbooster into... · 9a4e7068
  José Morales authored Mar 12, 2022
```
[python-package] [R-package] propagate the best iteration of cvbooster into the individual boosters (#5066)
```
  9a4e7068
09 Mar, 2022 1 commit

[fix] fix duplicate added initial scores for single-leaf trees (#fixes #4708) · f6d654b7

shiyu1994 authored Mar 09, 2022



* fix duplicate added initial scores for single-leaf trees

* add test case

* Fix import in Python test

* commit python suggestions
Co-authored-by: Nikita Titov <nekit94-08@mail.ru>

f6d654b7

01 Mar, 2022 1 commit

[tests][python] move tests that use `train()` function defined in `engine.py`... · 01568cf5

Nikita Titov authored Mar 01, 2022

[tests][python] move tests that use `train()` function defined in `engine.py` from `test_basic.py` to `test_engine.py` (#5034)

* Update test_basic.py

* Update test_engine.py

* Update test_engine.py

01568cf5

24 Feb, 2022 1 commit

[python-package] add support for pandas nullable types (fixes #4173) (#4927) · f1856956

José Morales authored Feb 23, 2022



* map nullable dtypes to regular float dtypes

* cast x3 to float after introducing missing values

* add test for regular dtypes

* use .astype and then values. update nullable_dtypes test and include test for regular numpy dtypes

* more specific allowed dtypes. test no copy when single float dtype df

* use np.find_common_type. set np.float128 to None when it isn't supported

* set default as type(None)

* move tests that use lgb.train to test_engine

* include np.float32 when finding common dtype

* Apply suggestions from code review
Co-authored-by: Nikita Titov <nekit94-08@mail.ru>

* add linebreak
Co-authored-by: Nikita Titov <nekit94-08@mail.ru>

f1856956

23 Feb, 2022 1 commit

[python-package] use 2d collections for predictions, grads and hess in... · d670a4d6

José Morales authored Feb 22, 2022

[python-package] use 2d collections for predictions, grads and hess in multiclass custom objective (#4925)

* reshape predictions, grad and hess in multiclass custom objective

* add sklearn test. move custom obj to utils. docs for numpy

* use num_model_per_iteration to get num_classes

* update docs and dask multiclass custom objective test

* move reshaping to __inner_predict. add test for feval

* add missing note. remove extra line

d670a4d6

15 Feb, 2022 1 commit

[python-package] make record_evaluation compatible with cv (fixes #4943) (#4947) · 9fc348af

José Morales authored Feb 15, 2022

* make record_evaluation compatible with cv

* test multiple metrics in cv

* lint

* fix cv with train metric. save stdv as well

* always add dataset prefix to cv_agg

* remove unused function

9fc348af

12 Feb, 2022 1 commit

[tests][python] remove compatibility code for old versions in tests (#4978) · a3e073ad

Nikita Titov authored Feb 13, 2022

* Update test_dask.py

* Update test_engine.py

* Update test_sklearn.py

* Update test_sklearn.py

* Update test_sklearn.py

* Update test_sklearn.py

* Update test_sklearn.py

* Update test_sklearn.py

* Update test_engine.py

* Update test_sklearn.py

* Update test_sklearn.py

* Update test_sklearn.py

a3e073ad

22 Jan, 2022 1 commit

[python-package] support customizing Dataset creation in Booster.refit() (fixes #3038) (#4894) · e6a2f716

Miguel Trejo Marrufo authored Jan 22, 2022

* feat: refit additional kwargs for dataset and predict

* test: kwargs for refit method

* fix: __init__ got multiple values for argument

* fix: pycodestyle E302 error

* refactor: dataset_params to avoid breaking change

* refactor: expose all Dataset params in refit

* feat: dataset_params updates new_params

* fix: remove unnecessary params to test

* test: parameters input are the same

* docs: address StrikeRUS changes

* test: refit test changes in train dataset

* test: set init_score and decay_rate to zero

e6a2f716

17 Jan, 2022 1 commit

[dask] add support for custom objective functions (fixes #3934) (#4920) · a06fadfb

James Lamb authored Jan 17, 2022



* add test for custom objective with regressor

* add test for custom binary classification objective with classifier

* isort

* got tests working for multiclass

* update docs

* train deeper model for classifier

* Apply suggestions from code review
Co-authored-by: José Morales <jmoralz92@gmail.com>

* Apply suggestions from code review
Co-authored-by: Nikita Titov <nekit94-08@mail.ru>

* update multiclass tests

* Apply suggestions from code review
Co-authored-by: Nikita Titov <nekit94-08@mail.ru>

* fix multiclass probabilities

* linting
Co-authored-by: José Morales <jmoralz92@gmail.com>
Co-authored-by: Nikita Titov <nekit94-08@mail.ru>

a06fadfb

30 Dec, 2021 1 commit

[python] raise an informative error instead of segfaulting when custom... · af5b40e1

Yaqub Alwan authored Dec 30, 2021


[python] raise an informative error instead of segfaulting when custom objective produces incorrect output (#4815)

* fix for bad grads causing segfault

* adjust checking criteria to properly reflect reality of multi-class classifiers

* fix styling

* Line break before operator

* Update python-package/lightgbm/basic.py
Co-authored-by: Nikita Titov <nekit94-08@mail.ru>

* Update python-package/lightgbm/basic.py
Co-authored-by: Nikita Titov <nekit94-08@mail.ru>

* add a note to the C-API docs

* rearrange text s;ightly

* add some tests to python package

* Update include/LightGBM/c_api.h
Co-authored-by: Nikita Titov <nekit94-08@mail.ru>

* PR comments

* match argument is a regex and our expression has brackets ..

* rework tests

* isorting imports

* updating test to relfect that the python APi does not take pres/labels as a fobj function
Co-authored-by: Nikita Titov <nekit94-08@mail.ru>

af5b40e1

26 Dec, 2021 1 commit
- [python] remove `early_stopping_rounds` argument of `train()` and `cv()` functions (#4908) · ce486e5b
  Nikita Titov authored Dec 26, 2021
  
  ce486e5b
23 Dec, 2021 1 commit
- [python] remove `evals_result` argument of `train()` function (#4882) · e4c0ca5f
  Nikita Titov authored Dec 23, 2021
  
  e4c0ca5f
20 Dec, 2021 1 commit

[tests][python-package] change boston dataset to synthetic dataset in tests... · 8a34b1af

José Morales authored Dec 20, 2021

[tests][python-package] change boston dataset to synthetic dataset in tests that don't check score (#4895)

* change boston dataset to synthetic dataset in tests that don't evaluate score

* format imports

8a34b1af

18 Dec, 2021 1 commit
- [python] reset storage in record evaluation callback each time before starting training (#4885) · 8e729af3
  Nikita Titov authored Dec 18, 2021
```
* Update test_sklearn.py

* Update python_package.yml

* Update python_package.yml

* Update callback.py

* Update callback.py
```
  8e729af3