Commits · 11b77471cf02c17e1cd43cc8f4b2ddf71be4c2b2 · tianlh / LightGBM-DCU

05 May, 2021 1 commit
- [python] added f-strings to python-package/lightgbm/engine.py (#4258) · 11b77471
  Kantajit Shaw authored May 06, 2021
  
  11b77471
04 May, 2021 1 commit

Andrew Ziem authored May 04, 2021



* Correct spelling

Most changes were in comments, and there were a few changes to literals for log output.

There were no changes to variable names, function names, IDs, or functionality.

* Clarify a phrase in a comment
Co-authored-by: James Lamb <jaylamb20@gmail.com>

* Clarify a phrase in a comment
Co-authored-by: James Lamb <jaylamb20@gmail.com>

* Clarify a phrase in a comment
Co-authored-by: James Lamb <jaylamb20@gmail.com>

* Correct spelling

Most are code comments, but one case is a literal in a logging message.

There are a few grammar fixes too.
Co-authored-by: James Lamb <jaylamb20@gmail.com>

e79716e0

02 May, 2021 1 commit
- [docs][python] update some docs related to custom objective (#4245) · 1a367c65
  Nikita Titov authored May 02, 2021
  
  1a367c65
30 Apr, 2021 1 commit
- [docs][python][scikit-learn] added note for LGBMRanker (#4243) · 023dc53d
  Nikita Titov authored Apr 30, 2021
  
  023dc53d
26 Apr, 2021 1 commit
- [python][scikit-learn] change MRO (#3192) · b6c71e5e
  Nikita Titov authored Apr 26, 2021
```
* chanche MRO

* fix MRO resolution
```
  b6c71e5e
21 Apr, 2021 1 commit

[dask] Fix typo mentioned in 4101 (#4214) · 887ef4cc

Frank Fineis authored Apr 21, 2021



* fix typo in dask _train as mentioned in 4101

* Update python-package/lightgbm/dask.py
Co-authored-by: James Lamb <jaylamb20@gmail.com>

887ef4cc

19 Apr, 2021 1 commit

[python] Migrate to f-strings in python-package/lightgbm/sklearn.py (#4188) · 8e126c80

Akshita Dixit authored Apr 19, 2021



* Migrate to f-strings in python-package/lightgbm/sklearn.py

* Apply suggestions from code review
Co-authored-by: James Lamb <jaylamb20@gmail.com>

* Update python-package/lightgbm/sklearn.py
Co-authored-by: James Lamb <jaylamb20@gmail.com>

* Apply suggestions from code review
Co-authored-by: Nikita Titov <nekit94-08@mail.ru>

* Add suggestions from code review

* resolve conflicts

* Apply suggestions from code review
Co-authored-by: James Lamb <jaylamb20@gmail.com>

* Update sklearn.py
Co-authored-by: James Lamb <jaylamb20@gmail.com>
Co-authored-by: Nikita Titov <nekit94-08@mail.ru>

8e126c80

10 Apr, 2021 1 commit

[python-package] Add type hints to the callback file (#4093) · 55a31bfe

Deddy Jobson authored Apr 10, 2021



* added type hints; implemented one workaround

* resolving some linting errors

* Added doc strings

* fixed more linting errors

* Made documentation more imperative.

* removed one type hint

* more specific type hinting
Co-authored-by: James Lamb <jaylamb20@gmail.com>

* added import

* Apply suggestions from code review
Co-authored-by: James Lamb <jaylamb20@gmail.com>

* made a class and function private

* Apply suggestions from code review

Make the documentation clearer.
Co-authored-by: James Lamb <jaylamb20@gmail.com>

* linting error fix

* more linting errors

* removing the decorator

* ignore mypy function attribute errors

* fix lints
Co-authored-by: James Lamb <jaylamb20@gmail.com>

55a31bfe

01 Apr, 2021 1 commit

[tests][dask] Add voting_parallel algorithm in tests (fixes #3834) (#4088) · d517ba12

jmoralez authored Apr 01, 2021

* include voting_parallel tree_learner in test_regressor, test_classifier and test_ranker

* remove test for warnings and test for error when using feature_parallel

* use real names for tree_learner intest and include test for aliases. use the error message in the test for error in feature parallel

* split all tests with rf in test_classifier

* remove task parametrization for tree_learner aliases test. smaller input data from feature_parallel error

* define task for tree_learner aliases

d517ba12

31 Mar, 2021 1 commit

[dask] make random port search more resilient to random collisions (fixes #4057) (#4133) · 1ce4b22b

James Lamb authored Mar 31, 2021

* [dask] make random port search more resilient to random collisions

* linting

* more reliable ports check

* address review comments

* add error message

1ce4b22b

30 Mar, 2021 1 commit
- [docs] fix param name typo in comments (#4139) · 841943f2
  Nikita Titov authored Mar 30, 2021
  
  841943f2
29 Mar, 2021 2 commits
- [ci] use f-strings in libpath.py (#4137) · e4cf2e4f
  Nikita Titov authored Mar 30, 2021
  
  e4cf2e4f
- [dask] run one training task on each worker (#4132) · 337103d3
  James Lamb authored Mar 29, 2021
```
* [dask] run one training task on each worker

* add comment on pure

* missing ticks

* empty commit
```
  337103d3
27 Mar, 2021 1 commit

[dask] Include support for raw_score in predict (fixes #3793) (#4024) · fe1b80a5

jmoralez authored Mar 27, 2021

* include test for prediction with raw_score

* close client

* initial comments

* update data creation and include ranking task

* linting

* update _create_data

* compare unique raw_predictions with values in leaves_df

fe1b80a5

16 Mar, 2021 1 commit
- [dask] remove unused imports from typing (#4079) · e9f50a59
  James Lamb authored Mar 16, 2021
  
  e9f50a59
15 Mar, 2021 2 commits
- [python-package] Some mypy fixes (#3916) · 296b2a26
  Alberto Ferreira authored Mar 15, 2021
```
* Some mypy fixes

* address James' comments

* Re-introduce pass in empty classes

* Update compat.py

Remove extra lines
```
  296b2a26
- [python-package] add type hints on Booster.set_network() (#4068) · dc1bc23a
  James Lamb authored Mar 15, 2021
```
* [python-package] add type hints on Booster.set_network()

* change behavior
```
  dc1bc23a
14 Mar, 2021 1 commit
- added type hint (#4070) · 96728a04
  Deddy Jobson authored Mar 15, 2021
  
  96728a04
10 Mar, 2021 1 commit

[dask] raise more informative error for duplicates in 'machines' (fixes #4057) (#4059) · 296397df

James Lamb authored Mar 10, 2021

* [dask] raise more informative error for duplicates in 'machines'

* uncomment

* avoid test failure

* Revert "avoid test failure"

This reverts commit 9442bdf00f193a19a923dc0deb46b7822cb6f601.

296397df

04 Mar, 2021 1 commit

[dask] Include support for init_score (#3950) · 37e98782

jmoralez authored Mar 04, 2021

* include support for init_score

* use dataframe from init_score and test difference with and without init_score in local model

* revert refactoring

* initial docs. test between distributed models with and without init_score

* remove ranker from tests

* test value for root node and change docs

* comma

* re-include parametrize

* fix incorrect merge

* use single init_score and the booster_ attribute

* use np.float64 instead of float

37e98782

24 Feb, 2021 3 commits

[dask][python-package] include support for column array as label (#3943) · 5dacd603

jmoralez authored Feb 24, 2021

* include support for column array as label

* remove nested ifs

* fix linting errors

* include tests for sklearn regressors

* include docstring for numpy_1d_array_to_dtype

* include . at end of docstring

* remove pandas import and test for regression, classification and ranking

* check predictions of sklearn models as well

* test training only in dask. drop pandas series tests

* use PANDAS_INSTALLED and pd_Series

* inline imports

* use col array in fit for test_dask

* include review comments

5dacd603

[dask] use random ports in network setup (#3823) · 0e576575

jmoralez authored Feb 23, 2021

* use socket.bind with port 0 and client.run to find random open ports

* include test for found ports

* find random open ports as default

* parametrize local_listen_port. type hint to _find_random_open_port. fid open ports only on workers with data.

* make indentation consistent and pass list of workers to client.run

* remove socket import

* change random port implementation

* fix test

0e576575

[dask] Reuse addresses saved in variable (#4016) · 7777852a
Nikita Titov authored Feb 24, 2021

7777852a

23 Feb, 2021 1 commit

[dask] allow tight control over ports (#3994) · 1f73f559

James Lamb authored Feb 23, 2021



* [dask] allow tight control over ports

* getting there, getting there

* fix params maybe

* fixing params

* remove unnecessary stuff

* fix tests

* fixes

* some minor changes

* fix flaky test

* linting

* more linting

* clarify parameter description

* add warning

* revert docs change

* Update python-package/lightgbm/dask.py

* Apply suggestions from code review
Co-authored-by: Nikita Titov <nekit94-08@mail.ru>

* trying to fix stuff

* this is working

* update tests

* Apply suggestions from code review
Co-authored-by: Nikita Titov <nekit94-08@mail.ru>

* indent
Co-authored-by: Nikita Titov <nekit94-08@mail.ru>

1f73f559

20 Feb, 2021 1 commit
- [dask] use more specific method names on _DaskLGBMModel (#4004) · 646267d2
  James Lamb authored Feb 20, 2021
  
  646267d2
19 Feb, 2021 1 commit
- [docs] Change some 'parallel learning' references to 'distributed learning' (#4000) · 7880b79f
  James Lamb authored Feb 19, 2021
```
* [docs] Change some 'parallel learning' references to 'distributed learning'

* found a few more

* one more reference
```
  7880b79f
17 Feb, 2021 1 commit

Optimize array-from-ctypes in basic.py (#3927) · de8c6105

Alex Ford authored Feb 16, 2021

Approximately %80 of runtime when loading "low column count, high row
count" DataFrames into Datasets is consumed in `np.fromiter`, called
as part of the `Dataset.get_field` method.

This is particularly pernicious hotspot, as unlike other ctypes-based
methods this is a hot loop over a python iterator loop and causes
significant GIL-contention in multi-threaded applications.

Replace `np.fromiter` with a direct call to `np.ctypeslib.as_array`,
which allows a single-shot `copy` of the underlying array.

This reduces the load time of a ~35 million row categorical dataframe
with 1 column from ~5 seconds to ~1 second, and allows multi-threaded
execution.

de8c6105

16 Feb, 2021 6 commits
- [ci][python] run isort in CI linting job (#3990) · d6ebd063
  Nikita Titov authored Feb 16, 2021
```
* run isort in CI linting job

* workaround conda compatibility issues
```
  d6ebd063
- [ci][python] apply isort to python-package/lightgbm/compat.py #3958 (#3968) · 4ae59494
  Zhuyi Xue authored Feb 16, 2021
  
  4ae59494
- [ci][python] apply isort to python-package/lightgbm/engine.py #3958 (#3970) · 6110bd15
  Zhuyi Xue authored Feb 16, 2021
  
  6110bd15
- [ci][python] apply isort to python-package/lightgbm/basic.py #3958 (#3967) · af0c2260
  Zhuyi Xue authored Feb 15, 2021
  
  af0c2260
- [ci][python] apply isort to python-package/lightgbm/__init__.py #3958 (#3966) · 9b64b9c9
  Zhuyi Xue authored Feb 15, 2021
  
  9b64b9c9
- [ci][python] apply isort to python-package/lightgbm/sklearn.py #3958 (#3973) · acb67741
  Zhuyi Xue authored Feb 15, 2021
  
  acb67741
15 Feb, 2021 5 commits
- [ci][python] apply isort to python-package/lightgbm/plotting.py #3958 (#3972) · e9ea85bd
  Zhuyi Xue authored Feb 15, 2021
  
  e9ea85bd
- [ci][python] apply isort to python-package/lightgbm/libpath.py #3958 (#3971) · 8d5c0343
  Zhuyi Xue authored Feb 15, 2021
  
  8d5c0343
- reuse len(parts) as n_parts (#3985) · d74b1be9
  Frank Fineis authored Feb 15, 2021
  
  d74b1be9
- [ci][python] apply isort to python-package/lightgbm/dask.py #3958 (#3969) · 3b547001
  Zhuyi Xue authored Feb 14, 2021
  
  3b547001
- [python-package] fix some warnings from mypy (#3891) · eda1effb
  Tara Jawahar authored Feb 14, 2021
```
* minor mypy type errors fixed

* fix some warnings from mypy

* fix 3 mypy warnings

* selectively ignored some mypy errors

* minor mypy type errors fixed

* minor mypy type errors fixed

* minor mypy type errors fixed

* added import

* Update python-package/lightgbm/callback.py

* Apply suggestions from code review

* Apply suggestions from code review
Co-authored-by: James Lamb <jaylamb20@gmail.com>
Co-authored-by: James Lamb <jaylamb20@gmail.com>
```
  eda1effb
10 Feb, 2021 1 commit
- [docs][python] fix shape description of returned result for predict_proba (#3933) · 15916a95
  Nikita Titov authored Feb 10, 2021
```
* Update dask.py

* Update sklearn.py
```
  15916a95
09 Feb, 2021 1 commit

[dask] [docs] Fix inaccuracies in API docs for Dask module (fixes #3871) (#3930) · 06ed4337

James Lamb authored Feb 09, 2021



* got fit() working

* add predict()

* predict_proba()

* remove custom objective docs

* Apply suggestions from code review
Co-authored-by: Nikita Titov <nekit94-08@mail.ru>

* fix capitalization

* Update tests/python_package_test/test_dask.py
Co-authored-by: Nikita Titov <nekit94-08@mail.ru>
Co-authored-by: Nikita Titov <nekit94-08@mail.ru>

06ed4337