- 24 Feb, 2021 3 commits
-
-
jmoralez authored
* include support for column array as label * remove nested ifs * fix linting errors * include tests for sklearn regressors * include docstring for numpy_1d_array_to_dtype * include . at end of docstring * remove pandas import and test for regression, classification and ranking * check predictions of sklearn models as well * test training only in dask. drop pandas series tests * use PANDAS_INSTALLED and pd_Series * inline imports * use col array in fit for test_dask * include review comments
-
jmoralez authored
* use socket.bind with port 0 and client.run to find random open ports * include test for found ports * find random open ports as default * parametrize local_listen_port. type hint to _find_random_open_port. fid open ports only on workers with data. * make indentation consistent and pass list of workers to client.run * remove socket import * change random port implementation * fix test
-
Nikita Titov authored
-
- 23 Feb, 2021 1 commit
-
-
James Lamb authored
* [dask] allow tight control over ports * getting there, getting there * fix params maybe * fixing params * remove unnecessary stuff * fix tests * fixes * some minor changes * fix flaky test * linting * more linting * clarify parameter description * add warning * revert docs change * Update python-package/lightgbm/dask.py * Apply suggestions from code review Co-authored-by:
Nikita Titov <nekit94-08@mail.ru> * trying to fix stuff * this is working * update tests * Apply suggestions from code review Co-authored-by:
Nikita Titov <nekit94-08@mail.ru> * indent Co-authored-by:
Nikita Titov <nekit94-08@mail.ru>
-
- 20 Feb, 2021 1 commit
-
-
James Lamb authored
-
- 19 Feb, 2021 1 commit
-
-
James Lamb authored
* [docs] Change some 'parallel learning' references to 'distributed learning' * found a few more * one more reference
-
- 17 Feb, 2021 1 commit
-
-
Alex Ford authored
Approximately %80 of runtime when loading "low column count, high row count" DataFrames into Datasets is consumed in `np.fromiter`, called as part of the `Dataset.get_field` method. This is particularly pernicious hotspot, as unlike other ctypes-based methods this is a hot loop over a python iterator loop and causes significant GIL-contention in multi-threaded applications. Replace `np.fromiter` with a direct call to `np.ctypeslib.as_array`, which allows a single-shot `copy` of the underlying array. This reduces the load time of a ~35 million row categorical dataframe with 1 column from ~5 seconds to ~1 second, and allows multi-threaded execution.
-
- 16 Feb, 2021 6 commits
- 15 Feb, 2021 6 commits
-
-
Zhuyi Xue authored
-
Zhuyi Xue authored
-
Zhuyi Xue authored
-
Frank Fineis authored
-
Zhuyi Xue authored
-
Tara Jawahar authored
* minor mypy type errors fixed * fix some warnings from mypy * fix 3 mypy warnings * selectively ignored some mypy errors * minor mypy type errors fixed * minor mypy type errors fixed * minor mypy type errors fixed * added import * Update python-package/lightgbm/callback.py * Apply suggestions from code review * Apply suggestions from code review Co-authored-by:
James Lamb <jaylamb20@gmail.com> Co-authored-by:
James Lamb <jaylamb20@gmail.com>
-
- 10 Feb, 2021 1 commit
-
-
Nikita Titov authored
* Update dask.py * Update sklearn.py
-
- 09 Feb, 2021 1 commit
-
-
James Lamb authored
* got fit() working * add predict() * predict_proba() * remove custom objective docs * Apply suggestions from code review Co-authored-by:
Nikita Titov <nekit94-08@mail.ru> * fix capitalization * Update tests/python_package_test/test_dask.py Co-authored-by:
Nikita Titov <nekit94-08@mail.ru> Co-authored-by:
Nikita Titov <nekit94-08@mail.ru>
-
- 07 Feb, 2021 1 commit
-
-
James Lamb authored
* fix tests * fix tests * fix test comments * simplify tests * Apply suggestions from code review
-
- 06 Feb, 2021 1 commit
-
-
James Lamb authored
* add support for pandas categorical columns * remove commented code * quotes * syntax error * fix shape for ranker test * Apply suggestions from code review Co-authored-by:
Nikita Titov <nekit94-08@mail.ru> * Update tests/python_package_test/test_dask.py * trying * fix tests * remove unnecessary debugging stuff * skip accuracy checks on categorical * use category columns as categorical features Co-authored-by:
Nikita Titov <nekit94-08@mail.ru>
-
- 03 Feb, 2021 4 commits
-
-
James Lamb authored
-
Nikita Titov authored
* Update test_dask.py * Update dask.py * Update .vsts-ci.yml * Revert "Update .vsts-ci.yml" This reverts commit 98422be5b5095f0585de333b5b5545356776ef88.
-
Nikita Titov authored
* Update dask.py * Update dask.py
-
James Lamb authored
* starting on Dask client * more docs stuff * fix pickling * just copy docstrings * fit docs * switch test order * linting * use client kwarg * remove inner set_params() * add type hints * fix type hints * remove commented code * reorder * fix tests, add client_ property * Apply suggestions from code review Co-authored-by:
Nikita Titov <nekit94-08@mail.ru> * fix tests * linting * simplify Co-authored-by:
Nikita Titov <nekit94-08@mail.ru>
-
- 01 Feb, 2021 1 commit
-
-
Nikita Titov authored
-
- 31 Jan, 2021 3 commits
-
-
Nikita Titov authored
-
Nikita Titov authored
* start supporting Python 3.9 * Update .vsts-ci.yml * Update .vsts-ci.yml * Update .appveyor.yml * Update .vsts-ci.yml * Update .appveyor.yml * fixed conflicts * continue
-
Nikita Titov authored
* document CUDA version support * address review comments * collapse CUDA section in the guide * remove Clang support from CUDA docs as we have never tested it
-
- 29 Jan, 2021 1 commit
-
-
James Lamb authored
* add type hints in dask module * starting on asserts * remove unused code * add hints for dtypes * replace accidentally-removed docstrings * revert unrelated change * Update python-package/lightgbm/dask.py * empty commit * fix hints on group * capitalize array * hide hints in signatures * empty commit * sphinx version * Apply suggestions from code review Co-authored-by:
Nikita Titov <nekit94-08@mail.ru> * fix hint for MatrixLike * Update python-package/lightgbm/dask.py Co-authored-by:
Nikita Titov <nekit94-08@mail.ru> * Apply suggestions from code review Co-authored-by:
Nikita Titov <nekit94-08@mail.ru> * update docstring * empty commit Co-authored-by:
Nikita Titov <nekit94-08@mail.ru>
-
- 28 Jan, 2021 1 commit
-
-
Nikita Titov authored
-
- 27 Jan, 2021 1 commit
-
-
James Lamb authored
* [dask] add tests on warnings, fix incorrect variable in log * Update tests/python_package_test/test_dask.py Co-authored-by:
Nikita Titov <nekit94-08@mail.ru> Co-authored-by:
Nikita Titov <nekit94-08@mail.ru>
-
- 26 Jan, 2021 5 commits
-
-
Nikita Titov authored
* Update sklearn.py * Update dask.py
-
Nikita Titov authored
-
Nikita Titov authored
* fix Dask docstrings and mimic sklearn importing way * Update .vsts-ci.yml * revert CI checks * use import aliases for Dask classes * check Dask is installed in _predict() func * fix lint issues introduced during resolving merge conflicts * Update dask.py
-
James Lamb authored
-
James Lamb authored
* [dask] allow parameter aliases for tree_learner and local_listen_port (fixes #3671) * num_thread too * Apply suggestions from code review Co-authored-by:
Nikita Titov <nekit94-08@mail.ru> * empty commit * add _choose_param_value * revert param order change * Apply suggestions from code review Co-authored-by:
Nikita Titov <nekit94-08@mail.ru> * Update python-package/lightgbm/dask.py Co-authored-by:
Nikita Titov <nekit94-08@mail.ru> * just import deepcopy * remove machines aliases * Apply suggestions from code review Co-authored-by:
Nikita Titov <nekit94-08@mail.ru> Co-authored-by:
Nikita Titov <nekit94-08@mail.ru>
-
- 25 Jan, 2021 1 commit
-
-
Frank Fineis authored
* store data parts in dict instead of list * simplify weight/group parts dict assignment
-