- 06 Jul, 2021 1 commit
-
-
James Lamb authored
-
- 05 Jul, 2021 1 commit
-
-
Nikita Titov authored
* Update test_sklearn.py
* Update test_basic.py
* Update dask.py
* Update basic.py
* Update basic.py
* Update basic.py
* Update basic.py
* Update callback.py
-
- 04 Jul, 2021 3 commits
-
-
Nikita Titov authored
-
James Lamb authored
* [python-package] convert string concatenation to f-strings in test_engine.py (fixes #4136)
* Update tests/python_package_test/test_engine.py
Co-authored-by: Nikita Titov <nekit94-08@mail.ru>
* revert get_workflow_status changes
Co-authored-by: Nikita Titov <nekit94-08@mail.ru>
-
Nikita Titov authored
-
- 02 Jul, 2021 1 commit
-
-
Chen Yufei authored
* [python-package] create Dataset from sampled data.
* [python-package] create Dataset from List[Sequence].
  1. Use random access for data sampling
  2. Support read data from multiple input files
  3. Read data in batch so no need to hold all data in memory
* [python-package] example: create Dataset from multiple HDF5 file.
* fix: revert is_class implementation for seq
* fix: unwanted memory view reference for seq
* fix: seq is_class accepts sklearn matrices
* fix: requirements for example
* fix: pycode
* feat: print static code linting stage
* fix: linting: avoid shell str regex conversion
* code style: doc style
* code style: isort
* fix ci dependency: h5py on windows
* [py] remove rm files in test seq https://github.com/microsoft/LightGBM/pull/4089#discussion_r612929623
* docs(python): init_from_sample summary https://github.com/microsoft/LightGBM/pull/4089#discussion_r612903389
* remove dataset dump sample data debugging code.
* remove typo fix. Create separate PR for this.
* fix typo in src/c_api.cpp
Co-authored-by: James Lamb <jaylamb20@gmail.com>
* style(linting): py3 type hint for seq
* test(basic): os.path style path handling
* Revert "feat: print static code linting stage" This reverts commit 10bd79f7f8258bea8e61c3abb8c9c7e4456a916d.
* feat(python): sequence on validation set
* minor(python): comment
* minor(python): test option hint
* style(python): fix code linting
* style(python): add pydoc for ref_dataset
* doc(python): sequence
Co-authored-by: shiyu1994 <shiyu_k1994@qq.com>
* revert(python): sequence class abc
* chore(python): remove rm_files
* Remove useless static_assert.
* refactor: test_basic test for sequence.
* fix lint complaint.
* remove dataset._dump_text in sequence test.
* Fix reverting typo fix.
* Apply suggestions from code review
Co-authored-by: James Lamb <jaylamb20@gmail.com>
* Fix type hint, code and doc style.
* fix failing test_basic.
* Remove TODO about keep constant in sync with cpp.
* Install h5py only when running python-examples.
* Fix lint complaint.
* Apply suggestions from code review
Co-authored-by: James Lamb <jaylamb20@gmail.com>
* Doc fixes, remove unused params_str in __init_from_seqs.
* Apply suggestions from code review
Co-authored-by: Nikita Titov <nekit94-08@mail.ru>
* Remove unnecessary conda install in windows ci script.
* Keep param as example in dataset_from_multi_hdf5.py
* Add _get_sample_count function to remove code duplication.
* Use batch_size parameter in generate_hdf.
* Apply suggestions from code review
Co-authored-by: Nikita Titov <nekit94-08@mail.ru>
* Fix after applying suggestions.
* Fix test, check idx is instance of numbers.Integral.
* Update python-package/lightgbm/basic.py
Co-authored-by: Nikita Titov <nekit94-08@mail.ru>
* Expose Sequence class in Python-API doc.
* Handle Sequence object not having batch_size.
* Fix isort lint complaint.
* Apply suggestions from code review
Co-authored-by: Nikita Titov <nekit94-08@mail.ru>
* Update docstring to mention Sequence as data input.
* Remove get_one_line in test_basic.py
* Make Sequence an abstract class.
* Reduce number of tests for test_sequence.
* Add c_api: LGBM_SampleCount, fix potential bug in LGBMSampleIndices.
* empty commit to trigger ci
* Apply suggestions from code review
Co-authored-by: Nikita Titov <nekit94-08@mail.ru>
* Rename to LGBM_GetSampleCount, change LGBM_SampleIndices out_len to int32_t. Also rename total_nrow to num_total_row in c_api.h for consistency.
* Doc about Sequence in docs/Python-Intro.rst.
* Fix: basic.py change LGBM_SampleIndices out_len to int32.
* Add create_valid test case with Dataset from Sequence.
* Apply suggestions from code review
Co-authored-by: Nikita Titov <nekit94-08@mail.ru>
* Apply suggestions from code review
Co-authored-by: shiyu1994 <shiyu_k1994@qq.com>
* Remove no longer used DEFAULT_BIN_CONSTRUCT_SAMPLE_CNT.
* Update python-package/lightgbm/basic.py
Co-authored-by: Nikita Titov <nekit94-08@mail.ru>
Co-authored-by: Willian Zhang <willian@willian.email>
Co-authored-by: Willian Z <Willian@Willian-Zhang.com>
Co-authored-by: James Lamb <jaylamb20@gmail.com>
Co-authored-by: shiyu1994 <shiyu_k1994@qq.com>
Co-authored-by: Nikita Titov <nekit94-08@mail.ru>
-
- 28 Jun, 2021 1 commit
-
-
Frank Fineis authored
* es WiP, need to add eval_sample_weight and eval_group
* add weight, group to dask es. WiP.
* dask es reorg
* Update python-package/lightgbm/dask.py _train_part model.fit args to lines
Co-authored-by: James Lamb <jaylamb20@gmail.com>
* Update tests/python_package_test/test_dask.py _train_part model.fit args to lines, pt2
Co-authored-by: James Lamb <jaylamb20@gmail.com>
* Update python-package/lightgbm/dask.py _train_part model.fit args to lines pt3
Co-authored-by: James Lamb <jaylamb20@gmail.com>
* Update tests/python_package_test/test_dask.py dask_model.fit args to lines
Co-authored-by: James Lamb <jaylamb20@gmail.com>
* Update tests/python_package_test/test_dask.py
Co-authored-by: James Lamb <jaylamb20@gmail.com>
* Update python-package/lightgbm/dask.py use is instead of id()
Co-authored-by: James Lamb <jaylamb20@gmail.com>
* Update python-package/lightgbm/dask.py
Co-authored-by: James Lamb <jaylamb20@gmail.com>
* Update python-package/lightgbm/dask.py
Co-authored-by: James Lamb <jaylamb20@gmail.com>
* Update python-package/lightgbm/dask.py
Co-authored-by: James Lamb <jaylamb20@gmail.com>
* Update tests/python_package_test/test_dask.py
Co-authored-by: James Lamb <jaylamb20@gmail.com>
* Update tests/python_package_test/test_dask.py
Co-authored-by: James Lamb <jaylamb20@gmail.com>
* Update python-package/lightgbm/dask.py
Co-authored-by: James Lamb <jaylamb20@gmail.com>
* Update python-package/lightgbm/dask.py
Co-authored-by: James Lamb <jaylamb20@gmail.com>
* Update python-package/lightgbm/dask.py
Co-authored-by: James Lamb <jaylamb20@gmail.com>
* applying changes to eval_set PR WiP
* dask support for eval_names, eval_metric, eval_stopping_rounds
* add evals_result checks and other eval_set attribute-related test checks. need to merge master - WiP
* fix lint errors in test_dask.py
* drop group_shape from _lgbmmodel_doc_fit.format for non-rankers, add support for eval_at for dask ranker
* add eval_at to test_dask eval_set ranker tests
* add back group_shape to lgbmmodel docs, tighten tests
* drop random eval weights from early stopping, probably causing training to terminate too early
* add eval data templates to sklearn fit docs, add eval data docs to dask
* add n_features to _create_data, eval_set tests stop w/ desirable tree counts
* import alphabetically
* add back get_worker for eval_set error handling
* test_dask argmin typo
* push forgotten eval_names bugfix
* eval_stopping_rounds -> early_stopping_rounds, fix failing non-es test
* change default eval_at to tuple 1-5
* re-drop get_worker
* drop early stopping support from eval_set commits, move eval_set worker check prior to client.submit
* add eval_class_weight and eval_init_score to lightgbm/dask, WiP
* clean up eval_set tests, allow user to specify fewer eval_names, clswghts than eval_sets
* remove redundant backslash
* lint fixes
* fix eval_at, eval_metric duplication, let eval_at be Iterable not just Tuple
* use all data_outputs for test_eval_set tests
* undo newlines from first pr
* add custom_eval_metric test, correct issue with eval_at and metric names
* move _constant_metric outside of test
* dataset reference names instead of __strings__
* add padding to eval_set parts so that each part has the same len(eval_set)
* eval set code clean up
* revert n_evals to be max len eval_set across all parts on worker
* pylint errors in _DatasetNames
* more pylint fixes
* pylinting...
* add by pytest.mark, mistakenly deleted during merge conflict resolution
* address code review comments
* add _pad_eval_names to handle nondeterministic evals_result_ valid set names
* change not evaluated evals_result_ test criteria
* address fit eval docs issues, switch _DatasetNames to Enum
* Update python-package/lightgbm/dask.py
Co-authored-by: Nikita Titov <nekit94-08@mail.ru>
* Update python-package/lightgbm/dask.py
Co-authored-by: Nikita Titov <nekit94-08@mail.ru>
* Update python-package/lightgbm/dask.py
Co-authored-by: Nikita Titov <nekit94-08@mail.ru>
* Update python-package/lightgbm/dask.py
Co-authored-by: Nikita Titov <nekit94-08@mail.ru>
* Update python-package/lightgbm/dask.py
Co-authored-by: Nikita Titov <nekit94-08@mail.ru>
* Update python-package/lightgbm/dask.py
Co-authored-by: Nikita Titov <nekit94-08@mail.ru>
* Update python-package/lightgbm/dask.py
Co-authored-by: Nikita Titov <nekit94-08@mail.ru>
* Update python-package/lightgbm/dask.py
Co-authored-by: Nikita Titov <nekit94-08@mail.ru>
* Update python-package/lightgbm/dask.py
Co-authored-by: Nikita Titov <nekit94-08@mail.ru>
* Update python-package/lightgbm/dask.py
Co-authored-by: Nikita Titov <nekit94-08@mail.ru>
* Update python-package/lightgbm/dask.py
Co-authored-by: Nikita Titov <nekit94-08@mail.ru>
* Update python-package/lightgbm/dask.py
Co-authored-by: Nikita Titov <nekit94-08@mail.ru>
* Update python-package/lightgbm/dask.py
Co-authored-by: Nikita Titov <nekit94-08@mail.ru>
* Update python-package/lightgbm/dask.py
Co-authored-by: Nikita Titov <nekit94-08@mail.ru>
* Update python-package/lightgbm/dask.py
Co-authored-by: Nikita Titov <nekit94-08@mail.ru>
* Update python-package/lightgbm/dask.py
Co-authored-by: Nikita Titov <nekit94-08@mail.ru>
* Update python-package/lightgbm/dask.py
Co-authored-by: Nikita Titov <nekit94-08@mail.ru>
* Update python-package/lightgbm/dask.py
Co-authored-by: Nikita Titov <nekit94-08@mail.ru>
* Update python-package/lightgbm/dask.py
Co-authored-by: Nikita Titov <nekit94-08@mail.ru>
* Update python-package/lightgbm/dask.py
Co-authored-by: Nikita Titov <nekit94-08@mail.ru>
* Update python-package/lightgbm/dask.py
Co-authored-by: Nikita Titov <nekit94-08@mail.ru>
* Update python-package/lightgbm/dask.py
Co-authored-by: Nikita Titov <nekit94-08@mail.ru>
* update eval_metrics, eval_at dask fit docstr to match sklearn, make tests reflect that l2 (rmse), logloss in evals_result_ by default
* address eval_set dict keys naming in docstr and training eval_set naming issue
* in test_dask check for obj-default metric names in eval_results, remove check for training key
* lint fixes for _pad_eval_names
* remove unnecessary line break in _pad_eval_names docstr
* use Enum.member syntax not Enum.member.name
* remove str from supported eval_at types
* add whitespace and remove DaskDataframes mention from eval_ param docstrs in _train
* remove "of shape = [n_samples]" from group_shape docs
* add eval_at base_doc in DaskLGBMRanker.fit
* remove excess paren from eval_names docs in _train
* make requested changes to test_dask.py
* remove Optional() wrapper on eval_at
* add _lgbmmodel_doc_custom_eval_note to dask.py fit.__doc__
* fix ordering of .sklearn imports to attempt lint fix
* dask custom eval note to f-string pt1
Co-authored-by: Nikita Titov <nekit94-08@mail.ru>
* dask custom eval note to f-string pt 2
Co-authored-by: Nikita Titov <nekit94-08@mail.ru>
* dask custom eval note to f-string pt 3
Co-authored-by: Nikita Titov <nekit94-08@mail.ru>
Co-authored-by: James Lamb <jaylamb20@gmail.com>
Co-authored-by: Nikita Titov <nekit94-08@mail.ru>
-
- 27 Jun, 2021 2 commits
-
-
Nikita Titov authored
-
James Lamb authored
-
- 26 Jun, 2021 2 commits
-
-
James Lamb authored
* [dask] pass predict() kwargs through when input is a Dask Array
* add tests
* Apply suggestions from code review
Co-authored-by: Nikita Titov <nekit94-08@mail.ru>
* add prediction early stopping params
Co-authored-by: Nikita Titov <nekit94-08@mail.ru>
-
Nikita Titov authored
-
- 15 Jun, 2021 1 commit
-
-
Nikita Titov authored
* replace pytest.parametrize
* add informative message for assert
-
- 12 Jun, 2021 1 commit
-
-
Nikita Titov authored
-
- 09 Jun, 2021 2 commits
-
-
sayantan sadhu authored
[python] improving the syntax of the fstring in the file : tests/python_package_test/test_dask.py (#4358)
* updated the old syntax with fstrings
* Updated the strings with + catenation to fstrings
* Updated the strings with + catenation to fstrings
* Update tests/python_package_test/test_dask.py
Co-authored-by: James Lamb <jaylamb20@gmail.com>
-
Weston King-Leatham authored
-
- 07 Jun, 2021 1 commit
-
-
sayantan sadhu authored
-
- 03 Jun, 2021 1 commit
-
-
Belinda Trotta authored
* Add linear leaf models to json output
* Add closing bracket
* Move test into test_engine.py and add asserts
* Update tests/python_package_test/test_engine.py
Co-authored-by: Nikita Titov <nekit94-08@mail.ru>
* Update tests/python_package_test/test_engine.py
Co-authored-by: Nikita Titov <nekit94-08@mail.ru>
* Update tests/python_package_test/test_engine.py
Co-authored-by: Nikita Titov <nekit94-08@mail.ru>
Co-authored-by: Nikita Titov <nekit94-08@mail.ru>
-
- 21 May, 2021 3 commits
-
-
sayantan sadhu authored
[python] improving the syntax of the fstring in the file : tests/python_package_test/test_basic.py (#4312)
-
Nikita Titov authored
* run Dask tests on aarch64 architecture
* make random Dask test to fail
* Revert "make random Dask test to fail" This reverts commit c43c98507f818994bb08b4f7d289ecad3b3449eb.
* empty commit
* empty commit
* empty commit
* empty commit
Co-authored-by: James Lamb <jaylamb20@gmail.com>
-
Nikita Titov authored
* handle arbitrary length feature names in Python-package
* added tests
-
- 04 May, 2021 1 commit
-
-
Andrew Ziem authored
* Correct spelling
  Most changes were in comments, and there were a few changes to literals for log output. There were no changes to variable names, function names, IDs, or functionality.
* Clarify a phrase in a comment
Co-authored-by: James Lamb <jaylamb20@gmail.com>
* Clarify a phrase in a comment
Co-authored-by: James Lamb <jaylamb20@gmail.com>
* Clarify a phrase in a comment
Co-authored-by: James Lamb <jaylamb20@gmail.com>
* Correct spelling
  Most are code comments, but one case is a literal in a logging message. There are a few grammar fixes too.
Co-authored-by: James Lamb <jaylamb20@gmail.com>
-
- 28 Apr, 2021 1 commit
-
-
James Lamb authored
-
- 11 Apr, 2021 1 commit
-
-
Christoph Aymanns authored
* add test for interaction constraints and monotone constraints
* enforce interaction constraints in RecomputeBestSplitForLeaf
* code formatting
* code formatting
* move interaction constraint test to test_engine
* Apply suggestions from code review
Co-authored-by: Nikita Titov <nekit94-08@mail.ru>
Co-authored-by: Nikita Titov <nekit94-08@mail.ru>
-
- 05 Apr, 2021 1 commit
-
-
jmoralez authored
* replace client fixture with cluster fixture
* wait on persist before rebalance
-
- 01 Apr, 2021 2 commits
-
-
jmoralez authored
* include voting_parallel tree_learner in test_regressor, test_classifier and test_ranker
* remove test for warnings and test for error when using feature_parallel
* use real names for tree_learner in test and include test for aliases. use the error message in the test for error in feature parallel
* split all tests with rf in test_classifier
* remove task parametrization for tree_learner aliases test. smaller input data from feature_parallel error
* define task for tree_learner aliases
-
jmoralez authored
-
- 31 Mar, 2021 1 commit
-
-
James Lamb authored
* [dask] make random port search more resilient to random collisions
* linting
* more reliable ports check
* address review comments
* add error message
-
- 30 Mar, 2021 1 commit
-
-
jmoralez authored
* test all boosting types
* lint
* bring scores comparison back and set y as second argument in assert_eq
-
- 27 Mar, 2021 2 commits
-
-
Nikita Titov authored
* Update test_dask.py
* Update test_dask.py
-
jmoralez authored
* include test for prediction with raw_score
* close client
* initial comments
* update data creation and include ranking task
* linting
* update _create_data
* compare unique raw_predictions with values in leaves_df
-
- 26 Mar, 2021 1 commit
-
-
jmoralez authored
* make one categorical variable informative. increase n_samples. reduce n_features for regression
* adjust tolerances in checks
-
- 16 Mar, 2021 1 commit
-
-
Nikita Titov authored
* simplify Dask tests code
* enable CI
* disable CI
-
- 15 Mar, 2021 1 commit
-
-
James Lamb authored
-
- 10 Mar, 2021 2 commits
-
-
James Lamb authored
* [dask] raise more informative error for duplicates in 'machines'
* uncomment
* avoid test failure
* Revert "avoid test failure" This reverts commit 9442bdf00f193a19a923dc0deb46b7822cb6f601.
-
jmoralez authored
* include multiclass-classification task and task_to_model_factory dicts
* define centers coordinates. flatten init_scores within each partition for multiclass-classification
* include issue comment and fix linting error
-
- 04 Mar, 2021 1 commit
-
-
jmoralez authored
* include support for init_score
* use dataframe from init_score and test difference with and without init_score in local model
* revert refactoring
* initial docs. test between distributed models with and without init_score
* remove ranker from tests
* test value for root node and change docs
* comma
* re-include parametrize
* fix incorrect merge
* use single init_score and the booster_ attribute
* use np.float64 instead of float
-
- 02 Mar, 2021 1 commit
-
-
James Lamb authored
* [dask] [ci] add support for scikit-learn 0.24+ in tests (fixes #4031)
* Update tests/python_package_test/test_dask.py
Co-authored-by: Nikita Titov <nekit94-08@mail.ru>
* try upgrading miktexsetup
* they changed the executable name UGH
* more changes for executable name
* another path change
* changing package mirrors
* undo experiments
Co-authored-by: Nikita Titov <nekit94-08@mail.ru>
-
- 24 Feb, 2021 3 commits
-
-
Nikita Titov authored
* simplify fit calls in Dask tests
* Update .vsts-ci.yml
* Update .vsts-ci.yml
-
jmoralez authored
* include support for column array as label
* remove nested ifs
* fix linting errors
* include tests for sklearn regressors
* include docstring for numpy_1d_array_to_dtype
* include . at end of docstring
* remove pandas import and test for regression, classification and ranking
* check predictions of sklearn models as well
* test training only in dask. drop pandas series tests
* use PANDAS_INSTALLED and pd_Series
* inline imports
* use col array in fit for test_dask
* include review comments
-
Nikita Titov authored
* Update test_engine.py
* Update python_package.yml
* Update python_package.yml
* Update test_engine.py
* hotfix
-