  1. 17 Mar, 2022 1 commit
    • [python] make `early_stopping` callback pickleable (#5012) · f77e0adf
      Antoni Baum authored
      * Turn `early_stopping` into a Callable class
      
      * Fix
      
      * Lint
      
      * Remove print
      
      * Fix order
      
      * Revert "Lint"
      
      This reverts commit 7ca8b557572446888cf793c0082d9a7efd1e29a7.
      
      * Apply suggestion from code review
      
      * Nit
      
      * Lint
      
      * Move callable class outside the func for pickling
      
      * Move _pickle and _unpickle to tests utils
      
      * Add early stopping callback picklability test
      
      * Nit
      
      * Fix
      
      * Lint
      
      * Improve type hint
      
      * Lint
      
      * Lint
      
      * Add cloudpickle to test_windows
      
      * Update tests/python_package_test/test_engine.py
      
      * Fix
      
      * Apply suggestions from code review
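The steps "Turn `early_stopping` into a Callable class" and "Move callable class outside the func for pickling" rest on a general property of Python's `pickle`: functions and classes are pickled by qualified name, so a function defined inside another function cannot be serialized, while an instance of a module-level class can. The sketch below illustrates that property only. `make_closure_callback`, `EarlyStopCallback`, and the dict-shaped `env` are hypothetical stand-ins, not LightGBM's actual implementation.

```python
import pickle


def make_closure_callback(stopping_rounds):
    """Closure style: the returned function is local to this call."""
    def _callback(env):
        return env["iteration"] >= stopping_rounds
    return _callback


class EarlyStopCallback:
    """Hypothetical module-level callable class; state lives on the instance."""

    def __init__(self, stopping_rounds):
        self.stopping_rounds = stopping_rounds

    def __call__(self, env):
        return env["iteration"] >= self.stopping_rounds


def is_pickleable(obj):
    """Return True if pickle.dumps succeeds on obj."""
    try:
        pickle.dumps(obj)
        return True
    except (pickle.PicklingError, AttributeError, TypeError):
        return False


closure_cb = make_closure_callback(5)
class_cb = EarlyStopCallback(5)

# The local function cannot be looked up by name, so pickling fails.
closure_ok = is_pickleable(closure_cb)

# The class instance round-trips and keeps its configuration.
restored = pickle.loads(pickle.dumps(class_cb))
```

Both callbacks behave the same when called; only the class-based one survives a `pickle` round trip, which matters when a callback has to be sent to another process.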
  2. 11 Mar, 2022 1 commit
  3. 19 Feb, 2022 1 commit
  4. 12 Feb, 2022 1 commit
  5. 11 Feb, 2022 1 commit
    • [ci] use conda-forge in Linux and macOS CI jobs (#4953) · 3500cb67
      James Lamb authored
      
      
      * [ci] use conda-forge in CI jobs (fixes #4948)
      
      * comment out more jobs
      
      * try reverting graphviz patch, running more cuda jobs
      
      * get graphviz from PyPI and try removing some patches for r-lintr
      
      * start running appveyor again
      
      * use conda-forge if using conda
      
      * fix commands
      
      * conda install graphviz
      
      * try newer openmp
      
      * pin below openmp 11.x
      
      * focus on gpu task
      
      * trying to narrow down error
      
      * maybe gcc11 is the issue
      
      * start adding other tests back
      
      * pin openmp too
      
      * maybe need to pin to gcc less than 10.x
      
      * pin libgfortran and libstdcxx as well
      
      * pin to gcc 9.3.0
      
      * move constraints up to initial environment
      
      * add all CI jobs back
      
      * try installing python-graphviz separately
      
      * try new lightgbm/vsts-agent image
      
      * fix typo
      
      * test if pinning gcc for linux gpu_source build is still necessary
      
      * ok yes, pinning gcc is necessary
      
      * test if Linux gpu_source works with Python 3.9.6
      
      * no special exception for Linux gpu_source job
      
      * pin to Python 3.9.6 in Linux gpu_source
      
      * try explicitly asking for libstdcxx-ng for every linux build
      
      * swap compilers
      
      * switch compilers back
      
      * revert accidental whitespace change
      
      * comment out CI
      
      * try Linux gpu_source with different Python versions
      
      * Revert "try Linux gpu_source with different Python versions"
      
      This reverts commit f6f63cbb9b4a9cf138f3580ae4223a8acdd0e94a.
      
      * Revert "comment out CI"
      
      This reverts commit ece191f01e3650c2f325e80ff86bfc8c485fb7bc.
      
      * remove libxml2 install, change CONDA path
      
      * avoid installing conda in rchk job
      
      * empty commit 1
      
      * empty commit 2
      
      * empty commit 3
      
      * empty commit 4
      
      * add more verbose logging around installation of python-graphviz
      
      * empty commit 1
      
      * get mamba info
      
      * get more conda info
      
      * add another mamba info call
      
      * allow for other macOS environments in GHA configuration
      
      * Revert "allow for other macOS environments in GHA configuration"
      
      This reverts commit a3c7a19926be94e3719f5ae9100fbe30e87b35da.
      
      * get more logs from mamba
      
      * get Build.ArtifactsStagingDirectory
      
      * get more logs and try to force re-installing everything
      
      * clean cache after every step
      
      * remove --update-all and make logs less verbose
      
      * remove more print statements and uncomment jobs
      
      * test if conda-clean issue fixes segfaults for gpu_source
      
      * pin python version for gpu_source
      
      * empty commit 1
      
      * use miniforge instead
      
      * empty commit 1
      
      * Apply suggestions from code review
      
      * bring workarounds back
      
      * remove duplicated graphviz system-wide installation (reverts #4095, #4097, #4238)
      
      * empty commit 1
      
      * empty commit 2
      
      * empty commit 3
      
      * empty commit 4
      
      * empty commit 5
      
      * empty commit 6
      
      * empty commit 7
      
      * empty commit 8
      
      * empty commit 9
      
      * empty commit 10
      
      * empty commit 10
      
      * empty commit 10
      
      * empty commit 10
      
      * empty commit 11
      
      * one more try
      
      * try to downgrade Python version for Linux GPU job
      
      * swap compilers
      
      * Revert "swap compilers"
      
      This reverts commit f04dc27b17920a69cbcba1254a8e109ce9791154.
      Co-authored-by: Nikita Titov <nekit94-12@hotmail.com>
      Co-authored-by: Nikita Titov <nekit94-08@mail.ru>
  6. 01 Feb, 2022 1 commit
    • [ci] upgrade to R 4.1.2 in CI, change approach to macOS-latest R CI builds (fixes #4988) (#4989) · 0688f471
      James Lamb authored
      * [ci] manually create symlinks to R entrypoints on macOS (fixes #4988)
      
      * exclude non-R CI jobs
      
      * upgrade to R 4.1.2
      
      * get logs for R install
      
      * pin R 4.1.x jobs to macOS-10.15
      
      * update to R 4.1.2 on Windows
      
      * allow for non-latest macOS builds in GHA configuration
      
      * fix prefix check
      
      * fix config check
      
      * more direct check for mac version
      
      * uncomment other CIs
      
      * update R version in CI job names
  7. 23 Jan, 2022 1 commit
  8. 18 Dec, 2021 1 commit
  9. 04 Dec, 2021 1 commit
  10. 18 Nov, 2021 1 commit
  11. 17 Nov, 2021 1 commit
  12. 14 Nov, 2021 2 commits
  13. 10 Nov, 2021 1 commit
  14. 06 Nov, 2021 1 commit
  15. 05 Nov, 2021 1 commit
  16. 03 Nov, 2021 1 commit
  17. 31 Oct, 2021 1 commit
  18. 30 Oct, 2021 1 commit
  19. 29 Oct, 2021 1 commit
  20. 24 Oct, 2021 1 commit
  21. 23 Oct, 2021 1 commit
  22. 05 Oct, 2021 1 commit
  23. 04 Oct, 2021 1 commit
  24. 22 Sep, 2021 1 commit
  25. 27 Aug, 2021 1 commit
  26. 26 Aug, 2021 1 commit
  27. 22 Aug, 2021 1 commit
  28. 20 Aug, 2021 1 commit
  29. 19 Aug, 2021 1 commit
  30. 14 Aug, 2021 2 commits
  31. 10 Aug, 2021 1 commit
  32. 11 Jul, 2021 1 commit
  33. 10 Jul, 2021 1 commit
  34. 08 Jul, 2021 1 commit
  35. 04 Jul, 2021 1 commit
  36. 02 Jul, 2021 1 commit
    • [python-package] Create Dataset from multiple data files (#4089) · c359896e
      Chen Yufei authored
      * [python-package] create Dataset from sampled data.
      
      * [python-package] create Dataset from List[Sequence].
      
      1. Use random access for data sampling
      2. Support read data from multiple input files
      3. Read data in batch so no need to hold all data in memory
      
      * [python-package] example: create Dataset from multiple HDF5 file.
      
      * fix: revert is_class implementation for seq
      
      * fix: unwanted memory view reference for seq
      
      * fix: seq is_class accepts sklearn matrices
      
      * fix: requirements for example
      
      * fix: pycode
      
      * feat: print static code linting stage
      
      * fix: linting: avoid shell str regex conversion
      
      * code style: doc style
      
      * code style: isort
      
      * fix ci dependency: h5py on windows
      
      * [py] remove rm files in test seq
      https://github.com/microsoft/LightGBM/pull/4089#discussion_r612929623
      
      * docs(python): init_from_sample summary
      
      https://github.com/microsoft/LightGBM/pull/4089#discussion_r612903389
      
      
      
      * remove dataset dump sample data debugging code.
      
      * remove typo fix.
      
      Create separate PR for this.
      
      * fix typo in src/c_api.cpp
      Co-authored-by: James Lamb <jaylamb20@gmail.com>
      
      * style(linting): py3 type hint for seq
      
      * test(basic): os.path style path handling
      
      * Revert "feat: print static code linting stage"
      
      This reverts commit 10bd79f7f8258bea8e61c3abb8c9c7e4456a916d.
      
      * feat(python): sequence on validation set
      
      * minor(python): comment
      
      * minor(python): test option hint
      
      * style(python): fix code linting
      
      * style(python): add pydoc for ref_dataset
      
      * doc(python): sequence
      Co-authored-by: shiyu1994 <shiyu_k1994@qq.com>
      
      * revert(python): sequence class abc
      
      * chore(python): remove rm_files
      
      * Remove useless static_assert.
      
      * refactor: test_basic test for sequence.
      
      * fix lint complaint.
      
      * remove dataset._dump_text in sequence test.
      
      * Fix reverting typo fix.
      
      * Apply suggestions from code review
      Co-authored-by: James Lamb <jaylamb20@gmail.com>
      
      * Fix type hint, code and doc style.
      
      * fix failing test_basic.
      
      * Remove TODO about keep constant in sync with cpp.
      
      * Install h5py only when running python-examples.
      
      * Fix lint complaint.
      
      * Apply suggestions from code review
      Co-authored-by: James Lamb <jaylamb20@gmail.com>
      
      * Doc fixes, remove unused params_str in __init_from_seqs.
      
      * Apply suggestions from code review
      Co-authored-by: Nikita Titov <nekit94-08@mail.ru>
      
      * Remove unnecessary conda install in windows ci script.
      
      * Keep param as example in dataset_from_multi_hdf5.py
      
      * Add _get_sample_count function to remove code duplication.
      
      * Use batch_size parameter in generate_hdf.
      
      * Apply suggestions from code review
      Co-authored-by: Nikita Titov <nekit94-08@mail.ru>
      
      * Fix after applying suggestions.
      
      * Fix test, check idx is instance of numbers.Integral.
      
      * Update python-package/lightgbm/basic.py
      Co-authored-by: Nikita Titov <nekit94-08@mail.ru>
      
      * Expose Sequence class in Python-API doc.
      
      * Handle Sequence object not having batch_size.
      
      * Fix isort lint complaint.
      
      * Apply suggestions from code review
      Co-authored-by: Nikita Titov <nekit94-08@mail.ru>
      
      * Update docstring to mention Sequence as data input.
      
      * Remove get_one_line in test_basic.py
      
      * Make Sequence an abstract class.
      
      * Reduce number of tests for test_sequence.
      
      * Add c_api: LGBM_SampleCount, fix potential bug in LGBMSampleIndices.
      
      * empty commit to trigger ci
      
      * Apply suggestions from code review
      Co-authored-by: Nikita Titov <nekit94-08@mail.ru>
      
      * Rename to LGBM_GetSampleCount, change LGBM_SampleIndices out_len to int32_t.
      
      Also rename total_nrow to num_total_row in c_api.h for consistency.
      
      * Doc about Sequence in docs/Python-Intro.rst.
      
      * Fix: basic.py change LGBM_SampleIndices out_len to int32.
      
      * Add create_valid test case with Dataset from Sequence.
      
      * Apply suggestions from code review
      Co-authored-by: Nikita Titov <nekit94-08@mail.ru>
      
      * Apply suggestions from code review
      Co-authored-by: shiyu1994 <shiyu_k1994@qq.com>
      
      * Remove no longer used DEFAULT_BIN_CONSTRUCT_SAMPLE_CNT.
      
      * Update python-package/lightgbm/basic.py
      Co-authored-by: Nikita Titov <nekit94-08@mail.ru>
      Co-authored-by: Willian Zhang <willian@willian.email>
      Co-authored-by: Willian Z <Willian@Willian-Zhang.com>
      Co-authored-by: James Lamb <jaylamb20@gmail.com>
      Co-authored-by: shiyu1994 <shiyu_k1994@qq.com>
      Co-authored-by: Nikita Titov <nekit94-08@mail.ru>
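The three points listed in this commit (random access for sampling, multiple input files, batched reads) can be illustrated with a small standard-library sketch. It assumes a container that supports `len()`, integer and slice indexing, and a `batch_size` attribute, which is the general shape the commit messages describe for `Sequence`. The names `RowSource`, `ListSource`, `sample_rows`, and `iter_batches` are hypothetical and do not come from LightGBM.

```python
import random
from abc import ABC, abstractmethod


class RowSource(ABC):
    """Hypothetical random-access row container (stand-in, not lightgbm.Sequence)."""

    batch_size = 4

    @abstractmethod
    def __getitem__(self, idx):
        """Return one row for an int index, or a list of rows for a slice."""

    @abstractmethod
    def __len__(self):
        """Return the number of rows."""


class ListSource(RowSource):
    """In-memory source; a real one might wrap one file per instance."""

    def __init__(self, rows, batch_size=4):
        self.rows = rows
        self.batch_size = batch_size

    def __getitem__(self, idx):
        return self.rows[idx]

    def __len__(self):
        return len(self.rows)


def sample_rows(sources, sample_count, seed=0):
    """Sample rows by global index across all sources using random access."""
    total = sum(len(src) for src in sources)
    rng = random.Random(seed)
    picked = sorted(rng.sample(range(total), min(sample_count, total)))
    sampled = []
    for global_idx in picked:
        offset = global_idx
        for src in sources:
            if offset < len(src):
                sampled.append(src[offset])
                break
            offset -= len(src)
    return sampled


def iter_batches(sources):
    """Yield rows one batch at a time so no source is loaded whole."""
    for src in sources:
        for start in range(0, len(src), src.batch_size):
            yield src[start:start + src.batch_size]


part_a = ListSource([[i, i * 2] for i in range(5)], batch_size=2)
part_b = ListSource([[i, i * 2] for i in range(5, 8)], batch_size=2)
batches = list(iter_batches([part_a, part_b]))
sample = sample_rows([part_a, part_b], 3)
```

Sampling touches only the selected rows, and iteration never holds more than `batch_size` rows from a source at once, which is the memory property point 3 of the commit message refers to.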
  37. 26 Jun, 2021 1 commit
  38. 28 May, 2021 1 commit