1. 07 Feb, 2021 2 commits
    • James Lamb's avatar
      [dask] Add support for 'pred_leaf' in Dask estimators (fixes #3792) (#3919) · 37485fff
      James Lamb authored
      * fix tests
      
      * fix tests
      
      * fix test comments
      
      * simplify tests
      
      * Apply suggestions from code review
      37485fff
    • GOusignu's avatar
      [dask] Add unit tests that signatures are the same between Dask and... · 6f127847
      GOusignu authored
      [dask] Add unit tests that signatures are the same between Dask and scikit-learn estimators  (#3911)
      
      * [dask] Add unit tests that signatures are the same between Dask and scikit-learn estimators (fixes microsoft#3907)
      
      * [dask] Add unit tests that signatures are the same between Dask and scikit-learn estimators (fixes microsoft#3907)
      
      * [dask] Add unit tests that signatures are the same between Dask and scikit-learn estimators (fixes microsoft#3907)
      
      * [dask] Add unit tests that signatures are the same between Dask and scikit-learn estimators (fixes microsoft#3907)
      
      * [dask] Add unit tests that signatures are the same between Dask and scikit-learn estimators (fixes microsoft#3907)
      
      * [dask] Add unit tests that signatures are the same between Dask and scikit-learn estimators (fixes microsoft#3907)
      
      * [dask] Add unit tests that signatures are the same between Dask and scikit-learn estimators (fixes microsoft#3907)
      
      * [dask] Add unit tests that signatures are the same between Dask and scikit-learn estimators (fixes microsoft#3907)
      
      * [dask] Add unit tests that signatures are the same between Dask and scikit-learn estimators (fixes microsoft#3907)
      
      * [dask] Add unit tests that signatures are the same between Dask and scikit-learn estimators (fixes microsoft#3907)
      
      * [dask] Add unit tests that signatures are the same between Dask and scikit-learn estimators (fixes microsoft#3907)
      
      * [dask] Add unit tests that signatures are the same between Dask and scikit-learn estimators (fixes microsoft#3907)
      
      * [dask] Add unit tests that signatures are the same between Dask and scikit-learn estimators (fixes microsoft#3907)
      
      * [dask] Add unit tests that signatures are the same between Dask and scikit-learn estimators (fixes microsoft#3907)
      
      * [dask] Add unit tests that signatures are the same between Dask and scikit-learn estimators (fixes microsoft#3907)
      
      * [dask] Add unit tests that signatures are the same between Dask and scikit-learn estimators (fixes microsoft#3907)
      
      * [dask] Add unit tests that signatures are the same between Dask and scikit-learn estimators (fixes microsoft#3907)
      
      * [dask] Add unit tests that signatures are the same between Dask and scikit-learn estimators (fixes microsoft#3907)
      
      * [dask] Add unit tests that signatures are the same between Dask and scikit-learn estimators (fixes microsoft#3907)
      
      * [dask] Add unit tests that signatures are the same between Dask and scikit-learn estimators (fixes microsoft#3907)
      6f127847
  2. 06 Feb, 2021 1 commit
  3. 03 Feb, 2021 2 commits
  4. 02 Feb, 2021 1 commit
  5. 29 Jan, 2021 2 commits
    • Nikita Titov's avatar
      217642ca
    • James Lamb's avatar
      [dask] fix teardown issues in Dask tests (fixes #3829) (#3869) · 42d1633a
      James Lamb authored
      * [dask] reduce teardown erros in Dask tests
      
      * azure
      
      * show logs
      
      * try again
      
      * more
      
      * submodules
      
      * try a bunch of sdist tests
      
      * empty commit
      
      * empty commit
      
      * empty commit
      
      * use sh-ubuntu
      
      * 10 sdist tasks
      
      * stuff
      
      * empty commit
      
      * empty commit
      
      * empty commit
      
      * empty commit
      
      * try bdist
      
      * empty commit
      
      * empty commit
      
      * empty commit
      
      * empty commit
      
      * py37
      
      * empty commit
      
      * empty commit
      
      * empty commit
      
      * empty commit
      
      * empty commit
      
      * empty commit
      
      * empty commit
      
      * empty commit
      
      * empty commit
      
      * python 3.8
      
      * empty commit
      
      * empty commit
      
      * empty commit
      
      * empty commit
      
      * empty commit
      
      * cuda config
      
      * comment out cuda again
      
      * setting timeout
      
      * put client close in the right place
      
      * uncomment CI, make timeout 60
      42d1633a
  6. 28 Jan, 2021 1 commit
  7. 27 Jan, 2021 1 commit
  8. 26 Jan, 2021 3 commits
  9. 25 Jan, 2021 3 commits
  10. 24 Jan, 2021 3 commits
  11. 23 Jan, 2021 1 commit
  12. 22 Jan, 2021 7 commits
  13. 21 Jan, 2021 2 commits
  14. 20 Jan, 2021 1 commit
  15. 19 Jan, 2021 1 commit
  16. 18 Jan, 2021 1 commit
  17. 15 Jan, 2021 3 commits
  18. 04 Jan, 2021 1 commit
  19. 03 Jan, 2021 1 commit
  20. 28 Dec, 2020 1 commit
    • Nikita Titov's avatar
      small code and docs refactoring (#3681) · 5a460846
      Nikita Titov authored
      * small code and docs refactoring
      
      * Update CMakeLists.txt
      
      * Update .vsts-ci.yml
      
      * Update test.sh
      
      * continue
      
      * continue
      
      * revert stable sort for all-unique values
      5a460846
  21. 24 Dec, 2020 1 commit
    • Belinda Trotta's avatar
      Trees with linear models at leaves (#3299) · fcfd4132
      Belinda Trotta authored
      * Add Eigen library.
      
      * Working for simple test.
      
      * Apply changes to config params.
      
      * Handle nan data.
      
      * Update docs.
      
      * Add test.
      
      * Only load raw data if boosting=gbdt_linear
      
      * Remove unneeded code.
      
      * Minor updates.
      
      * Update to work with sk-learn interface.
      
      * Update to work with chunked datasets.
      
      * Throw error if we try to create a Booster with an already-constructed dataset having incompatible parameters.
      
      * Save raw data in binary dataset file.
      
      * Update docs and fix parameter checking.
      
      * Fix dataset loading.
      
      * Add test for regularization.
      
      * Fix bugs when saving and loading tree.
      
      * Add test for load/save linear model.
      
      * Remove unneeded code.
      
      * Fix case where not enough leaf data for linear model.
      
      * Simplify code.
      
      * Speed up code.
      
      * Speed up code.
      
      * Simplify code.
      
      * Speed up code.
      
      * Fix bugs.
      
      * Working version.
      
      * Store feature data column-wise (not fully working yet).
      
      * Fix bugs.
      
      * Speed up.
      
      * Speed up.
      
      * Remove unneeded code.
      
      * Small speedup.
      
      * Speed up.
      
      * Minor updates.
      
      * Remove unneeded code.
      
      * Fix bug.
      
      * Fix bug.
      
      * Speed up.
      
      * Speed up.
      
      * Simplify code.
      
      * Remove unneeded code.
      
      * Fix bug, add more tests.
      
      * Fix bug and add test.
      
      * Only store numerical features
      
      * Fix bug and speed up using templates.
      
      * Speed up prediction.
      
      * Fix bug with regularisation
      
      * Visual studio files.
      
      * Working version
      
      * Only check nans if necessary
      
      * Store coeff matrix as an array.
      
      * Align cache lines
      
      * Align cache lines
      
      * Preallocation coefficient calculation matrices
      
      * Small speedups
      
      * Small speedup
      
      * Reverse cache alignment changes
      
      * Change to dynamic schedule
      
      * Update docs.
      
      * Refactor so that linear tree learner is not a separate class.
      
      * Add refit capability.
      
      * Speed up
      
      * Small speedups.
      
      * Speed up add prediction to score.
      
      * Fix bug
      
      * Fix bug and speed up.
      
      * Speed up dataload.
      
      * Speed up dataload
      
      * Use vectors instead of pointers
      
      * Fix bug
      
      * Add OMP exception handling.
      
      * Change return type of LGBM_BoosterGetLinear to bool
      
      * Change return type of LGBM_BoosterGetLinear back to int, only parameter type needed to change
      
      * Remove unused internal_parent_ property of tree
      
      * Remove unused parameter to CreateTreeLearner
      
      * Remove reference to LinearTreeLearner
      
      * Minor style issues
      
      * Remove unneeded check
      
      * Reverse temporary testing change
      
      * Fix Visual Studio project files
      
      * Restore LightGBM.vcxproj.filters
      
      * Speed up
      
      * Speed up
      
      * Simplify code
      
      * Update docs
      
      * Simplify code
      
      * Initialise storage space for max num threads
      
      * Move Eigen to include directory and delete unused files
      
      * Remove old files.
      
      * Fix so it compiles with mingw
      
      * Fix gpu tree learner
      
      * Change AddPredictionToScore back to const
      
      * Fix python lint error
      
      * Fix C++ lint errors
      
      * Change eigen to a submodule
      
      * Update comment
      
      * Add the eigen folder
      
      * Try to fix build issues with eigen
      
      * Remove eigen files
      
      * Add eigen as submodule
      
      * Fix include paths
      
      * Exclude eigen files from Python linter
      
      * Ignore eigen folders for pydocstyle
      
      * Fix C++ linting errors
      
      * Fix docs
      
      * Fix docs
      
      * Exclude eigen directories from doxygen
      
      * Update manifest to include eigen
      
      * Update build_r to include eigen files
      
      * Fix compiler warnings
      
      * Store raw feature data as float
      
      * Use float for calculating linear coefficients
      
      * Remove eigen directory from GLOB
      
      * Don't compile linear model code when building R package
      
      * Fix doxygen issue
      
      * Fix lint issue
      
      * Fix lint issue
      
      * Remove uneeded code
      
      * Restore delected lines
      
      * Restore delected lines
      
      * Change return type of has_raw to bool
      
      * Update docs
      
      * Rename some variables and functions for readability
      
      * Make tree_learner parameter const in AddScore
      
      * Fix style issues
      
      * Pass vectors as const reference when setting tree properties
      
      * Make temporary storage of serial_tree_learner mutable so we can make the object's methods const
      
      * Remove get_raw_size, use num_numeric_features instead
      
      * Fix typo
      
      * Make contains_nan_ and any_nan_ properties immutable again
      
      * Remove data_has_nan_ property of tree
      
      * Remove temporary test code
      
      * Make linear_tree a dataset param
      
      * Fix lint error
      
      * Make LinearTreeLearner a separate class
      
      * Fix lint errors
      
      * Fix lint error
      
      * Add linear_tree_learner.o
      
      * Simulate omp_get_max_threads if openmp is not available
      
      * Update PushOneData to also store raw data.
      
      * Cast size to int
      
      * Fix bug in ReshapeRaw
      
      * Speed up code with multithreading
      
      * Use OMP_NUM_THREADS
      
      * Speed up with multithreading
      
      * Update to use ArrayToString
      
      * Fix tests
      
      * Fix test
      
      * Fix bug introduced in merge
      
      * Minor updates
      
      * Update docs
      fcfd4132
  22. 22 Dec, 2020 1 commit
    • Jan Stiborek's avatar
      [python] [dask] add initial dask integration (#3515) · d90a16d5
      Jan Stiborek authored
      * migrated implementation from dask/dask-lightgbm
      
      * relaxed tests
      
      * tests skipped in case that MPI is used
      
      * fixed python 2.7 import + tests disabled on windows
      
      * python < 3.6 is not supported in tests
      
      * tests enabled only for linux
      
      * tests disabled for mpi interface
      
      * dask version pinned to >= 2.0
      
      * added @jameslamb as code owner
      
      * added missing pandas dependency
      
      * code refactoring, removed code duplication - lightgbm.dask.LGBMClassifier.fit is the same as lightgbm.dask.LGBMRegressor.fit
      
      * fixed refactoring
      
      * code deduplication - fit method moved into mixin class
      
      * fixed CODEOWNERS
      
      * removed unnecessary import
      
      * skip the module execution on python < 3.6 and on platform different than linux.
      
      * removed skip for python < 3.6
      
      * review comments
      
      * removed noqa, renamed API classes, renamed local variables
      d90a16d5