1. 19 Feb, 2020 1 commit
    • [python] [R-package] refine the parameters for Dataset (#2594) · 9f79e840
      Guolin Ke authored
      
      
      * reset
      
      * fix a bug
      
      * fix test
      
      * Update c_api.h
      
      * support not filtering features by min_data
      
      * add warning in reset config
      
      * refine warnings for override dataset's parameter
      
      * some cleans
      
      * clean code
      
      * clean code
      
      * refine C API function doxygen comments
      
      * refined new param description
      
      * refined doxygen comments for R API function
      
      * removed stuff related to int8
      
      * break long line in warning message
      
      * removed tests whose results can no longer be validated
      
      * added test for warnings about unchangeable params
      
      * write parameter from dataset to booster
      
      * consider free_raw_data.
      
      * fix params
      
      * fix bug
      
      * implementing R
      
      * fix typo
      
      * filter params in R
      
      * fix R
      
      * not min_data
      
      * refined tests
      
      * fixed linting
      
      * refine
      
      * pylint
      
      * add docstring
      
      * fix docstring
      
      * R lint
      
      * updated description for C API function
      
      * use param aliases in Python
      
      * fixed typo
      
      * fixed typo
      
      * added more params to test
      
      * removed debug print
      
      * fix dataset construct place
      
      * fix merge bug
      
      * Update feature_histogram.hpp
      
      * add is_sparse back
      
      * remove unused parameters
      
      * fix lint
      
      * add data random seed
      
      * update
      
      * [R-package] centralized Dataset parameter aliases and added tests on Dataset parameter updating (#2767)
      Co-authored-by: Nikita Titov <nekit94-08@mail.ru>
      Co-authored-by: James Lamb <jaylamb20@gmail.com>
  2. 17 Feb, 2020 1 commit
  3. 10 Feb, 2020 1 commit
  4. 08 Feb, 2020 3 commits
    • various minor style, docs and cpplint improvements (#2747) · 1c1a2765
      Nikita Titov authored
      * various minor style, docs and cpplint improvements
      
      * fixed typo in warning
      
      * fix recently added cpplint errors
      
      * move note for params upper in description for consistency
    • Extremely randomized trees (#2671) · 446b8b6c
      Belinda Trotta authored
      * Add extra-trees functionality.
      
      * Remove unnecessary code.
      
      * Update docs.
      
      * Use template for FindBestThresholdSequence.
      
      * Use separate random seed. Fix bug.
    • Config static initialization (#2743) · da811d46
      jcipar authored
      * Change static map/set to functions that return static variables
      
      * Fixed newline
      
      * testing CI
      
      * removed commented-out code, fixed formatting
      
      * return const references to parameter_set and alias_table
      
      * fixup whitespace
      
      * Too many requests error when running CI on docs. Triggering a rebuild.
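      The first item in the commit above (replacing static maps and sets with functions that return static variables) is a common C++ idiom for avoiding static initialization order problems. What follows is a minimal sketch of that idiom, not the code from the commit: `DemoConfig` and `Canonical` are hypothetical names, the table contents are illustrative, and only the accessor names `alias_table` and `parameter_set` are taken from the commit message.

```cpp
#include <string>
#include <unordered_map>
#include <unordered_set>

// Hypothetical stand-in for a configuration class. The accessor names mirror
// the commit message; the table contents are illustrative only.
struct DemoConfig {
  // The map is constructed the first time this function runs rather than at
  // program start-up, so it is always initialized before its first use.
  static const std::unordered_map<std::string, std::string>& alias_table() {
    static const std::unordered_map<std::string, std::string> table{
        {"nthread", "num_threads"},
        {"num_thread", "num_threads"},
    };
    return table;
  }

  // Same pattern for the set of known parameter names.
  static const std::unordered_set<std::string>& parameter_set() {
    static const std::unordered_set<std::string> params{"num_threads", "max_bin"};
    return params;
  }

  // Hypothetical helper: resolves an alias to its canonical name and returns
  // an empty string when the name is not a known parameter.
  static std::string Canonical(const std::string& name) {
    const auto& aliases = alias_table();
    const auto it = aliases.find(name);
    const std::string resolved = (it != aliases.end()) ? it->second : name;
    return parameter_set().count(resolved) > 0 ? resolved : std::string();
  }
};
```

      Since C++11 the initialization of a function-local static is thread-safe, and returning a const reference means callers share one instance instead of copying the container on every call.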
  5. 04 Feb, 2020 1 commit
  6. 03 Feb, 2020 2 commits
  7. 02 Feb, 2020 1 commit
    • Support both row-wise and col-wise multi-threading (#2699) · 509c2e50
      Guolin Ke authored
      
      
      * commit
      
      * fix a bug
      
      * fix bug
      
      * reset to track changes
      
      * refine the auto choose logic
      
      * sort the time stats output
      
      * fix include
      
      * change multi_val_bin_sparse_threshold
      
      * add cmake
      
      * add _mm_malloc and _mm_free for cross platform
      
      * fix cmake bug
      
      * timer for split
      
      * try to fix cmake
      
      * fix tests
      
      * refactor DataPartition::Split
      
      * fix test
      
      * typo
      
      * formating
      
      * Revert "formating"
      
      This reverts commit 5b8de4f7fb9d975ee23701d276a66d40ee6d4222.
      
      * add document
      
      * [R-package] Added tests on use of force_col_wise and force_row_wise in training (#2719)
      
      * naming
      
      * fix gpu code
      
      * Update include/LightGBM/bin.h
      Co-Authored-By: James Lamb <jaylamb20@gmail.com>
      
      * Update src/treelearner/ocl/histogram16.cl
      
      * test: swap compilers for CI
      
      * fix omp
      
      * not avx2
      
      * no aligned for feature histogram
      
      * Revert "refactor DataPartition::Split"
      
      This reverts commit 256e6d9641ade966a1f54da1752e998a1149b6f8.
      
      * slightly refactor data partition
      
      * reduce the memory cost
      Co-authored-by: James Lamb <jaylamb20@gmail.com>
      Co-authored-by: Nikita Titov <nekit94-08@mail.ru>
  8. 30 Jan, 2020 1 commit
    • Implementation of XE_NDCG_MART for the ranking task (#2620) · 86530988
      sbruch authored
      * Implementation of XE_NDCG loss function for ranking.
      
      * Add citation
      
      * Check in example usage for xe_ndcg loss.
      
      * Seed the generator when a seed is provided in the config. Add unit-tests for xe_ndcg
      
      * Update documentation
      
      * Fix indentation
      
      * Address issues raised by reviewers.
      
      * Clean up include statements.
      
      * Fix issues raised by reviewers.
      
      * Regenerate parameters.rst
      
      * Add a note to explain that reproducing xe_ndcg results requires num_threads to be one.
      
      * Introduce objective_seed and use that in rank_xendcg instead of directly using seed
      
      * Change default value of objective_seed
  9. 15 Jan, 2020 1 commit
  10. 14 Jan, 2020 2 commits
  11. 13 Jan, 2020 1 commit
  12. 07 Jan, 2020 1 commit
  13. 29 Dec, 2019 1 commit
  14. 20 Dec, 2019 1 commit
  15. 17 Dec, 2019 1 commit
  16. 12 Dec, 2019 1 commit
    • auc-mu metric (#2567) · 222775ca
      Belinda Trotta authored
      * Fix bug where small values of max_bin cause crash.
      
      * Revert "Fix bug where small values of max_bin cause crash."
      
      This reverts commit fe5c8e2547057c1fa5750bcddd359dd7708fab4b.
      
      * Add auc-mu multiclass metric.
      
      * Fix bug where scores are equal.
      
      * Merge.
      
      * Change name to auc_mu everywhere (instead of auc-mu).
      
      * Fix comparison between signed and unsigned int.
      
      * Change name to AUC-mu in docs and output messages.
      
      * Improve test.
      
      * Use prefix increment.
      
      * Update R package.
      
      * Fix style issues.
      
      * Tidy up test code.
      
      * Read all lines first then process.
      
      * Allow passing AUC-mu weights directly as a list in parameters.
      
      * Remove unused code, improve example and docs.
  17. 30 Nov, 2019 1 commit
  18. 29 Nov, 2019 1 commit
  19. 16 Nov, 2019 1 commit
  20. 15 Nov, 2019 2 commits
  21. 12 Nov, 2019 1 commit
  22. 11 Nov, 2019 1 commit
  23. 05 Nov, 2019 1 commit
  24. 01 Nov, 2019 1 commit
  25. 25 Oct, 2019 1 commit
  26. 15 Oct, 2019 1 commit
    • reduce the buffer when using high dimensional data in distributed mode. (#2485) · 40e56ca7
      Guolin Ke authored
      * reduce the buffer when using high dimensional data in distributed mode.
      
      * Update dataset_loader.cpp
      
      * refix
      
      * typo
      
      * fix number of bin accumulation.
      
      * avoid overflow
      
      * fix warning
      
      * efficient solution.
      
      * Update dataset.h
      
      * fix bin count output
      
      * fix warning
      
      * bug in dist number of feature check
      
      * fix possible edge case
      
      * Update dataset.cpp
      
      * possible bug fix
      
      * fix
  27. 09 Oct, 2019 1 commit
  28. 03 Oct, 2019 1 commit
  29. 02 Oct, 2019 1 commit
  30. 01 Oct, 2019 1 commit
  31. 28 Sep, 2019 1 commit
    • Predefined bin thresholds (#2325) · cc7a1e27
      Belinda Trotta authored
      * Fix bug where small values of max_bin cause crash.
      
      * Revert "Fix bug where small values of max_bin cause crash."
      
      This reverts commit fe5c8e2547057c1fa5750bcddd359dd7708fab4b.
      
      * Add functionality to force bin thresholds.
      
      * Fix style issues.
      
      * Use stable sort.
      
      * Minor style and doc fixes.
      
      * Add functionality to force bin thresholds.
      
      * Fix style issues.
      
      * Use stable sort.
      
      * Minor style and doc fixes.
      
      * Change binning behavior to be same as PR #2342.
      
      * Add functionality to force bin thresholds.
      
      * Fix style issues.
      
      * Use stable sort.
      
      * Minor style and doc fixes.
      
      * Add functionality to force bin thresholds.
      
      * Fix style issues.
      
      * Use stable sort.
      
      * Minor style and doc fixes.
      
      * Change binning behavior to be same as PR #2342.
      
      * Add functionality to force bin thresholds.
      
      * Fix style issues.
      
      * Minor style and doc fixes.
      
      * Add functionality to force bin thresholds.
      
      * Fix style issues.
      
      * Minor style and doc fixes.
      
      * Change binning behavior to be same as PR #2342.
      
      * Add functionality to force bin thresholds.
      
      * Fix style issues.
      
      * Use stable sort.
      
      * Minor style and doc fixes.
      
      * Add functionality to force bin thresholds.
      
      * Fix style issues.
      
      * Use stable sort.
      
      * Minor style and doc fixes.
      
      * Change binning behavior to be same as PR #2342.
      
      * Use different bin finding function for predefined bounds.
      
      * Fix style issues.
      
      * Minor refactoring, overload FindBinWithZeroAsOneBin.
      
      * Fix style issues.
      
      * Fix bug and add new test.
      
      * Add warning when using categorical features with forced bins.
      
      * Pass forced_upper_bounds by reference.
      
      * Pass container types by const reference.
      
      * Get categorical features using FeatureBinMapper.
      
      * Fix bug for small max_bin.
      
      * Move GetForcedBins to DatasetLoader.
      
      * Find forced bins in dataset_loader.
      
      * Minor fixes.
  32. 27 Sep, 2019 1 commit
  33. 26 Sep, 2019 2 commits
  34. 22 Sep, 2019 1 commit