1. 30 Sep, 2020 1 commit
    • Guolin Ke's avatar
      fix address alignment, required by cran (#3415) · f30dbe87
      Guolin Ke authored
      * fix dataset binary file alignment
      
      * many fixes
      
      * fix warnings
      
      * fix bug
      
      * Update file_io.cpp
      
      * Update file_io.cpp
      
      * simplify code
      
      * Apply suggestions from code review
      
      * general
      
      * remove unneeded alignment
      
      * Update file_io.h
      
      * int32 to byte8 alignment
      
      * Apply suggestions from code review
      
      * Apply suggestions from code review
      f30dbe87
  2. 15 Aug, 2020 1 commit
    • Guolin Ke's avatar
      fix zero bin in categorical split (#3305) · 03910760
      Guolin Ke authored
      * fix zero bin
      
      * some fix
      
      * fix bin mapping
      
      * fix
      
      * fix bug
      
      * use stable sort
      
      * fix cat forced split
      
      * Apply suggestions from code review
      
      * Apply suggestions from code review
      
      * Apply suggestions from code review
      03910760
  3. 05 Jun, 2020 1 commit
  4. 01 Jun, 2020 1 commit
  5. 08 Mar, 2020 1 commit
  6. 04 Mar, 2020 1 commit
  7. 02 Mar, 2020 1 commit
  8. 29 Feb, 2020 1 commit
  9. 27 Feb, 2020 2 commits
  10. 25 Feb, 2020 1 commit
  11. 19 Feb, 2020 1 commit
    • Guolin Ke's avatar
      [python] [R-package] refine the parameters for Dataset (#2594) · 9f79e840
      Guolin Ke authored
      
      
      * reset
      
      * fix a bug
      
      * fix test
      
      * Update c_api.h
      
      * support to no filter features by min_data
      
      * add warning in reset config
      
      * refine warnings for override dataset's parameter
      
      * some cleans
      
      * clean code
      
      * clean code
      
      * refine C API function doxygen comments
      
      * refined new param description
      
      * refined doxygen comments for R API function
      
      * removed stuff related to int8
      
      * break long line in warning message
      
      * removed tests which results cannot be validated anymore
      
      * added test for warnings about unchangeable params
      
      * write parameter from dataset to booster
      
      * consider free_raw_data.
      
      * fix params
      
      * fix bug
      
      * implementing R
      
      * fix typo
      
      * filter params in R
      
      * fix R
      
      * not min_data
      
      * refined tests
      
      * fixed linting
      
      * refine
      
      * pilint
      
      * add docstring
      
      * fix docstring
      
      * R lint
      
      * updated description for C API function
      
      * use param aliases in Python
      
      * fixed typo
      
      * fixed typo
      
      * added more params to test
      
      * removed debug print
      
      * fix dataset construct place
      
      * fix merge bug
      
      * Update feature_histogram.hpp
      
      * add is_sparse back
      
      * remove unused parameters
      
      * fix lint
      
      * add data random seed
      
      * update
      
      * [R-package] centrallized Dataset parameter aliases and added tests on Dataset parameter updating (#2767)
      Co-authored-by: default avatarNikita Titov <nekit94-08@mail.ru>
      Co-authored-by: default avatarJames Lamb <jaylamb20@gmail.com>
      9f79e840
  12. 17 Feb, 2020 1 commit
  13. 08 Feb, 2020 1 commit
  14. 02 Feb, 2020 1 commit
    • Guolin Ke's avatar
      Support both row-wise and col-wise multi-threading (#2699) · 509c2e50
      Guolin Ke authored
      
      
      * commit
      
      * fix a bug
      
      * fix bug
      
      * reset to track changes
      
      * refine the auto choose logic
      
      * sort the time stats output
      
      * fix include
      
      * change  multi_val_bin_sparse_threshold
      
      * add cmake
      
      * add _mm_malloc and _mm_free for cross platform
      
      * fix cmake bug
      
      * timer for split
      
      * try to fix cmake
      
      * fix tests
      
      * refactor DataPartition::Split
      
      * fix test
      
      * typo
      
      * formating
      
      * Revert "formating"
      
      This reverts commit 5b8de4f7fb9d975ee23701d276a66d40ee6d4222.
      
      * add document
      
      * [R-package] Added tests on use of force_col_wise and force_row_wise in training (#2719)
      
      * naming
      
      * fix gpu code
      
      * Update include/LightGBM/bin.h
      Co-Authored-By: default avatarJames Lamb <jaylamb20@gmail.com>
      
      * Update src/treelearner/ocl/histogram16.cl
      
      * test: swap compilers for CI
      
      * fix omp
      
      * not avx2
      
      * no aligned for feature histogram
      
      * Revert "refactor DataPartition::Split"
      
      This reverts commit 256e6d9641ade966a1f54da1752e998a1149b6f8.
      
      * slightly refactor data partition
      
      * reduce the memory cost
      Co-authored-by: default avatarJames Lamb <jaylamb20@gmail.com>
      Co-authored-by: default avatarNikita Titov <nekit94-08@mail.ru>
      509c2e50
  15. 14 Jan, 2020 1 commit
    • Guolin Ke's avatar
      support most frequent bin (#2689) · c7e90393
      Guolin Ke authored
      * implement
      
      * fix warning
      
      * fix bug
      
      * fix a bug
      
      * remove unneed function
      
      * fix data push bug
      
      * fix valid data push
      
      * fix bug for missing_type=zero
      
      * refine split
      
      * renames
      
      * typo
      c7e90393
  16. 15 Oct, 2019 1 commit
    • Guolin Ke's avatar
      reduce the buffer when using high dimensional data in distributed mode. (#2485) · 40e56ca7
      Guolin Ke authored
      * reduce the buffer when using high dimensional data in distributed mode.
      
      * Update dataset_loader.cpp
      
      * refix
      
      * typo
      
      * fix number of bin accumulation.
      
      * avoid overflow
      
      * fix warning
      
      * efficient solution.
      
      * Update dataset.h
      
      * fix bin count output
      
      * fix warning
      
      * bug in dist number of feature check
      
      * fix possible edge case
      
      * Update dataset.cpp
      
      * possible bug fix
      
      * fix
      40e56ca7
  17. 01 Oct, 2019 1 commit
  18. 28 Sep, 2019 1 commit
    • Belinda Trotta's avatar
      Predefined bin thresholds (#2325) · cc7a1e27
      Belinda Trotta authored
      * Fix bug where small values of max_bin cause crash.
      
      * Revert "Fix bug where small values of max_bin cause crash."
      
      This reverts commit fe5c8e2547057c1fa5750bcddd359dd7708fab4b.
      
      * Add functionality to force bin thresholds.
      
      * Fix style issues.
      
      * Use stable sort.
      
      * Minor style and doc fixes.
      
      * Add functionality to force bin thresholds.
      
      * Fix style issues.
      
      * Use stable sort.
      
      * Minor style and doc fixes.
      
      * Change binning behavior to be same as PR #2342.
      
      * Add functionality to force bin thresholds.
      
      * Fix style issues.
      
      * Use stable sort.
      
      * Minor style and doc fixes.
      
      * Add functionality to force bin thresholds.
      
      * Fix style issues.
      
      * Use stable sort.
      
      * Minor style and doc fixes.
      
      * Change binning behavior to be same as PR #2342.
      
      * Add functionality to force bin thresholds.
      
      * Fix style issues.
      
      * Minor style and doc fixes.
      
      * Add functionality to force bin thresholds.
      
      * Fix style issues.
      
      * Minor style and doc fixes.
      
      * Change binning behavior to be same as PR #2342.
      
      * Add functionality to force bin thresholds.
      
      * Fix style issues.
      
      * Use stable sort.
      
      * Minor style and doc fixes.
      
      * Add functionality to force bin thresholds.
      
      * Fix style issues.
      
      * Use stable sort.
      
      * Minor style and doc fixes.
      
      * Change binning behavior to be same as PR #2342.
      
      * Use different bin finding function for predefined bounds.
      
      * Fix style issues.
      
      * Minor refactoring, overload FindBinWithZeroAsOneBin.
      
      * Fix style issues.
      
      * Fix bug and add new test.
      
      * Add warning when using categorical features with forced bins.
      
      * Pass forced_upper_bounds by reference.
      
      * Pass container types by const reference.
      
      * Get categorical features using FeatureBinMapper.
      
      * Fix bug for small max_bin.
      
      * Move GetForcedBins to DatasetLoader.
      
      * Find forced bins in dataset_loader.
      
      * Minor fixes.
      cc7a1e27
  19. 22 Sep, 2019 1 commit
  20. 20 Aug, 2019 1 commit
  21. 16 Aug, 2019 1 commit
    • Belinda Trotta's avatar
      Bug fix: small values of max_bin cause program to crash (#2299) · c421f898
      Belinda Trotta authored
      * Fix bug where small values of max_bin cause crash.
      
      * Revert "Fix bug where small values of max_bin cause crash."
      
      This reverts commit fe5c8e2547057c1fa5750bcddd359dd7708fab4b.
      
      * Fix bug where small values of max_bin cause crash.
      
      * Reset random seed in test, remove extra blank line.
      
      * Minor bug fix. Remove extra blank line.
      
      * Change old test to account for new binning behavior.
      c421f898
  22. 23 Jul, 2019 1 commit
  23. 13 Apr, 2019 1 commit
  24. 11 Apr, 2019 1 commit
  25. 02 Feb, 2019 1 commit
  26. 20 Dec, 2018 1 commit
  27. 10 Oct, 2018 1 commit
  28. 11 Sep, 2018 1 commit
  29. 16 Aug, 2018 1 commit
  30. 29 Jul, 2018 1 commit
  31. 27 Feb, 2018 1 commit
    • ebernhardson's avatar
      Experimental support for HDFS (#1243) · 7e186a57
      ebernhardson authored
      * Read and write datsets from hdfs.
      * Only enabled when cmake is run with -DUSE_HDFS:BOOL=TRUE
      * Introduces VirtualFile(Reader|Writer) to asbtract VFS differences
      7e186a57
  32. 12 Dec, 2017 1 commit
  33. 09 Nov, 2017 1 commit
    • wxchan's avatar
      add init_score & test cpp and python result consistency (#1007) · bc0579c8
      wxchan authored
      * add init_score & test cpp and python result consistency
      
      * try fix common.h
      
      * Fix tests (#3)
      
      * update atof
      
      * fix bug
      
      * fix tests.
      
      * fix bug
      
      * fix dtypes
      
      * fix categorical feature override
      
      * fix protobuf on vs build (#1004)
      
      * [optional] support protobuf
      
      * fix windows/LightGBM.vcxproj
      
      * add doc
      
      * fix doc
      
      * fix vs support (#2)
      
      * fix vs support
      
      * fix cmake
      
      * fix #1012
      
      * [python] add network config api  (#1019)
      
      * add network
      
      * update doc
      
      * add float tolerance in bin finder.
      
      * fix a bug
      
      * update tests
      
      * add double torelance on tree model
      
      * fix tests
      
      * simplify the double comparison
      
      * fix lightsvm zero base
      
      * move double tolerance to the bin finder.
      
      * fix pylint
      
      * clean test.sh
      
      * add sklearn test
      
      * remove underline
      
      * clean codes
      
      * set random_state=None
      
      * add last line
      
      * fix doc
      
      * rename file
      
      * try fix test
      bc0579c8
  34. 16 Oct, 2017 2 commits
  35. 13 Oct, 2017 1 commit
    • Guolin Ke's avatar
      fix #991 (#992) · ef221275
      Guolin Ke authored
      * refine categorical split
      
      * a bug fix
      
      * fix a bug
      ef221275
  36. 30 Aug, 2017 1 commit
  37. 18 Aug, 2017 1 commit
  38. 30 Jul, 2017 1 commit
    • Guolin Ke's avatar
      Better missing value handle (#747) · 00cb04a2
      Guolin Ke authored
      * finish the data loading part
      
      * allow prediction.
      
      * fix bug for decision type.
      
      * finish split finding part
      
      * fix bugs.
      
      * bug fixed. add a test .
      
      * fix pep8 .
      
      * update documents.
      
      * fix test bugs.
      
      * fix a format
      
      * fix import error in python test.
      
      * disable missing handle in categorial features.
      
      * fix a bug.
      
      * add more tests.
      
      * fix pep8
      
      * fix bugs.
      
      * remove the missing handle code for categorical feature.
      00cb04a2