1. 10 Aug, 2022 1 commit
    • Scott Votaw's avatar
      feature: Add true streaming APIs to reduce client-side memory usage (#5299) · 0a5c5838
      Scott Votaw authored
      * Extract streaming to own PR
      
      * small merge fixes and cleanup
      
      * linting fixes
      
      * fix cast warning
      
      * Fix accidental deletion during branch transfer
      
      * responded to initial triage comments
      
      * Added more tests to use create-from-samples APIs
      
      * added mutex and adjusted nclasses logic
      
      * Fix thread-safety for pushing data to sparse bins through Push APIs
      
      * lint and doc fixes
      
      * Small SWIG fix
      
      * nit fix
      
      * Responded to StrikerRUS comments
      
      * fix breaking change after merge with master
      
      * Extract streaming to own PR
      
      * small merge fixes and cleanup
      
      * Fix accidental deletion during branch transfer
      
      * responded to initial triage comments
      
      * Added more tests to use create-from-samples APIs
      
      * Fix rstcheck call in ci
      
      * remove TODOs
      
      * Extract streaming to own PR
      
      * small merge fixes and cleanup
      
      * Fix accidental deletion during branch transfer
      
      * responded to initial triage comments
      
      * Added more tests to use create-from-samples APIs
      
      * Small SWIG fix
      
      * remove ci change
      
      * responded to shiyu1994 comments
      
      * responded to StrikerRUS comments
      
      * Fixes from StrikerRUS comments
      0a5c5838
  2. 26 Mar, 2022 1 commit
  3. 23 Mar, 2022 1 commit
    • shiyu1994's avatar
      [CUDA] New CUDA version Part 1 (#4630) · 6b56a90c
      shiyu1994 authored
      
      
      * new cuda framework
      
      * add histogram construction kernel
      
      * before removing multi-gpu
      
      * new cuda framework
      
      * tree learner cuda kernels
      
      * single tree framework ready
      
      * single tree training framework
      
      * remove comments
      
      * boosting with cuda
      
      * optimize for best split find
      
      * data split
      
      * move boosting into cuda
      
      * parallel synchronize best split point
      
      * merge split data kernels
      
      * before code refactor
      
      * use tasks instead of features as units for split finding
      
      * refactor cuda best split finder
      
      * fix configuration error with small leaves in data split
      
      * skip histogram construction of too small leaf
      
      * skip split finding of invalid leaves
      
      stop when no leaf to split
      
      * support row wise with CUDA
      
      * copy data for split by column
      
      * copy data from host to CPU by column for data partition
      
      * add synchronize best splits for one leaf from multiple blocks
      
      * partition dense row data
      
      * fix sync best split from task blocks
      
      * add support for sparse row wise for CUDA
      
      * remove useless code
      
      * add l2 regression objective
      
      * sparse multi value bin enabled for CUDA
      
      * fix cuda ranking objective
      
      * support for number of items <= 2048 per query
      
      * speedup histogram construction by interleaving global memory access
      
      * split optimization
      
      * add cuda tree predictor
      
      * remove comma
      
      * refactor objective and score updater
      
      * before use struct
      
      * use structure for split information
      
      * use structure for leaf splits
      
      * return CUDASplitInfo directly after finding best split
      
      * split with CUDATree directly
      
      * use cuda row data in cuda histogram constructor
      
      * clean src/treelearner/cuda
      
      * gather shared cuda device functions
      
      * put shared CUDA functions into header file
      
      * change smaller leaf from <= back to < for consistent result with CPU
      
      * add tree predictor
      
      * remove useless cuda_tree_predictor
      
      * predict on CUDA with pipeline
      
      * add global sort algorithms
      
      * add global argsort for queries with many items in ranking tasks
      
      * remove limitation of maximum number of items per query in ranking
      
      * add cuda metrics
      
      * fix CUDA AUC
      
      * remove debug code
      
      * add regression metrics
      
      * remove useless file
      
      * don't use mask in shuffle reduce
      
      * add more regression objectives
      
      * fix cuda mape loss
      
      add cuda xentropy loss
      
      * use template for different versions of BitonicArgSortDevice
      
      * add multiclass metrics
      
      * add ndcg metric
      
      * fix cross entropy objectives and metrics
      
      * fix cross entropy and ndcg metrics
      
      * add support for customized objective in CUDA
      
      * complete multiclass ova for CUDA
      
      * separate cuda tree learner
      
      * use shuffle based prefix sum
      
      * clean up cuda_algorithms.hpp
      
      * add copy subset on CUDA
      
      * add bagging for CUDA
      
      * clean up code
      
      * copy gradients from host to device
      
      * support bagging without using subset
      
      * add support of bagging with subset for CUDAColumnData
      
      * add support of bagging with subset for dense CUDARowData
      
      * refactor copy sparse subrow
      
      * use copy subset for column subset
      
      * add reset train data and reset config for CUDA tree learner
      
      add deconstructors for cuda tree learner
      
      * add USE_CUDA ifdef to cuda tree learner files
      
      * check that dataset doesn't contain CUDA tree learner
      
      * remove printf debug information
      
      * use full new cuda tree learner only when using single GPU
      
      * disable all CUDA code when using CPU version
      
      * recover main.cpp
      
      * add cpp files for multi value bins
      
      * update LightGBM.vcxproj
      
      * update LightGBM.vcxproj
      
      fix lint errors
      
      * fix lint errors
      
      * fix lint errors
      
      * update Makevars
      
      fix lint errors
      
      * fix the case with 0 feature and 0 bin
      
      fix split finding for invalid leaves
      
      create cuda column data when loaded from bin file
      
      * fix lint errors
      
      hide GetRowWiseData when cuda is not used
      
      * recover default device type to cpu
      
      * fix na_as_missing case
      
      fix cuda feature meta information
      
      * fix UpdateDataIndexToLeafIndexKernel
      
      * create CUDA trees when needed in CUDADataPartition::UpdateTrainScore
      
      * add refit by tree for cuda tree learner
      
      * fix test_refit in test_engine.py
      
      * create set of large bin partitions in CUDARowData
      
      * add histogram construction for columns with a large number of bins
      
      * add find best split for categorical features on CUDA
      
      * add bitvectors for categorical split
      
      * cuda data partition split for categorical features
      
      * fix split tree with categorical feature
      
      * fix categorical feature splits
      
      * refactor cuda_data_partition.cu with multi-level templates
      
      * refactor CUDABestSplitFinder by grouping task information into struct
      
      * pre-allocate space for vector split_find_tasks_ in CUDABestSplitFinder
      
      * fix misuse of reference
      
      * remove useless changes
      
      * add support for path smoothing
      
      * virtual destructor for LightGBM::Tree
      
      * fix overlapped cat threshold in best split infos
      
      * reset histogram pointers in data partition and spllit finder in ResetConfig
      
      * comment useless parameter
      
      * fix reverse case when na is missing and default bin is zero
      
      * fix mfb_is_na and mfb_is_zero and is_single_feature_column
      
      * remove debug log
      
      * fix cat_l2 when one-hot
      
      fix gradient copy when data subset is used
      
      * switch shared histogram size according to CUDA version
      
      * gpu_use_dp=true when cuda test
      
      * revert modification in config.h
      
      * fix setting of gpu_use_dp=true in .ci/test.sh
      
      * fix linter errors
      
      * fix linter error
      
      remove useless change
      
      * recover main.cpp
      
      * separate cuda_exp and cuda
      
      * fix ci bash scripts
      
      add description for cuda_exp
      
      * add USE_CUDA_EXP flag
      
      * switch off USE_CUDA_EXP
      
      * revert changes in python-packages
      
      * more careful separation for USE_CUDA_EXP
      
      * fix CUDARowData::DivideCUDAFeatureGroups
      
      fix set fields for cuda metadata
      
      * revert config.h
      
      * fix test settings for cuda experimental version
      
      * skip some tests due to unsupported features or differences in implementation details for CUDA Experimental version
      
      * fix lint issue by adding a blank line
      
      * fix lint errors by resorting imports
      
      * fix lint errors by resorting imports
      
      * fix lint errors by resorting imports
      
      * merge cuda.yml and cuda_exp.yml
      
      * update python version in cuda.yml
      
      * remove cuda_exp.yml
      
      * remove unrelated changes
      
      * fix compilation warnings
      
      fix cuda exp ci task name
      
      * recover task
      
      * use multi-level template in histogram construction
      
      check split only in debug mode
      
      * ignore NVCC related lines in parameter_generator.py
      
      * update job name for CUDA tests
      
      * apply review suggestions
      
      * Update .github/workflows/cuda.yml
      Co-authored-by: default avatarNikita Titov <nekit94-08@mail.ru>
      
      * Update .github/workflows/cuda.yml
      Co-authored-by: default avatarNikita Titov <nekit94-08@mail.ru>
      
      * update header
      
      * remove useless TODOs
      
      * remove [TODO(shiyu1994): constrain the split with min_data_in_group] and record in #5062
      
      * #include <LightGBM/utils/log.h> for USE_CUDA_EXP only
      
      * fix include order
      
      * fix include order
      
      * remove extra space
      
      * address review comments
      
      * add warning when cuda_exp is used together with deterministic
      
      * add comment about gpu_use_dp in .ci/test.sh
      
      * revert changing order of included headers
      Co-authored-by: default avatarYu Shi <shiyu1994@qq.com>
      Co-authored-by: default avatarNikita Titov <nekit94-08@mail.ru>
      6b56a90c
  4. 17 Mar, 2021 1 commit
    • ashok-ponnuswami-msft's avatar
      Range check for DCG position discount lookup (#4069) · 4580393f
      ashok-ponnuswami-msft authored
      * Add check to prevent out of index lookup in the position discount table. Add debug logging to report number of queries found in the data.
      
      * Change debug logging location so that we can print the data file name as well.
      
      * Revert "Change debug logging location so that we can print the data file name as well."
      
      This reverts commit 3981b34bd6e0530f89c4733e78e6b6603bf50d48.
      
      * Add data file name to debug logging.
      
      * Move log line to a place where it is output even when query IDs are read from a separate file.
      
      * Also add the out-of-range check to rank metrics.
      
      * Perform check after number of queries is initialized.
      
      * Update
      4580393f
  5. 19 Feb, 2021 1 commit
  6. 30 Sep, 2020 1 commit
    • Guolin Ke's avatar
      fix address alignment, required by cran (#3415) · f30dbe87
      Guolin Ke authored
      * fix dataset binary file alignment
      
      * many fixes
      
      * fix warnings
      
      * fix bug
      
      * Update file_io.cpp
      
      * Update file_io.cpp
      
      * simplify code
      
      * Apply suggestions from code review
      
      * general
      
      * remove unneeded alignment
      
      * Update file_io.h
      
      * int32 to byte8 alignment
      
      * Apply suggestions from code review
      
      * Apply suggestions from code review
      f30dbe87
  7. 05 Jun, 2020 1 commit
  8. 01 Jun, 2020 1 commit
  9. 22 Feb, 2020 1 commit
    • Guolin Ke's avatar
      some code refactoring (#2769) · 3e80df7e
      Guolin Ke authored
      * some refines
      
      * more omp refactoring
      
      * format define
      
      * fix merge bug
      
      * some fixes
      
      * fix some warnings
      
      * Apply suggestions from code review
      
      * Apply suggestions from code review
      
      * remove dup codes
      3e80df7e
  10. 20 Feb, 2020 1 commit
  11. 08 Feb, 2020 1 commit
  12. 14 Jan, 2020 1 commit
  13. 29 Dec, 2019 1 commit
  14. 07 Sep, 2019 1 commit
  15. 13 Apr, 2019 1 commit
  16. 11 Apr, 2019 1 commit
  17. 02 Feb, 2019 1 commit
  18. 11 May, 2018 1 commit
  19. 27 Feb, 2018 1 commit
    • ebernhardson's avatar
      Experimental support for HDFS (#1243) · 7e186a57
      ebernhardson authored
      * Read and write datsets from hdfs.
      * Only enabled when cmake is run with -DUSE_HDFS:BOOL=TRUE
      * Introduces VirtualFile(Reader|Writer) to asbtract VFS differences
      7e186a57
  20. 17 Dec, 2017 1 commit
  21. 18 Aug, 2017 1 commit
  22. 25 Jan, 2017 1 commit
  23. 10 Jan, 2017 2 commits
  24. 08 Jan, 2017 1 commit
    • Guolin Ke's avatar
      R package (#168) · 551d59ca
      Guolin Ke authored
      * finish R's c_api
      
      * clean code
      
      * fix sizeof pointer in 32bit system.
      
      * add predictor class
      
      * add Dataset class
      
      * format code
      
      * add booster
      
      * add type check for expose function
      
      * add a simple callback
      
      * add all callbacks
      
      * finish the basic training logic
      
      * update docs
      
      * add an simple training interface
      
      * add basic test
      
      * adapt the changes in c_api
      
      * add test for Dataset
      
      * add test for custom obj/eval functions
      
      * fix python test
      
      * fix bug in metadata init
      
      * fix R CMD check
      551d59ca
  25. 28 Dec, 2016 1 commit
  26. 18 Dec, 2016 1 commit
  27. 05 Dec, 2016 1 commit
  28. 02 Dec, 2016 1 commit
    • wxchan's avatar
      Squash into one commit: · eba6d200
      wxchan authored
      1. merge python-package
      2. add dump model to json
      3. fix bugs
      4. clean code with pylint
      5. update python examples
      eba6d200
  29. 26 Nov, 2016 2 commits
  30. 24 Nov, 2016 1 commit
  31. 22 Nov, 2016 1 commit
  32. 18 Nov, 2016 1 commit
    • Guolin Ke's avatar
      Refactor for RAII (#86) · 5442ed78
      Guolin Ke authored
      * RAII for utils, application and c_api(partical)
      
      * raii for class in include folder
      
      * raii for application and boosting
      
      * raii for dataset and dataset loader
      
      * raii for dense bin and parser
      
      * RAII refactor for almost all classes
      
      * RAII for c_api
      
      * clean code
      
      * refine repeated code
      
      * Decouple the "sigmoid" between objective and boosting.
      
      * change std::vector<bool> back to std::vector<char> due to concurrence problem
      
      * slight reduce some memory cost
      5442ed78
  33. 05 Nov, 2016 1 commit
  34. 04 Nov, 2016 1 commit
  35. 03 Nov, 2016 2 commits
  36. 02 Nov, 2016 2 commits