- 12 Apr, 2022 2 commits
- 11 Apr, 2022 10 commits
-
-
Jiarui Fang authored
-
Frank Lee authored
-
Jiarui Fang authored
-
HELSON authored
-
ver217 authored
-
LuGY authored
* fixed bugs of assigning grad states to non leaf nodes * use detach()
-
Frank Lee authored
-
HELSON authored
* adapt post grad hooks for not-shard parameters * adapt optimizer for not-shard parameters * offload gradients for not-replicated parameters
-
ver217 authored
* refactor memstats collector * fix disposable * polish code
-
アマデウス authored
-
- 08 Apr, 2022 7 commits
-
-
HELSON authored
-
binmakeswell authored
-
binmakeswell authored
* add PaLM link
-
ver217 authored
* [WIP] stateful tensor manager * add eviction strategy * polish code * polish code * polish comment * add unit test * fix sampler bug * polish code * fix max sampling cnt resetting bug * fix sampler bug * polish code * fix bug * fix unit test Co-authored-by:jiaruifang <fangjiarui123@gmail.com>
-
ver217 authored
-
Frank Lee authored
-
github-actions[bot] authored
Co-authored-by:github-actions <github-actions@github.com>
-
- 07 Apr, 2022 5 commits
-
-
github-actions[bot] authored
Co-authored-by:github-actions <github-actions@github.com>
-
Frank Lee authored
-
Frank Lee authored
-
HELSON authored
* adapt model weight initialization for methods in Pytorch nn.init
-
YuliangLiu0306 authored
* refactor pipeline---put runtime schedule into engine. * add type hint for schedule Optional[BaseSchedule] * preprocess schedule during engine initializing * infer pipeline schedule params from config
-
- 06 Apr, 2022 16 commits
-
-
Frank Lee authored
-
Jiarui Fang authored
-
Frank Lee authored
-
ver217 authored
-
encmps authored
-
lucasliunju authored
-
shenggan authored
-
FredHuang99 authored
-
MaxT authored
-
Xue Fuzhao authored
-
Cautiousss authored
[NFC] polish colossalai/context/process_group_initializer/initializer_sequence.py colossalai/context/process_group_initializer initializer_tensor.py code style (#639) Co-authored-by:何晓昕 <cautious@r-236-100-25-172.comp.nus.edu.sg>
-
Ziheng Qin authored
-
Sze-qq authored
-
Wangbo Zhao authored
-
ExtremeViscent authored
[NFC] polish colossalai/kernel/cuda_native/csrc/kernels/dropout_kernels.cu and cross_entropy.cu code style (#634)
-
RichardoLuo authored
Co-authored-by:RichardoLuo <14049555596@qq.com>
-