- 17 May, 2022 24 commits
-
-
ziyu huang authored
Co-authored-by:“Arsmart123 <202476410arsmart@gmail.com>
-
superhao1995 authored
[NFC] polish colossalai/kernel/cuda_native/csrc/scaled_upper_triang_masked_softmax.cpp code style (#959)
-
MaxT authored
-
runluo authored
-
doubleHU authored
[NFC] polish colossalai/kernel/cuda_native/csrc/kernels/include/cross_entropy_layer.h code style (#957)
-
RichardoLuo authored
Co-authored-by:RichardoLuo <14049555596@qq.com>
-
Wangbo Zhao(黑色枷锁) authored
-
Luxios22 authored
-
Cautiousss authored
Co-authored-by:何晓昕 <cautious@hexiaoxins-MacBook-Pro.local>
-
Sze-qq authored
-
xyupeng authored
-
JT.Han authored
Co-authored-by:Jiatong <jiatong.han@u.nus.edu>
-
luoling-LC authored
Co-authored-by:jnbai <897086360@qq.com>
-
bajiaoyu517 authored
-
wky authored
-
HaoyuQin authored
[NFC] polish pre-commit run --files colossalai/kernel/cuda_native/csrc/scaled_upper_triang_masked_softmax_cuda.cu code style (#943)
-
XYE authored
Co-authored-by:Xiao Ye <xiaoye2@illinois.edu>
-
Maruyama_Aya authored
-
Geng Zhang authored
-
yuxuan-lou authored
-
BoxiangW authored
-
binmakeswell authored
* [NFC] Polish colossalai/kernel/cuda_native/csrc/multi_tensor_lamb.cu code style. (#937) * [NFC] polish colossalai/kernel/cuda_native/csrc/kernels/include/cuda_util.h code style (#939) * [NFC] polish colossalai/kernel/cuda_native/csrc/cpu_adam.cpp code style (#936) * [NFC] polish colossalai/kernel/cuda_native/csrc/kernels/include/block_reduce.h code style (#938) * [NFC] polish moe_cuda_kernel.cu code style (#940) Co-authored-by:
Xiao Ye <xiaoye2@illinois.edu> * [NFC] polish pre-commit run --files colossalai/kernel/cuda_native/csrc/scaled_upper_triang_masked_softmax_cuda.cu code style (#943) * [NFC] polish colossalai/kernel/cuda_native/csrc/moe_cuda.cpp code style (#942) * [NFC] polish colossalai/kernel/cuda_native/csrc/cpu_adam.h code style (#945) * [NFC] polish colossalai/kernel/jit/bias_gelu.py code style (#946) Co-authored-by:
jnbai <897086360@qq.com> * [NFC] polish colossalai/kernel/cuda_native/csrc/scaled_masked_softmax_cuda.cu code style (#949) Co-authored-by:
Jiatong <jiatong.han@u.nus.edu> * [NFC] polish colossalai/builder/pipeline.py code style (#951) * [NFC] polish colossalai/kernel/cuda_native/csrc/multihead_attention_1d.cpp code style (#952) * [NFC] polish colossalai/kernel/cuda_native/csrc/kernels/cross_entropy.cu code style (#953) Co-authored-by:
何晓昕 <cautious@hexiaoxins-MacBook-Pro.local> * [NFC] polish colossalai/kernel/cuda_native/csrc/kernels/softmax_kernels.cu code style (#954) * [NFC] polish colossalai/kernel/cuda_native/scaled_softmax.py code style (#955) * [NFC] polish colossalai/kernel/cuda_native/csrc/kernels/include/context.h code style (#956) Co-authored-by:
RichardoLuo <14049555596@qq.com> * [NFC] polish colossalai/kernel/cuda_native/csrc/kernels/include/cross_entropy_layer.h code style (#957) * [NFC] polish colossalai/kernel/cuda_native/csrc/multi_tensor_l2norm_kernel.cu code style (#958) * [NFC] polish colossalai/kernel/cuda_native/csrc/multihead_attention_1d.h code style (#962) * [NFC] polish colossalai/kernel/cuda_native/csrc/scaled_upper_triang_masked_softmax.cpp code style (#959) * [NFC] polish colossalai/kernel/cuda_native/csrc/kernels/general_kernels.cu code style (#963) Co-authored-by:
“Arsmart123 <202476410arsmart@gmail.com> * [NFC] polish colossalai/kernel/cuda_native/csrc/kernels/include/softmax.h code style (#964) * [NFC] polish __init__.py code style (#965) * [NFC] polish colossalai/nn/layer/parallel_3d/layers.py code style (#966) * [NFC] polish colossalai/kernel/cuda_native/csrc/kernels/include/feed_forward.h (#968) code style * [NFC] polish colossalai/kernel/cuda_native/csrc/kernels/include/dropout.h code style (#970) * [NFC] polish colossalai/nn/layer/parallel_2p5d/layers.py code style (#972) * [NFC] polish colossalai/kernel/cuda_native/csrc/layer_norm_cuda.cpp code style (#973) * [NFC] polish colossalai/kernel/cuda_native/csrc/kernels/normalize_kernels.cu code style (#974) * [NFC] polish colossalai/kernel/cuda_native/csrc/multi_tensor_scale_kernel.cu code style (#977) * [NFC] polish colossalai/nn/layer/parallel_2d/layers.py code style (#976) * [NFC] polish colossalai/kernel/cuda_native/csrc/multi_tensor_sgd_kernel.cu code style (#978) * [NFC] polish colossalai/kernel/cuda_native/csrc/kernels/dropout_kernels.cu code style (#979) * [NFC] polish colossalai/kernel/cuda_native/layer_norm.py code style (#980) * [NFC] polish colossalai/nn/layer/utils/common.py code style (#983) Co-authored-by:
BoxiangW <45734921+BoxiangW@users.noreply.github.com> Co-authored-by:
yuxuan-lou <83441848+yuxuan-lou@users.noreply.github.com> Co-authored-by:
Geng Zhang <34452939+zxgx@users.noreply.github.com> Co-authored-by:
Maruyama_Aya <38985202+MaruyamaAya@users.noreply.github.com> Co-authored-by:
XYE <92607131+Itok2000u@users.noreply.github.com> Co-authored-by:
Xiao Ye <xiaoye2@illinois.edu> Co-authored-by:
HaoyuQin <79465534+coder-chin@users.noreply.github.com> Co-authored-by:
wky <64853922+wangkuangyi@users.noreply.github.com> Co-authored-by:
bajiaoyu517 <59548007+bajiaoyu517@users.noreply.github.com> Co-authored-by:
luoling-LC <105470086+luoling-LC@users.noreply.github.com> Co-authored-by:
jnbai <897086360@qq.com> Co-authored-by:
JT.Han <59948448+JThh@users.noreply.github.com> Co-authored-by:
Jiatong <jiatong.han@u.nus.edu> Co-authored-by:
xyupeng <99191637+xyupeng@users.noreply.github.com> Co-authored-by:
Sze-qq <68757353+Sze-qq@users.noreply.github.com> Co-authored-by:
Cautiousss <48676630+Cautiousss@users.noreply.github.com> Co-authored-by:
何晓昕 <cautious@hexiaoxins-MacBook-Pro.local> Co-authored-by:
Luxios22 <67457897+Luxios22@users.noreply.github.com> Co-authored-by:
Wangbo Zhao(黑色枷锁) <56866854+wangbo-zhao@users.noreply.github.com> Co-authored-by:
RichardoLuo <50363844+RichardoLuo@users.noreply.github.com> Co-authored-by:
RichardoLuo <14049555596@qq.com> Co-authored-by:
doubleHU <98150031+huxin711@users.noreply.github.com> Co-authored-by:
runluo <68489000+run-qiao@users.noreply.github.com> Co-authored-by:
MaxT <854721132@qq.com> Co-authored-by:
superhao1995 <804673818@qq.com> Co-authored-by:
ziyu huang <huang0ziyu@gmail.com> Co-authored-by:
“Arsmart123 <202476410arsmart@gmail.com> Co-authored-by:
Yuer867 <62204893+Yuer867@users.noreply.github.com> Co-authored-by:
lucasliunju <lucasliunju@gmail.com> Co-authored-by:
LuGY <74758262+Gy-Lu@users.noreply.github.com> Co-authored-by:
ExtremeViscent <zhangyiqi55732@sina.com> Co-authored-by:
Xu Kai <xukai16@foxmail.com> Co-authored-by:
Zirui Zhu <zhuzr21@gmail.com> Co-authored-by:
Ofey Chan <ofey206@gmail.com> Co-authored-by:
DouJS <dujiangsu@163.com> Co-authored-by:
Jie Zhu <chore.08-protist@icloud.com> Co-authored-by:
shenggan <csg19971016@gmail.com> Co-authored-by:
Kai Wang (Victor Kai) <37533040+kaiwang960112@users.noreply.github.com> Co-authored-by:
puck_WCR <46049915+WANG-CR@users.noreply.github.com> Co-authored-by:
Ziheng Qin <37519855+henryqin1997@users.noreply.github.com>
-
ver217 authored
-
- 16 May, 2022 2 commits
-
-
binmakeswell authored
-
ver217 authored
* derive compute pattern from dist spec * polish code
-
- 14 May, 2022 1 commit
-
-
github-actions[bot] authored
Co-authored-by:github-actions <github-actions@github.com>
-
- 13 May, 2022 4 commits
-
-
Ziyue Jiang authored
-
ver217 authored
* add dist spec * update linear op * polish code * polish code * update embedding op * polish unit tests * polish unit tests * polish comments * polish code * add test_dist_spec_mgr * polish code * refactor folder structure * polish unit tests * add get_process_group() for TensorSpec * polish code
-
Ziyue Jiang authored
* add optimizer to bert test * polish
-
github-actions[bot] authored
Co-authored-by:github-actions <github-actions@github.com>
-
- 11 May, 2022 4 commits
-
-
Ziyue Jiang authored
* change torch.Parameter to ColoParameter * fix post assignment for init context * polish * polish
-
Ziyue Jiang authored
* simplify ColoModulize * simplify ColoModulize * polish * polish
-
YuliangLiu0306 authored
* [CLI] add CLI launcher * Revert "[CLI] add CLI launcher" This reverts commit df7e6506d4500af6a9220ef7fe4d3c7b1daebd4c. * [pipelinable]use pipelinable to support GPT model. * fix a bug caused by ShardedModel * polish * fix front func list
-
github-actions[bot] authored
Co-authored-by:github-actions <github-actions@github.com>
-
- 10 May, 2022 1 commit
-
-
ver217 authored
* colo tensor overrides mul * polish code
-
- 09 May, 2022 4 commits
-
-
ver217 authored
* hijack addmm for colo tensor * fix bugs * polish unit test * polish comments
-
Jiarui Fang authored
-
Ziyue Jiang authored
* add from_pretrained support and test * polish * polish * polish * polish
-
ver217 authored
* support more cuda archs * polish code
-