- 06 Jan, 2023 1 commit
-
-
binmakeswell authored
-
- 05 Jan, 2023 10 commits
-
-
Haofan Wang authored
-
ZijianYY authored
-
YuliangLiu0306 authored
* [device] alpha beta profiler * add usage * fix variable name
-
Frank Lee authored
* [setup] make cuda extension build optional * polish code * polish code * polish code
-
Frank Lee authored
-
Fazzie-Maqianli authored
-
Frank Lee authored
-
Frank Lee authored
-
Frank Lee authored
* [workflow] added workflow to release to pypi upon version change * polish code * polish code * polish code
-
Frank Lee authored
-
- 04 Jan, 2023 20 commits
-
-
binmakeswell authored
* [doc] update link
-
Frank Lee authored
-
Jiarui Fang authored
-
Fazzie-Maqianli authored
-
Sze-qq authored
Co-authored-by: siqi <siqi@siqis-MacBook-Pro.local>
-
Junming Wu authored
-
Ofey Chan authored
[NFC] polish colossalai/auto_parallel/tensor_shard/deprecated/op_handler/layer_norm_handler.py code style (#2305)
-
ver217 authored
-
xyupeng authored
-
Zangwei Zheng authored
-
shenggan authored
[NFC] polish colossalai/auto_parallel/tensor_shard/deprecated/op_handler/reshape_handler.py code style (#2292)
-
Ziheng Qin authored
Co-authored-by: henryqin1997 <henryqin1997@gamil.com>
-
アマデウス authored
-
Zirui Zhu authored
[NFC] polish colossalai/auto_parallel/tensor_shard/node_handler/getitem_handler.py code style (#2289)
-
Zihao authored
* add meta_data_computing
* add checkpoint_annotation
* rename proxy.data to proxy.meta_data and add bias addition pass
* polish code
* delete meta_prop_pass invoke and rename ori_node to orig_node
* add TracerType
* unify meta data computing
* delete TracerType
* handle setitem operation
* operator.setitem
-
Jiarui Fang authored
-
HELSON authored
* [amp] add gradient clipping in unit tests * fix bugs
-
HELSON authored
-
Frank Lee authored
-
Boyuan Yao authored
[autockpt] provide option for activation checkpoint search in SPMD solver
-
- 03 Jan, 2023 9 commits
-
-
binmakeswell authored
-
binmakeswell authored
-
binmakeswell authored
-
Jiarui Fang authored
-
Boyuan Yao authored
* [autoparallel] align the data_ptr with the old version of auto activation checkpoint pipeline
* [autoparallel] using fwd_time and bwd_time instead of fwd_flop and bwd_flop
* [autoparallel] specify comm nodes' memory cost in construct chain
* [autoparallel] fix wrong runtime apply calculation
* [autoparallel] fix wrong runtime apply calculation
* [autoparallel] fix wrong runtime apply calculation
* [autoparallel] bypass metainfo when available and modify BCAST_FUNC_OP
-
Jiarui Fang authored
-
Super Daniel authored
* [autockpt] make it work.
* [autockpt] linearize / merge shape-consistency nodes.
* [autockpt] considering parameter and optimizer weights.
* [hotfix] pass a parameter.
-
ZijianYY authored
-
zbian authored
-