- 14 Oct, 2022 6 commits
-
-
Frank Lee authored
-
HELSON authored
* fixes memory leak when parameter is in fp16 in ZeroDDP init. * bans chunk release in CUDA. A chunk may be released only when it is about to be offloaded. * adds a constant placement policy. With it, users can allocate a reserved caching memory space for parameters.
-
binmakeswell authored
-
Frank Lee authored
* [autoparallel] refactored the autoparallel module for organization * polish code
-
Frank Lee authored
-
YuliangLiu0306 authored
* [autoparallel] adapt runtime passes v2 * polish code
-
- 13 Oct, 2022 8 commits
-
-
Jiarui Fang authored
-
Frank Lee authored
polish code
-
Frank Lee authored
* [autoparallel] collated all deprecated files * polish code
-
Frank Lee authored
-
YuliangLiu0306 authored
* [autoparallel] adapt solver and CostGraph with new handler * fix test issue
-
YuliangLiu0306 authored
* [autoparallel] add output handler and placeholder handler * Delete test_solver_with_resnet.py * fix test bugs
-
YuliangLiu0306 authored
* [autoparallel] add pooling handler * polish code
-
YuliangLiu0306 authored
* where generator * [autoparallel] where_handler_v2
-
- 12 Oct, 2022 3 commits
-
-
Boyuan Yao authored
-
Jiarui Fang authored
-
Frank Lee authored
-
- 11 Oct, 2022 3 commits
-
-
YuliangLiu0306 authored
-
YuliangLiu0306 authored
-
Super Daniel authored
* [fx/profiler] modify data_ptr into uuid for all tensors. * [fx] modify uuid. * [fx/profiler] tune performance on GPT-2. * [fx] updates. * [fx] debug. * [fx] debug. * [fx] cuda.
-
- 10 Oct, 2022 2 commits
-
-
Kirigaya Kazuto authored
[pipeline/fix-bug] num_microbatches support any integer | stable chimera | launch tool for rpc pp framework (#1684) * [pipeline/tuning] improve dispatch performance both time and space cost * [pipeline/converge] add interface for testing convergence * [NFC] polish colossalai/utils/multi_tensor_apply/multi_tensor_apply.py code style * Update PipelineBase.py * [pipeline/chimera] reconstruct PipelineBase and Worker to support more feasible custom schedule | finish Chimera * [pipeline/chimera] test chimera | fix bug of initializing * [pipeline/pytree] add pytree to process args and kwargs | provide `data_process_func` to process args and kwargs after forward * [pipeline/fix-bug] num_microbatches support any integer | stable chimera | launch tool for rpc pp framework
-
jim authored
-
- 09 Oct, 2022 6 commits
-
-
Kirigaya Kazuto authored
[pipeline/rank_recorder] fix bug when processing data before backward | add a tool for multiple ranks debug (#1681) * [pipeline/tuning] improve dispatch performance both time and space cost * [pipeline/converge] add interface for testing convergence * [NFC] polish colossalai/utils/multi_tensor_apply/multi_tensor_apply.py code style * Update PipelineBase.py * [pipeline/chimera] reconstruct PipelineBase and Worker to support more feasible custom schedule | finish Chimera * [pipeline/chimera] test chimera | fix bug of initializing * [pipeline/pytree] add pytree to process args and kwargs | provide `data_process_func` to process args and kwargs after forward
-
YuliangLiu0306 authored
-
YuliangLiu0306 authored
* [autoparallel] add following node generator * polish code * polish code * update name of arguments
-
YuliangLiu0306 authored
* [autoparallel] add layer norm handler v2 * polish code * polish code
-
Fazzie-Maqianli authored
-
HELSON authored
-
- 06 Oct, 2022 1 commit
-
-
Boyuan Yao authored
* [autoparallel] fix insecure subprocess * [fx] fix insecure subprocess
-
- 05 Oct, 2022 1 commit
-
-
Boyuan Yao authored
-
- 04 Oct, 2022 1 commit
-
-
Boyuan Yao authored
* [fx] concreteinfoprop * [fx] add concreteinfoprop * [fx] modify docstring of ConcreteInfoProp * [fx] fix device error * [fx] modify parameter calculation * [fx] modify parameters calculation
-
- 03 Oct, 2022 1 commit
-
-
Boyuan Yao authored
* [autoparallel] add rotor c version * [fx] remove metainfoprop in rotor solver * [autoparallel] modify C code format * [autoparallel] remove build.py * [autoparallel] fix C extension build * [autoparallel] add C solver consistency test * [autoparallel] remove some unused imports * [autoparallel] refactor rotor solver code * [autoparallel] replace print with colossalai logger * [autoparallel] ranks fixed
-
- 29 Sep, 2022 7 commits
-
-
YuliangLiu0306 authored
-
Frank Lee authored
* [autoparallel] added utils for broadcast operation * polish code
-
YuliangLiu0306 authored
-
Frank Lee authored
-
YuliangLiu0306 authored
-
Kirigaya Kazuto authored
[pipeline/pytree] add pytree to process args and kwargs | provide `data_process_func` to process args and kwargs after forward (#1642) * [pipeline/tuning] improve dispatch performance both time and space cost * [pipeline/converge] add interface for testing convergence * [NFC] polish colossalai/utils/multi_tensor_apply/multi_tensor_apply.py code style * Update PipelineBase.py * [pipeline/chimera] reconstruct PipelineBase and Worker to support more feasible custom schedule | finish Chimera * [pipeline/chimera] test chimera | fix bug of initializing * [pipeline/pytree] add pytree to process args and kwargs | provide `data_process_func` to process args and kwargs after forward
-
YuliangLiu0306 authored
* [autoparallel] remove no strategy nodes * fix none object iteration issue
-
- 28 Sep, 2022 1 commit
-
-
Frank Lee authored
-