- 18 Oct, 2022 5 commits
-
-
HELSON authored
* add chunk manager init function * fix unit tests * add comment * add flush=True
-
HELSON authored
* update pre-commit * update pre-commit
-
Jiarui Fang authored
-
YuliangLiu0306 authored
-
Super Daniel authored
[fx/meta/rpc] move _meta_registration.py to fx folder / register fx functions with compatibility checks / remove color debug (#1710)
* [fx] move meta registration
* [fx] fix tests.
* [fx] fix test.
* [fx] fix.
* [meta] refactor meta registration.py.
* [fx] add compatibility descriptions.
* [fx] polish import.
* [fx] add a decorator.
* [fx] fix tests.
* [fx] remove print.
* [fx] edit raise error.
* [fx] edit raise error.
* [fx] add type hint.
* [fx] fix import in experimental.
* [rpc] remove color debug.
* [meta] fix naming.
-
- 17 Oct, 2022 2 commits
-
-
Frank Lee authored
-
YuliangLiu0306 authored
* [autoparallel] resnet block runtime apply * separate buffer and parameter in MemoryCost * polish code * add comments and todos * fix test issue
-
- 16 Oct, 2022 1 commit
-
-
Jiarui Fang authored
-
- 14 Oct, 2022 6 commits
-
-
Frank Lee authored
-
HELSON authored
* fixes memory leak when parameter is in fp16 in ZeroDDP init. * bans chunk release in CUDA. A chunk may be released only when it is about to be offloaded. * adds a constant placement policy. With it, users can allocate a reserved caching memory space for parameters.
-
binmakeswell authored
-
Frank Lee authored
* [autoparallel] refactored the autoparallel module for organization * polish code
-
Frank Lee authored
-
YuliangLiu0306 authored
* [autoparallel] adapt runtime passes v2 * polish code
-
- 13 Oct, 2022 8 commits
-
-
Jiarui Fang authored
-
Frank Lee authored
polish code
-
Frank Lee authored
* [autoparallel] collated all deprecated files * polish code
-
Frank Lee authored
-
YuliangLiu0306 authored
* [autoparallel] adapt solver and CostGraph with new handler * fix test issue
-
YuliangLiu0306 authored
* [autoparallel] add output handler and placeholder handler * Delete test_solver_with_resnet.py * fix test bugs
-
YuliangLiu0306 authored
* [autoparallel] add pooling handler * polish code
-
YuliangLiu0306 authored
* where generator * [autoparallel] where_handler_v2
-
- 12 Oct, 2022 3 commits
-
-
Boyuan Yao authored
-
Jiarui Fang authored
-
Frank Lee authored
-
- 11 Oct, 2022 3 commits
-
-
YuliangLiu0306 authored
-
YuliangLiu0306 authored
-
Super Daniel authored
* [fx/profiler] modify data_ptr into uuid for all tensors. * [fx] modify uuid. * [fx/profiler] tune performance on GPT-2. * [fx] updates. * [fx] debug. * [fx] debug. * [fx] cuda.
-
- 10 Oct, 2022 2 commits
-
-
Kirigaya Kazuto authored
[pipeline/fix-bug] num_microbatches support any integer | stable chimera | launch tool for rpc pp framework (#1684)
* [pipeline/tuning] improve dispatch performance both time and space cost
* [pipeline/converge] add interface for testing convergence
* [NFC] polish colossalai/utils/multi_tensor_apply/multi_tensor_apply.py code style
* Update PipelineBase.py
* [pipeline/chimera] reconstruct PipelineBase and Worker to support more feasible custom schedule | finish Chimera
* [pipeline/chimera] test chimera | fix bug of initializing
* [pipeline/pytree] add pytree to process args and kwargs | provide to process args and kwargs after forward
* [pipeline/fix-bug] num_microbatches support any integer | stable chimera | launch tool for rpc pp framework
-
jim authored
-
- 09 Oct, 2022 6 commits
-
-
Kirigaya Kazuto authored
[pipeline/rank_recorder] fix bug when processing data before backward | add a tool for multiple ranks debug (#1681)
* [pipeline/tuning] improve dispatch performance both time and space cost
* [pipeline/converge] add interface for testing convergence
* [NFC] polish colossalai/utils/multi_tensor_apply/multi_tensor_apply.py code style
* Update PipelineBase.py
* [pipeline/chimera] reconstruct PipelineBase and Worker to support more feasible custom schedule | finish Chimera
* [pipeline/chimera] test chimera | fix bug of initializing
* [pipeline/pytree] add pytree to process args and kwargs | provide to process args and kwargs after forward
-
YuliangLiu0306 authored
-
YuliangLiu0306 authored
* [autoparallel] add following node generator * polish code * polish code * update name of arguments
-
YuliangLiu0306 authored
* [autoparallel] add layer norm handler v2 * polish code * polish code
-
Fazzie-Maqianli authored
-
HELSON authored
-
- 06 Oct, 2022 1 commit
-
-
Boyuan Yao authored
* [autoparallel] fix insecure subprocess * [fx] fix insecure subprocess
-
- 05 Oct, 2022 1 commit
-
-
Boyuan Yao authored
-
- 04 Oct, 2022 1 commit
-
-
Boyuan Yao authored
* [fx] concreteinfoprop * [fx] add concreteinfoprop * [fx] modify docstring of ConcreteInfoProp * [fx] fix device error * [fx] modify parameter calculation * [fx] modify parameters calculation
-
- 03 Oct, 2022 1 commit
-
-
Boyuan Yao authored
* [autoparallel] add rotor c version
* [fx] remove metainfoprop in rotor solver
* [autoparallel] modify C code format
* [autoparallel] remove build.py
* [autoparallel] fix C extension build
* [autoparallel] add C solver consistency test
* [autoparallel] remove some unused imports
* [autoparallel] refactor rotor solver code
* [autoparallel] replace print with colossalai logger
* [autoparallel] ranks fixed
-