- 28 Jan, 2022 1 commit
Frank Lee authored
- 25 Jan, 2022 1 commit
Jiarui Fang authored
* add pytorch hooks (fix #175)
* remove licenses in src code
* add gpu memory tracer
* replace print with logger in ophooks
- 21 Jan, 2022 3 commits
- 20 Jan, 2022 2 commits
- 19 Jan, 2022 6 commits
- 18 Jan, 2022 6 commits
- 17 Jan, 2022 2 commits
- 13 Jan, 2022 1 commit
ver217 authored
- 10 Jan, 2022 2 commits
BoxiangW authored
* Update the documentation of layer integration
* Update _log_hook.py
* Update _operation.py
binmakeswell authored
- 07 Jan, 2022 5 commits
- 06 Jan, 2022 2 commits
Frank Lee authored
* enable CI after PR sync
* fix GitHub Actions workflow
binmakeswell authored
- 05 Jan, 2022 1 commit
Jiarui Fang authored
- 04 Jan, 2022 4 commits
- 30 Dec, 2021 2 commits
ver217 authored
* add pipeline shared module wrapper and update load batch
* added model parallel process group for amp and clip grad (#86)
* update amp and clip with model parallel process group
* remove pipeline_prev/next group (#88)
* micro batch offload
* optimize pipeline gpu memory usage
* pipeline can receive tensor shape (#93)
* fix grad accumulation step counter
* rename classes and functions

Co-authored-by: Frank Lee <somerlee.9@gmail.com>
アマデウス authored
- 29 Dec, 2021 1 commit
アマデウス authored
* optimized 1d layer apis; reorganized nn.layer modules; fixed tests
* fixed 2.5d runtime issue
* reworked split batch, now called in trainer.schedule.load_batch

Co-authored-by: BoxiangW <45734921+BoxiangW@users.noreply.github.com>
- 27 Dec, 2021 1 commit
アマデウス authored
* integrated parallel layers for ease of building models
* integrated 2.5d layers
* cleaned codes and unit tests
* added log metric by step hook; updated imagenet benchmark; fixed some bugs
* reworked initialization; cleaned codes

Co-authored-by: BoxiangW <45734921+BoxiangW@users.noreply.github.com>