- 06 Apr, 2023 1 commit
-
-
Frank Lee authored
* [test] added spawn decorator
* polish code
* polish code
* polish code
* polish code
* polish code
* polish code
-
- 04 Apr, 2023 1 commit
-
-
ver217 authored
* [zero] refactor low-level zero folder structure
* [zero] fix legacy zero import path
* [zero] fix legacy zero import path
* [zero] remove useless import
* [zero] refactor gemini folder structure
* [zero] refactor gemini folder structure
* [zero] refactor legacy zero import path
* [zero] refactor gemini folder structure
* [zero] refactor gemini folder structure
* [zero] refactor gemini folder structure
* [zero] refactor legacy zero import path
* [zero] fix test import path
* [zero] fix test
* [zero] fix circular import
* [zero] update import
-
- 28 Jan, 2023 1 commit
-
-
HELSON authored
* [zero] add strict ddp mode for chunk init
* [gemini] update gpt example
-
- 20 Jan, 2023 1 commit
-
-
HELSON authored
* [zero] add strict ddp mode
* [polish] add comments for strict ddp mode
* [zero] fix test error
-
- 09 Jan, 2023 1 commit
-
-
HELSON authored
* [gemini] polish code
* [testing] remove code
* [gemini] make more robust
-
- 24 Nov, 2022 1 commit
-
-
Jiarui Fang authored
-
- 16 Nov, 2022 1 commit
-
-
Jiarui Fang authored
-
- 14 Nov, 2022 1 commit
-
-
Jiarui Fang authored
-
- 18 Oct, 2022 1 commit
-
-
HELSON authored
* add chunk manager init function
* fix unit tests
* add comment
* add flush=True
-
- 09 Oct, 2022 1 commit
-
-
HELSON authored
-
- 26 Sep, 2022 1 commit
-
-
Jiarui Fang authored
This reverts commit 5be118f4.
-
- 24 Sep, 2022 1 commit
-
-
HELSON authored
-
- 25 Jul, 2022 1 commit
-
-
HELSON authored
-
- 19 Jul, 2022 1 commit
-
-
HELSON authored
-
- 18 Jul, 2022 1 commit
-
-
ver217 authored
* process group supports getting ranks in group
* chunk mgr receives a process group
* update unit test
* fix unit tests
-
- 15 Jul, 2022 1 commit
-
-
HELSON authored
-
- 11 Jul, 2022 1 commit
-
-
Jiarui Fang authored
-
- 06 Jul, 2022 1 commit
-
-
Jiarui Fang authored
-
- 04 Jul, 2022 1 commit
-
-
Jiarui Fang authored
-
- 29 Jun, 2022 1 commit
-
-
Jiarui Fang authored
-
- 24 Jun, 2022 1 commit
-
-
Jiarui Fang authored
-
- 23 Jun, 2022 1 commit
-
-
Jiarui Fang authored
-
- 21 Jun, 2022 1 commit
-
-
ver217 authored
* ColoDDP supports overwriting default process group
* rename ColoDDPV2 to ZeroDDP
* add docstr for ZeroDDP
* polish docstr
-
- 17 Jun, 2022 1 commit
-
-
ver217 authored
* fix param op hook
* update zero tp test
* fix bugs
-
- 15 Jun, 2022 1 commit
-
-
ver217 authored
* update gemini mgr
* update chunk
* add docstr
* polish placement policy
* update test chunk
* update test zero
* polish unit test
* remove useless unit test
-
- 10 Jun, 2022 1 commit
-
-
ver217 authored
* add placement policy
* add gemini mgr
* update mem stats collector
* update zero
* update zero optim
* fix bugs
* zero optim monitor os
* polish unit test
* polish unit test
* add assert
-
- 06 Jun, 2022 1 commit
-
-
Jiarui Fang authored
-
- 02 Jun, 2022 1 commit
-
-
ver217 authored
* add zero optimizer
* torch ok
* unit test ok
* polish code
* fix bugs
* polish unit test
* polish zero optim
* polish colo ddp v2
* refactor folder structure
* add comment
* polish unit test
* polish zero optim
* polish unit test
-
- 31 May, 2022 1 commit
-
-
ver217 authored
* impl chunk manager
* impl param op hook
* add reduce_chunk
* add zero hook v2
* add zero dp
* fix TensorInfo
* impl load balancing when using zero without chunk
* fix zero hook
* polish chunk
* fix bugs
* ddp ok
* zero ok
* polish code
* fix bugs about load balancing
* polish code
* polish code
* add end-to-end test
* polish code
* polish code
* polish code
* fix typo
* add test_chunk
* fix bugs
* fix bugs
* polish code
-
- 21 May, 2022 1 commit
-
-
ver217 authored
* impl ColoDDP for ColoTensor
* polish code
-
- 20 May, 2022 1 commit
-
-
ver217 authored
* refactor parallel action
* polish unit tests
-
- 19 May, 2022 2 commits
-
-
ver217 authored
* polish test_gpt
* update op unit tests
* update test model
-
ver217 authored
* refactor colo-tensor and update linear op
* polish code
* polish code
* update ops and unit tests
* update unit tests
* polish code
* rename dist_spec module
* polish code
* polish code
* remove unneeded import
* fix pipelinable
-