- 12 Oct, 2023 1 commit
  - aiss authored
- 25 Jun, 2023 1 commit
  - aiss authored
    Ds v0.9.2 rocm. See merge request dcutoolkit/deeplearing/deepspeed!3
- 14 Jun, 2023 3 commits
- 31 May, 2023 3 commits
- 30 May, 2023 2 commits
- 29 May, 2023 2 commits
- 11 May, 2023 3 commits
- 27 Apr, 2023 2 commits
- 26 Apr, 2023 3 commits
- 30 Mar, 2023 1 commit
  - aiss authored
- 10 Aug, 2022 1 commit
  - aiss authored
- 14 Jun, 2022 1 commit
  - aiss authored
- 11 Jun, 2022 4 commits
  - aiss authored
    Merge branch 'deepspeed-0.6.3-rocm' of http://10.0.100.3/dcutoolkit/deeplearing/deepspeed into deepspeed-0.6.3-rocm (version modify)
  - aiss authored
  - aiss authored
  - aiss authored
- 26 May, 2022 1 commit
  - aiss authored
- 25 May, 2022 1 commit
  - aiss authored
- 02 Apr, 2021 2 commits
  - Ammar Ahmad Awan authored
    Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
  - Jeff Rasley authored
    This test has been giving us trouble for a while, with nondeterministic failures; skipping it for now so it does not break our CI. We need to revisit it soon, though.
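The skip described above is usually done with a pytest marker; a minimal sketch, assuming pytest is the test runner and using a placeholder test name (the log does not say which test was skipped):

```python
import pytest

# Placeholder name: the log does not identify the skipped test.
@pytest.mark.skip(reason="nondeterministic failures; skipping to keep CI green, revisit soon")
def test_flaky_case():
    ...
```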
- 01 Apr, 2021 1 commit
  - Stas Bekman authored
    * zero.Init() clarification: clarify that if `model.half()` cannot fit into GPU memory, `zero.Init()` is a must. This proposal is via @samyam's clarification shared elsewhere. Thank you.
    * style
    * add clarity
    * style
    Co-authored-by: Olatunji Ruwase <olruwase@microsoft.com>
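As a rough illustration of the point above, a minimal sketch of constructing a model under `deepspeed.zero.Init()` when a half-precision copy would not fit on one GPU. `MyLargeModel` is a placeholder, and a distributed launch plus a ZeRO stage 3 config passed later to `deepspeed.initialize` are assumed:

```python
import deepspeed
import torch.nn as nn

class MyLargeModel(nn.Module):
    # Placeholder for a model too large for model.half() on a single GPU.
    def __init__(self):
        super().__init__()
        self.layers = nn.Sequential(*[nn.Linear(8192, 8192) for _ in range(48)])

# Parameters are partitioned across ranks as they are created, instead of being
# materialized whole on one device (assumes a distributed launch, e.g. via the
# deepspeed launcher, with ZeRO stage 3 in the DeepSpeed config).
with deepspeed.zero.Init():
    model = MyLargeModel()
```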
- 31 Mar, 2021 3 commits
  - Jeff Rasley authored
  - Jeff Rasley authored
  - Jeff Rasley authored
- 30 Mar, 2021 3 commits
  - dependabot[bot] authored
    Bumps [kramdown](https://github.com/gettalong/kramdown) from 2.3.0 to 2.3.1.
    - [Release notes](https://github.com/gettalong/kramdown/releases)
    - [Changelog](https://github.com/gettalong/kramdown/blob/master/doc/news.page)
    - [Commits](https://github.com/gettalong/kramdown/commits)
    Signed-off-by: dependabot[bot] <support@github.com>
    Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
    Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
  - Jeff Rasley authored
  - Jeff Rasley authored
    Security alert related to the older kramdown version.
- 27 Mar, 2021 2 commits
  - hamlet authored
    * Fix ZeRO stage 2 cpu_offload when some trainable model parameters are skipped during training, as in https://github.com/microsoft/DeepSpeed/issues/707. Because those parameters are skipped, their backward hooks from self.create_reduce_and_remove_grad_hooks() never run, so they have no norm_for_param_grads (see the sketch after this list).
    * Trim space
    * Trim space
    Co-authored-by: Olatunji Ruwase <olruwase@microsoft.com>
  - Stas Bekman authored
    Co-authored-by: Olatunji Ruwase <olruwase@microsoft.com>
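For the ZeRO stage 2 cpu_offload fix above, a minimal sketch of the kind of guard the commit message describes. This is a standalone illustration, not the actual DeepSpeed code; the function and argument names are made up:

```python
import math

def total_grad_norm(params, norm_for_param_grads, get_param_id):
    """Combine per-parameter gradient norms into an overall L2 norm.

    Parameters whose backward hook never ran this step have no entry in
    norm_for_param_grads, so the lookup is guarded rather than assumed.
    """
    total_sq = 0.0
    for p in params:
        param_id = get_param_id(p)
        if param_id in norm_for_param_grads:  # skip params with no gradient this step
            total_sq += norm_for_param_grads[param_id] ** 2
    return math.sqrt(total_sq)
```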