- 21 May, 2023 1 commit
-
-
Arturo Ghinassi authored
Added installation tutorial in the documentation
-
- 19 May, 2023 6 commits
-
-
Rick Ho authored
Cast input to weights type for AMP support
-
Arturo Ghinassi authored
-
Rick Ho authored
Revert "convert input to same type as weight for mixed precision training"
-
Rick Ho authored
-
Rick Ho authored
convert input to same type as weight for mixed precision training
-
Arturo Ghinassi authored
AMP support
-
- 18 May, 2023 1 commit
-
-
Arturo Ghinassi authored
When using CUDA AMP FMoE Linear throws type error as input is half() and weights are float()
-
- 21 Mar, 2023 3 commits
- 20 Mar, 2023 1 commit
-
-
zms1999 authored
-
- 13 Feb, 2023 2 commits
-
-
Rick Ho authored
remove synchronize
-
Fragile-azalea authored
-
- 28 Dec, 2022 3 commits
- 14 Dec, 2022 3 commits
- 14 Nov, 2022 2 commits
- 30 Sep, 2022 2 commits
- 19 Sep, 2022 1 commit
-
-
Rick Ho authored
Document for examples
-
- 16 Sep, 2022 1 commit
-
-
Rick Ho authored
-
- 04 Aug, 2022 2 commits
-
-
Rick Ho authored
Fix GshardGate top1_idx
-
Fragile-azalea authored
-
- 23 Jul, 2022 2 commits
- 18 Jul, 2022 2 commits
- 12 Jun, 2022 1 commit
-
-
Rick Ho authored
Fix Broadcast rank bug in DGDP
-
- 10 Jun, 2022 1 commit
-
-
Rick Ho authored
-
- 01 Jun, 2022 4 commits
- 27 May, 2022 2 commits