Sangkug Lym authored
remove redundant linear layer class definition
add fuse_gradient_accumulation attribute to weights for simple targeting
reflect feedback and clean up the code
arg change
83b1e42f